Artificial Intelligence thread

chlosy · Feb 25, 2025

mossen said:
Alibaba released their reasoning model for Qwen.

Please, Log in or Register to view URLs content!

Gave it a spin against Deepseek's R1. Used thinking and web search for both models.

Just a simple vibe question. Told them a specific sum for 2018 and then asked to adjust for inflation "today". I purposefully did not give a date to see how well they would approximate today being Feb of 2025.

DeepSeek failed miserably, gave me Oct 2023. The answer was obviously incorrect as a result of that.

By contrast, Qwen gave me the correct date (Feb 2025) but also approximated inflation adjustment as far as it could. Its final answer was very close to what I calculated. Moreover, DeepSeek was very slow whereas Qwen was much faster (but still not as fast as it should be). I really like DeepSeek but clearly it should not rest on its laurels. The other Chinese labs are catching up fast.

Alibaba Qwen logo, the colors and shape of Israel???

mossen · Feb 25, 2025

mossen said:
I really like DeepSeek but it should not rest on its laurels. The other Chinese labs are catching up fast.

Proof that Wenfeng reads this forum.

source:

Please, Log in or Register to view URLs content!

Hyper · Feb 25, 2025

Eventine said:
Anthropic is back on the throne for coding for the time being, but their "hybrid" model claim is mostly a marketing trick - reasoning being built on top of an existing base model, any of the current reasoning services can be told not to reason. Not to mention the new 3.7 Claude model is entirely behind the high Anthropic pay wall, so only enterprises will be paying for it. Just doesn't seem to hold that much value with a $15 / million tokens cost, especially since they'll charge you for the reasoning tokens. Anthropic's refusal to lower their prices even after Deep Seek is probably going to back fire.

That said, with all the recent model releases, it looks like the US is being forced to mount a counter attack - probably sooner than they would have wanted, given the lack of anything ground breaking. Out of the recent releases, Grok 3, o3-mini, and Gemini 2.0 Flash Thinking are the most promising American offerings and do appear to be moving the needle forward on value proposition. But we'll see how long they can last.

Anthropic trains on Amazon who along with Microsoft paid a premium for H100. Costed them around 40k per gpu. Others like Meta bought it at a price of 25k-30k.

vincent · Feb 25, 2025

mossen said:
Alibaba released their reasoning model for Qwen.

Please, Log in or Register to view URLs content!

Gave it a spin against Deepseek's R1. Used thinking and web search for both models.

Just a simple vibe question. Told them a specific sum for 2018 and then asked to adjust for inflation "today". I purposefully did not give a date to see how well they would approximate today being Feb of 2025.

DeepSeek failed miserably, gave me Oct 2023. The answer was obviously incorrect as a result of that.

By contrast, Qwen gave me the correct date (Feb 2025) but also approximated inflation adjustment as far as it could. Its final answer was very close to what I calculated. Moreover, DeepSeek was very slow whereas Qwen was much faster (but still not as fast as it should be). I really like DeepSeek but clearly it should not rest on its laurels. The other Chinese labs are catching up fast.

I think DeepSeek’s training data is only up to 2023.

tphuang · Feb 25, 2025

We are in interesting territory when Midea is putting Full DeepSeek R1 on AC unit

Please, Log in or Register to view URLs content!

also with DreamMe's S50 series of iRobot in the DreamHome app

Please, Log in or Register to view URLs content!

Lenovo's Yoga AI PC Is deeply integrating with DeepSeek edge models and also into Moto AI razr foldable phone

Please, Log in or Register to view URLs content!

Oppo ColorOS 15 phones integrating full blooded version of DeepSeek-R1

Please, Log in or Register to view URLs content!

ByteDance's Doubao AI assistant incorporating deep thinking function, but their own AI rather than DeepSeek

Please, Log in or Register to view URLs content!

Dahua offering a whole series of inference machines for DeepSeek using domestic ARM (Maybe Kunpeng) CPU and Ascend GPU. 7 products for different model sizes.

Please, Log in or Register to view URLs content!

Tinnove's AI cockpit using Tencent cloud has incorporated DeepSeek models.

Please, Log in or Register to view URLs content!

tphuang · Feb 25, 2025

https://twitter.com/i/web/status/1894402440964755930

Alibaba Wan got launched. image to video generation.

siegecrossbow · Feb 25, 2025

vincent said:
I think DeepSeek’s training data is only up to 2023.

There is technically a web search option that can be enabled I think. With that you’ll get more up to date info.

Overbom · Feb 25, 2025

tphuang said:
https://twitter.com/i/web/status/1894402440964755930

Alibaba Wan got launched. image to video generation.

Wan (former WanX..) looks extremely impressive from what I have seen so far. Benchmarks don't give it justice, much better than other models

I really dont think there is any stopping the AI train.

https://twitter.com/i/web/status/1894415843255357646

https://twitter.com/i/web/status/1894415856467415140

https://twitter.com/i/web/status/1894415879791939697

https://twitter.com/i/web/status/1894415890487415120

https://twitter.com/i/web/status/1894415896225210448

siegecrossbow · Feb 25, 2025

Overbom said:
Wan (former WanX..) looks extremely impressive from what I have seen so far. Benchmarks don't give it justice, much better than other models

I really dont think there is any stopping the AI train.

Kling has tough competition. Does Wan offer a free version?

Overbom · Feb 25, 2025

siegecrossbow said:
Kling has tough competition. Does Wan offer a free version?

Wan is open source. You can expect people to offer it in a few days

I managed to test it for a bit the moment it was announced but now hugging face demo can't take any more requests due to heavy load..

Anyway, free version is in (not sure if it's the 14b or lighter version) huggingface and on qwen.chat

Please, Log in or Register to view URLs content!

Artificial Intelligence thread

chlosy

Junior Member

mossen

Senior Member

Hyper

Junior Member

vincent

Grumpy Old Man

tphuang

General

tphuang

General

siegecrossbow

Field Marshall

Overbom

Brigadier

siegecrossbow

Field Marshall

Overbom

Brigadier