Any comparison with Nvidia?
I think it was mentioned earlier in this thread that an 8-card H200 cluster can do 3800 tps.
So maybe the Ascend 910C can do around 2000 tps, possibly more with optimization (the 910C should be more than 2x the 910B in compute). That matches the theory that the 910C delivers 60%+ of the H100's inference performance. The H200 should beat the H100 even at inference, since it holds more memory and has much higher memory bandwidth. Those factors matter less for inference than for training, but I'd imagine they still matter a bit.
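A quick back-of-envelope sketch of that estimate. All the numbers here are the thread's claims, not benchmarks, and the 60% ratio is against the H100, so using it against the faster H200 gives only a rough upper bound:

```python
# Assumed inputs (claims from the thread, not measured results):
h200_cluster_tps = 3800          # 8-card H200 cluster throughput, per an earlier comment
ascend_ratio_vs_h100 = 0.60      # claimed 910C : H100 inference-performance ratio

# The H200 outperforms the H100, so the 910C's ratio against the H200
# must be somewhat below 0.60; treat this product as an upper bound.
upper_bound_tps = h200_cluster_tps * ascend_ratio_vs_h100
print(round(upper_bound_tps))    # 2280 -- consistent with the "around 2000 tps" guess
```

Since the real 910C-to-H200 ratio is below 0.60, an actual figure somewhat under 2280 tps, i.e. "around 2000", is the plausible range.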
Baidu is in trouble imo.
Tencent has now fully integrated DeepSeek into search and its assistant across all its platforms. Why would anyone still use Baidu for search if Tencent's search uses the same technology?