Artificial Intelligence thread

tokenanalyst · Feb 2, 2025

tphuang said:
It seems like Chinese models need to make their performance better in other languages, like Russian for example

Please, Log in or Register to view URLs content!

As far I understand apart from the synthetic data I think DeepSeek bought their training data, so is probably more bias towards English and Chinese. But the model is open source and can be finetuned using Russian data relatively easy.

vincent · Feb 2, 2025

https://twitter.com/i/web/status/1885536238553440535

https://twitter.com/i/web/status/1885829916337344522

https://twitter.com/i/web/status/1885526162002329924

9dashline · Feb 3, 2025

https://twitter.com/i/web/status/1886195704202399955

This is crazy growth rate, fastest adoption in the world by far....

valysre · Feb 3, 2025

vincent said:
https://twitter.com/i/web/status/1885536238553440535

https://twitter.com/i/web/status/1885829916337344522

https://twitter.com/i/web/status/1885526162002329924

What an interesting phenomenon. I hope the explanation is not so crass as OpenAI just copy-pasting R1's open-sourced weights.

9dashline · Feb 3, 2025

https://twitter.com/i/web/status/1886226239972610433

Now that Huawei Ascend is inferencing for DeepSeek and as the cost of intelligence trends towards electricity price to run the inferencing...

China needs to give AI to the world for free, not just in terms of open source and open weights but actual inferencing as well... chat app, web browser, api

(or at the very least, make SOTA models available priced at cost)

In the words of Steven Bannon, killing OpenAI is 100x more important....

China must deprive USA from weaponising AI and their evil plans of wholesale conversion of value of human intellect and labor worldwide....

Sam Altman made clear his intent to be God and enslave all the rest of humanity whom arent "chosen"

Their rationale is that if o3 etc can save a company the need for a PhD, then they feel that is what it should be priced at, even if the actual cost for them to run inference is 100x cheaper

For every $2 China spends to subsidize AI for the world, thats $58 the US via OpenAI isnt making....

China has subsidized the world before, it needs to do so again... this AI race is 1000x more important than tiktok, temu, solar, 5g, or EV

https://twitter.com/i/web/status/1886222189269065758

in Scaman's mind, you are cheating him out of the $499.50 that he already feels entitled to....

no wonder "Open"AI is lobbying hard to make open source illegal

https://twitter.com/i/web/status/1886216375649120722

caudaceus · Feb 3, 2025

tphuang said:
I'm watching for this to happen. Rest of the world using DeepSeek to develop their AI sector because cost is so much lower.

Please, Log in or Register to view URLs content!

Usage and hope for it in Brazil

Please, Log in or Register to view URLs content!

Thailand using DeepSeek

Please, Log in or Register to view URLs content!

DeepSeek and Indonesia

Please, Log in or Register to view URLs content!

Please, Log in or Register to view URLs content!

looks like in fact DeepSeek is working with AlonOS to set up data center in Indonesia

Please, Log in or Register to view URLs content!

It seems like Chinese models need to make their performance better in other languages, like Russian for example

Please, Log in or Register to view URLs content!

Really intriguing that Llama didn't spark the same development compared to DeepSeek

tokenanalyst · Feb 3, 2025

Baidu Smart Cloud Qianfan fully supports DeepSeek-R1/V3 calls

"Baidu Smart Cloud" announced today that the DeepSeek-R1 and DeepSeek-V3 models have been officially launched on the Baidu Smart Cloud Qianfan platform.

The model connected this time has fully integrated the Qianfan reasoning link and Baidu's exclusive content security operator to achieve model security enhancement and enterprise-level high availability guarantee. It also supports comprehensive BLS log analysis and BCM alarms, helping users to build intelligent applications safely and stably.

According to reports, the Qianfan platform is committed to providing users with full-process, one-stop AI services. In addition to powerful model resources, it also matches a complete one-stop model effect tuning tool chain, including data processing, model fine-tuning, model evaluation, model quantification and other key links, to help companies deeply optimize model performance according to their own business needs. At the same time, the Qianfan platform has excellent model reasoning hosting capabilities, supports various mainstream reasoning frameworks such as vLLM, LMDeploy, TensorRT-LLM, SGLang, and also supports custom import and deployment of models, providing developers with a highly flexible development environment.

Please, Log in or Register to view URLs content!

tokenanalyst · Feb 3, 2025

caudaceus said:
Really intriguing that Llama didn't spark the same development compared to DeepSeek

It did spark the same development, Llama was the one who sparked the open LLM revolution and that include DeepSeek.

european_guy · Feb 3, 2025

Yet another try at DeepSeek's GPU number

https://twitter.com/i/web/status/1886388826928812205

This Korean guy usually post legit stuff on NAND and DRAM news.

• DeepSeek’s training costs are relatively low, but its inference costs remain high.
• ByteDance’s pricing is cheaper than DeepSeek’s.
• When conducting inference using the same 10,000 GPUs, ByteDance can reduce costs through economies of scale.
• DeepSeek currently possesses 20,000 training GPUs.
• They have recently urgently secured an additional 10,000 to 20,000 GPUs.
• However, this still falls short of ByteDance’s level of 100,000 GPUs.

tphuang · Feb 3, 2025

https://twitter.com/i/web/status/1886485743998279944

Qwen has moved up the Elo rating leaderboard. 3 out of top 10 are Chinese models now.

Artificial Intelligence thread

tokenanalyst

Lieutenant General

vincent

Grumpy Old Man

9dashline

Captain

valysre

Junior Member

9dashline

Captain

caudaceus

Senior Member

tokenanalyst

Lieutenant General

Baidu Smart Cloud Qianfan fully supports DeepSeek-R1/V3 calls

tokenanalyst

Lieutenant General

european_guy

Junior Member

tphuang

General

Artificial Intelligence thread

Lieutenant General

Grumpy Old Man

Captain

Junior Member

Captain

Senior Member

Lieutenant General

Baidu Smart Cloud Qianfan fully supports DeepSeek-R1/V3 calls​

Lieutenant General

Junior Member

General

Baidu Smart Cloud Qianfan fully supports DeepSeek-R1/V3 calls