Artificial Intelligence thread

tokenanalyst

Brigadier
Registered Member
It seems like Chinese models need to make their performance better in other languages, like Russian for example

As far as I understand, apart from the synthetic data, I think DeepSeek bought their training data, so it is probably more biased towards English and Chinese. But the model is open source and can be fine-tuned on Russian data relatively easily.
 

9dashline

Captain
Registered Member

Now that Huawei Ascend is inferencing for DeepSeek and as the cost of intelligence trends towards electricity price to run the inferencing...

China needs to give AI to the world for free, not just in terms of open source and open weights but actual inferencing as well... chat app, web browser, api

(or at the very least, make SOTA models available priced at cost)

In the words of Steve Bannon, killing OpenAI is 100x more important....

China must prevent the USA from weaponising AI and from carrying out its evil plans for wholesale conversion of the value of human intellect and labor worldwide....

Sam Altman made clear his intent to be God and enslave all the rest of humanity who aren't "chosen"

Their rationale is that if o3 etc can save a company the need for a PhD, then they feel that is what it should be priced at, even if the actual cost for them to run inference is 100x cheaper

For every $2 China spends to subsidize AI for the world, that's $58 the US via OpenAI isn't making....

China has subsidized the world before, it needs to do so again... this AI race is 1000x more important than tiktok, temu, solar, 5g, or EV


in Scaman's mind, you are cheating him out of the $499.50 that he already feels entitled to....

no wonder "Open"AI is lobbying hard to make open source illegal

 

caudaceus

Senior Member
Registered Member
I'm watching for this to happen: the rest of the world using DeepSeek to develop their AI sectors because the cost is so much lower.


Usage and hope for it in Brazil


Thailand using DeepSeek


DeepSeek and Indonesia

Looks like DeepSeek is in fact working with AlonOS to set up a data center in Indonesia


It seems like Chinese models need to make their performance better in other languages, like Russian for example

Really intriguing that Llama didn't spark the same development compared to DeepSeek
 

tokenanalyst

Brigadier
Registered Member

Baidu Smart Cloud Qianfan fully supports DeepSeek-R1/V3 calls


"Baidu Smart Cloud" announced today that the DeepSeek-R1 and DeepSeek-V3 models have been officially launched on the Baidu Smart Cloud Qianfan platform.

The models connected this time are fully integrated with the Qianfan inference pipeline and Baidu's exclusive content security operator, providing enhanced model security and enterprise-level high availability. They also support comprehensive BLS log analysis and BCM alarms, helping users build intelligent applications safely and stably.

According to reports, the Qianfan platform is committed to providing users with full-process, one-stop AI services. In addition to powerful model resources, it also offers a complete one-stop tool chain for tuning model quality, covering data processing, model fine-tuning, model evaluation, model quantization and other key steps, to help companies deeply optimize model performance according to their own business needs. At the same time, the Qianfan platform has strong model inference hosting capabilities, supports mainstream inference frameworks such as vLLM, LMDeploy, TensorRT-LLM and SGLang, and also supports custom import and deployment of models, providing developers with a highly flexible development environment.

 

european_guy

Junior Member
Registered Member
Yet another try at DeepSeek's GPU number


This Korean guy usually posts legit stuff on NAND and DRAM news.

• DeepSeek’s training costs are relatively low, but its inference costs remain high.
• ByteDance’s pricing is cheaper than DeepSeek’s.
• When conducting inference using the same 10,000 GPUs, ByteDance can reduce costs through economies of scale.
• DeepSeek currently possesses 20,000 training GPUs.
• They have recently urgently secured an additional 10,000 to 20,000 GPUs.
• However, this still falls short of ByteDance’s level of 100,000 GPUs.
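
To put those numbers side by side, here is a rough back-of-the-envelope sketch in Python. It uses only the figures quoted in the bullets, which are unverified claims. It also assumes the newly secured GPUs are added on top of the existing 20,000, which the source does not actually state. All variable names are made up for illustration.

```python
# Figures as quoted in the bullets above (unverified claims, not confirmed data).
deepseek_training_gpus = 20_000
newly_secured_low, newly_secured_high = 10_000, 20_000
bytedance_gpus = 100_000

# Assumption: the newly secured GPUs are additive to the existing 20,000.
total_low = deepseek_training_gpus + newly_secured_low
total_high = deepseek_training_gpus + newly_secured_high

# DeepSeek's fleet as a fraction of ByteDance's claimed fleet.
share_low = total_low / bytedance_gpus
share_high = total_high / bytedance_gpus

print(f"DeepSeek total: {total_low:,} to {total_high:,} GPUs")
print(f"Share of ByteDance's fleet: {share_low:.0%} to {share_high:.0%}")
```

Under that additive assumption, DeepSeek would sit at 30,000 to 40,000 GPUs, i.e. roughly 30% to 40% of the 100,000 attributed to ByteDance.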
 