Miscellaneous News

Chevalier

Captain
Registered Member
Please, Log in or Register to view URLs content!

americans are reporting their DMs are being monitored, you won’t realise it but it seems there’s a “night of long knives” situation going on in America and a media blackout. It was only via TikTok that we learned of the ICE arrests before mainstream media came out to do damage control.


what use were all those H1bs if China still beat the U.S. in tech?


Every major U.S. company in tech or chips now resides in China at the goodwill of the Chinese COMMUNIST party.

Please, Log in or Register to view URLs content!

native Americans are being rounded up by immigration agents…. Yes s you heard that right. But I must say, this sounds a lot like….”Uighur concentration camps”
 
Last edited:

FairAndUnbiased

Brigadier
Registered Member
You cannot run the full Deep Seek model in a Rasperry Pi. It is mathematically impossible to run a 671 billion parameters model on anything less than a cluster of work station GPUs.

What you're talking about is running the distilled versions of the model, which is significantly weaker than the full R1, and while still impressive, it's more for hobby developers than enterprise users.

The real benefit of Deep Seek is the ~20x reduction in enterprise API costs (according to benchmarks - Deep Seek generates more thinking tokens so costs a bit more than it might seem just looking at cost/token vs. O1). That is a consequence of several factors, of which cheaper training costs is only a small contributor.

It took Meta ~$60 million to train Llama 3.1. But how much money do you think they spent on engineering resources within the company? A team of ~200 researchers/engineers costs >$250 million a year for a Silicon Valley company to keep around. Add that on top of all the GPUs they had to buy to build up infrastructure, the support staff, the building costs, and we're talking a billion dollars a year for a generative AI team. It is well known that Open AI pays its researchers $1 million a year. Compared to the training costs - estimated at ~$100 million for O1 - this is a much larger expense.

That's the thing that Deep Seek was able to get around, not just from significantly reducing model training costs, but from superior value/currency spent, which is shared by all Chinese companies relative to the West. The cost to train a model is just a fraction of the cost it takes to maintain a viable LLM product. It's everything else where the cost difference makes the most impact, and that's also why I think, longer term, China's real advantage won't be from algorithmic innovations - which will be quickly copied - it'll be from structural advantages, which are far more sustainable.
If it's labor cost then how come the same Indians that are H1Bs in the US can't do it in India?
 

Eventine

Junior Member
Registered Member
If it's labor cost then how come the same Indians that are H1Bs in the US can't do it in India?
It's not just labor cost. It's labor cost + quality. Scientific talent in India is weak - not a single Indian university is in the top 100 of institutions. Their best university ranks like, 118th?

China has both the talent and the cost advantage. While the US only has the talent, but not the cost. That's how China is able to make structural plays like charging 1/20th the API cost for comparable performance.
 

tygyg1111

Captain
Registered Member
View attachment 144351
Wow so the NIH grant pause might be an even bigger deal than I thought. The new admin is actually going full anti-science. The odd thing is that NIH is actually pretty bipartisan because there’s a lot of conservative scientists too… wonder if this is Musk’s doing…

Cue the quote "The best way to stop a civilizations progress is to kill their science". This is an exemplary nth example of "Do nothing, win" in 2025.
 

FairAndUnbiased

Brigadier
Registered Member
It's not just labor cost. It's labor cost + quality. Scientific talent in India is weak - not a single Indian university is in the top 100 of institutions. Their best university ranks like, 118th?

China has both the talent and the cost advantage. While the US only has the talent, but not the cost. That's how China is able to make structural plays like charging 1/20th the API cost for comparable performance.
But these Indians are working as H1Bs in the US so they can at least work under direction.
 

dingyibvs

Senior Member
You cannot run the full Deep Seek model in a Rasperry Pi. It is mathematically impossible to run a 671 billion parameters model on anything less than a cluster of work station GPUs.

What you're talking about is running the distilled versions of the model, which is significantly weaker than the full R1, and while still impressive, it's more for hobby developers than enterprise users.

The real benefit of Deep Seek is the ~20x reduction in enterprise API costs (according to benchmarks - Deep Seek generates more thinking tokens so costs a bit more than it might seem just looking at cost/token vs. O1). That is a consequence of several factors, of which cheaper training costs is only a small contributor.

It took Meta ~$60 million to train Llama 3.1. But how much money do you think they spent on engineering resources within the company? A team of ~200 researchers/engineers costs >$250 million a year for a Silicon Valley company to keep around between compensation and benefits. Add that on top of all the GPUs they had to buy to build up infrastructure, the support staff, the building costs, and we're talking a billion dollars a year for a generative AI team. It is well known that Open AI pays its researchers $1 million a year, so they're probably paying even more.

That's the thing that Deep Seek was able to get around, not just from significantly reducing model training costs, but from superior value/currency spent, which is shared by all Chinese companies relative to the West. The cost to train a model is just a fraction of the cost it takes to maintain a viable LLM product. It's everything else where the cost difference makes the most impact, and that's also why I think, longer term, China's real advantage won't be from algorithmic innovations - which will be quickly copied - it'll be from structural advantages, which are far more sustainable.

Stop spewing nonsense, salary is only a small part of the cost.
 

Attachments

  • 75a07ded-884e-4acf-8f88-2d5ccf67f826.png
    75a07ded-884e-4acf-8f88-2d5ccf67f826.png
    89.9 KB · Views: 56

TK3600

Major
Registered Member
Top