Miscellaneous News

xypher

Senior Member
Registered Member
$850M USD capex though, while R1 can apparently run on a desktop.
If you have an EPYC/Xeon box with around 700 GB of RAM (671 GB for the FP8 model + room for context) or a similar amount of VRAM on GPUs, then yeah, you can run R1 locally. With the same rig you can also run LLaMA 3.1 405B locally, although on CPU it would be painfully slow, unlike R1, because 405B is a dense model while R1 is MoE with only 37B parameters activated per token. However, for personal use it does not make sense to spend that much money when the DeepSeek API is dirt cheap. Plus, you can use stuff like web search through their web UI.
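A rough sketch of the arithmetic behind that comparison. The parameter counts and FP8 size come from the post; the 400 GB/s memory bandwidth is a made-up figure for a hypothetical server, and the speed estimate assumes decoding is purely memory-bandwidth-bound, so treat the outputs as ballpark upper bounds rather than benchmarks.

```python
# Back-of-the-envelope sketch; the bandwidth figure is an assumption for illustration.

GB = 1e9  # decimal gigabytes, matching the "671 GB" figure in the post


def weight_memory_gb(total_params: float, bytes_per_param: float) -> float:
    """Memory needed just to hold the weights, in GB."""
    return total_params * bytes_per_param / GB


def decode_tokens_per_sec(active_params: float, bytes_per_param: float,
                          mem_bandwidth_gb_s: float) -> float:
    """Rough upper bound on decode speed when memory bandwidth is the bottleneck.

    Each generated token has to read every *active* weight once, so
    tokens/s <= bandwidth / bytes of active weights.
    """
    bytes_per_token = active_params * bytes_per_param
    return mem_bandwidth_gb_s * GB / bytes_per_token


FP8 = 1.0          # bytes per parameter
BANDWIDTH = 400.0  # GB/s, hypothetical server memory bandwidth (assumption)

r1_weights = weight_memory_gb(671e9, FP8)                     # all experts must be resident
r1_speed = decode_tokens_per_sec(37e9, FP8, BANDWIDTH)        # MoE: only 37B active per token
llama_speed = decode_tokens_per_sec(405e9, FP8, BANDWIDTH)    # dense: all 405B active

print(f"R1 weights at FP8: {r1_weights:.0f} GB")
print(f"R1 (MoE, 37B active): <= {r1_speed:.1f} tok/s")
print(f"405B dense: <= {llama_speed:.1f} tok/s")
```

Under these assumptions the MoE model needs more memory to hold but reads roughly 11x fewer bytes per token than the dense 405B model, which is why it stays usable on CPU.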

The $850M capex is for the datacentre. DeepSeek was not trained on a desktop either, lol; they had 2048 H800s iirc. It is still impressive because models of that class typically require about as many cards as the big LLaMA, i.e. 16k H100s, so DeepSeek pulling this off with only 2k H800s, which are gimped versions of the H100, is huge.
I've seen people run distilled versions locally on Raspberry Pis and mobile phones. It's what makes the "ccp is lying about hardware specs" cope all the more hilarious.
Distilled versions are not really comparable to the full R1 though, not even in the same league. Most are not even the best in their parameter size category, mainly because they did not go through the same training process as R1, with RL and all of that; they were only fine-tuned on R1 outputs.

I wonder what the Qwen team will deliver this time; their Qwen 2.5 models are still the best or close to the top in their respective parameter sizes. For example, Qwen2.5-Coder-32B is the best coding model at this size, no contest.
 

pmc

Major
Registered Member
It'll be cute if they think they can out-compete in compensation to a country that makes $1 trillion trade surplus a year.
$1T is not a big amount when you consider dollar devaluation and population size. Adjusted for population, even Russia achieved these numbers much earlier. Russia has reached such a stage that it no longer needs to sell anything based on high tech; it can just sell other stuff and still create a trade surplus. This is not Abu Dhabi, which is far richer, and they are not going to compromise on real estate prices at the higher end to attract anyone. They can just boost salaries unofficially.

Dubai unveils $8.7tn plan to double its economy by 2033



Putin praises UAE president for interest in developing artificial intelligence

Putin added: “We know what level of international ranking the UAE reached last year,
 

plawolf

Lieutenant General
Whatever. It takes a few seconds to make a signature but each legal process lasts for months if not years. So Donald, just keep signing. Eventually the backlog will be so huge it would take centuries to unwind.

It would be a mistake to dismiss Trump out of hand as an idiot. If we forget about who is championing the policy and just look at the strategy behind the moves, it’s actually not without merit.

Firstly, Trump is someone who cares deeply about his legacy and how he will be remembered. That is evident from his desperate, pathological need to literally put his name on everything. He would therefore not be all that interested in spamming loads of orders that then get systematically undone even before he leaves office. So this blitz of EOs is not just a real-life DoS attack. While that is certainly part of the method, it’s more of a means to an end than the end in itself.

I think the strategy behind this move is simple. It’s about momentum and critical mass. America needs a revolution, it’s as simple as that. Trump is trying to give them enough of the wholesale societal change of a revolution to head off the actual revolution and do away with the oligarchy-getting-guillotined part, since his head would also roll in that case. Worst case, all of his EOs do get overturned eventually, but in the months or even years until then, he will already have changed reality on the ground. The hope is that, while legally speaking he hasn’t got a leg to stand on, if he can achieve results fast enough, the legislature and the people might just decide they actually liked what he was selling once the courts overturn his EOs, and demand those changes get made into actual laws. Even if the people don’t like the methods enough to enshrine them as law, they might like the results enough to not seek to undo what Trump has achieved. Because if there is one thing Trump loves more than putting his name on buildings, it’s the idea of people saying he was right.
 

iewgnem

Junior Member
Registered Member
$1T is not a big amount when you consider dollar devaluation and population size. Adjusted for population, even Russia achieved these numbers much earlier. Russia has reached such a stage that it no longer needs to sell anything based on high tech; it can just sell other stuff and still create a trade surplus. This is not Abu Dhabi, which is far richer, and they are not going to compromise on real estate prices at the higher end to attract anyone. They can just boost salaries unofficially.
Except this $1T is derived from Chinese industrial prices, which translate into volume far larger than $1T.
 

plawolf

Lieutenant General
If Trump offers up EUV, workstation GPUs and other assorted things that China currently wants but can't buy due to the bans, so as to balance trade, should China accept?

For Trump this doesn't seem like an impossibility, but on the other hand if China agrees it could hurt domestic efforts already underway.

What good are those machines when they come preloaded with kill switches? Hard pass.
 

BillRamengod

New Member
Registered Member

The US thinking everyone else wastes time spinning as much as they do. Slavery isn’t slavery if you just don’t call it slavery. Don’t call Marco Rubio… Marco Rubio so that China can save face and still talk to the US Secretary of State even though he’s on China’s sanctions list.
It's so funny that the speaker was so confused that she asked directly in English.

An article I read earlier on Zhihu perfectly answers this question.
(1) Why the Name Change?
In China, the translation of foreign dignitaries' names has always been based on the Xinhua News Agency's translations. In a 2016 article by the People's Daily titled "Why was 'Trump' translated as '特朗普 (Telangpu)' instead of '川普 (Chuanpu)'?", Li Xuejun, the director of the Xinhua News Agency's Transliteration Room, mentioned that "transliterations must be unified and should be the responsibility of Xinhua News Agency."
According to the article, the principle for foreign name transliteration is to "follow the owner's pronunciation and adhere to common practice." More than 50 years ago, 'Trump' was already translated as '特朗普 (Telangpu)'. Therefore, when Trump was elected President of the United States in 2016, Xinhua News Agency reported his name as '特朗普', and all official media followed suit, adhering to the principle of unified transliteration.
However, this principle of unified transliteration mainly applies to official reports. In the private sphere, whether people use '特朗普' or '川普' is a matter of personal freedom, as it is merely a phonetic translation of a name. But official reports on foreign dignitaries must follow Xinhua News Agency's translations to maintain consistency.
Previously, Rubio was not the U.S. Secretary of State but a senator, and thus not considered a foreign dignitary. As a result, media reports on Rubio's name were not consistent, with variations such as '鲁比奥', '卢比奥', and '鲁比欧'. Even Xinhua News Agency's reports used both '卢比奥' and '鲁比奥'. However, once Rubio became the U.S. Secretary of State, he was then regarded as a foreign dignitary, and the transliteration of his name had to follow the latest release by Xinhua News Agency. On January 21st, Xinhua News Agency did issue a separate transliteration release for Rubio.
Therefore, from this point onward, all official media reports regarding Rubio must use the unified transliteration. This is the truth behind the so-called renaming of Rubio. It is not at all a matter of changing Rubio's name to accommodate his visit, as if it were child's play.
 

Chevalier

Captain
Registered Member
Oh it’s starting.


People are waking up to the fact that they’ve been fed garbage tier propaganda by the MSM all their lives.
I fear they’re taking the wrong lesson from this XHS situation; rather than banding together into a workers revolt and a new armed revolution t9 usher in a new socialist America, I fear their response is to try to become an illegal immigrant in China or worse, thailand style sexpat in Asia. It’s bad enough that Asia has to deal with harassment from creepy Indians but an enclave of colonisers in Asia is the last thing China nor the rest of east Asia needs.
Saying the quiet part out loud. How long does NATO have left after this?

Maybe Denmark should get some J-35's instead.

What a pitiful state. How the mighty have fallen, Denmark the ardent champion of globohomo is now to be devoured to perpetuate white Anglo supremacy. If the Danes do not fight, they may as well castrate themselves and give over their women to men of superior genes.
Red Note is causing trouble for the U.S. embassy!

This is where we should cosplay as Anglos and other westerners by tut-tutting about “how can we trust the U.S. Government’s numbers?!” “Funny numbers.”

Love how the masturbators to the US are upset over the cost of Chinese AI. Does that make any difference to the fact that everyone agrees that China’s model works? That’s the illusion they created for themselves, which is why it is a problem they face and no one else does. Yes, they created a bubble because they were lying so they could exploit it and make as much money as they could. To neutral eyes they were trying to perpetrate the biggest crime in the history of the world. Is that China's fault? China didn't help maintain the illusion. Want to be as petty as their logic? "OpenAI" is actually "ClosedAI" since it's not open-source, and because their AI doesn't point that out, it fails as an AI.
It’s racism. It’s always been racism, because now every caucasoid and wannabe caucasoid realises they have nothing that they can Base their superiority complex on with regards to China, plus only a few months ago westerners viewed Chinese as lower than them. XHS showed them the truth, and 6th gen fighters and Zhuhai reminded them of their mortality, if you were say a WASP elite, Jewish cabalist or Mormon CIA worker, all of whom depend and perpetuate Anglo American whites supremacy, China is the single greatest threat and challenge to your ability to exploit the entire globe.

We need to Roman Goodbye all these parasites on society!
You just know it isn’t just normal sex workers but Epstein level crap as well.
 

Eventine

Junior Member
Registered Member
Nope, you can give OpenAI billions of $ (and people have) and they still wouldn't be able to run O1 on a Raspberry Pi
Millennia of meritocratic tradition isn't something you can buy. China did not become the world's oldest and most successful civilization by simply being less corrupt.
Being institutionally and culturally corrupt to the bone is certainly not helping Americans, but at the end of the day, even if they were not corrupt, they would still not be China's equal; they need to be around for at least another 2000 years to even begin to think about it.
You cannot run the full DeepSeek model on a Raspberry Pi. A 671-billion-parameter model needs roughly 671 GB for the weights alone at FP8, so it will not fit on anything less than a cluster of workstation GPUs or a server with a comparable amount of RAM.

What you're talking about is running the distilled versions of the model, which are significantly weaker than the full R1, and while still impressive, they're more for hobby developers than enterprise users.
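To make the size gap concrete, here is a minimal sketch of a "does it fit" check. The 8 GB device and the 1.5B-parameter model standing in for a small distilled variant are assumptions for illustration, and the check counts weights only (no KV cache or runtime overhead), so real requirements are higher.

```python
def weights_gb(params_billions: float, bits_per_param: int) -> float:
    """Size of the weights alone, in GB (ignores KV cache and runtime overhead)."""
    return params_billions * bits_per_param / 8


def fits(params_billions: float, bits_per_param: int, ram_gb: float) -> bool:
    """True if the weights alone are smaller than the available RAM."""
    return weights_gb(params_billions, bits_per_param) < ram_gb


DEVICE_RAM_GB = 8  # assumed RAM of a small single-board computer

full_r1_4bit = weights_gb(671, 4)        # full model, even aggressively quantized
small_distill_4bit = weights_gb(1.5, 4)  # hypothetical small distilled model

print(f"671B at 4-bit: {full_r1_4bit} GB, fits: {fits(671, 4, DEVICE_RAM_GB)}")
print(f"1.5B at 4-bit: {small_distill_4bit} GB, fits: {fits(1.5, 4, DEVICE_RAM_GB)}")
```

Even at 4 bits per weight the full model is hundreds of GB, while a small distilled model is under 1 GB, which is why only the latter shows up on phones and single-board computers.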

The real benefit of DeepSeek is the ~20x reduction in enterprise API costs (according to benchmarks - DeepSeek generates more thinking tokens, so it costs a bit more than it might seem from just comparing cost/token vs. O1). That is a consequence of several factors, of which cheaper training is only a small contributor.
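The point about thinking tokens can be sketched as follows. All prices and token counts here are made-up placeholders, not actual API pricing; they are chosen only to show how a larger per-token price gap shrinks once the cheaper model emits more tokens per task.

```python
def cost_per_task(price_per_million_tokens: float, tokens_per_task: float) -> float:
    """Dollar cost of one task: price per token times tokens generated."""
    return price_per_million_tokens * tokens_per_task / 1_000_000


# Hypothetical numbers for illustration only (not real prices).
EXPENSIVE_PRICE = 60.0   # $ per million output tokens
CHEAP_PRICE = 2.0        # $ per million output tokens
EXPENSIVE_TOKENS = 2_000 # tokens generated per task
CHEAP_TOKENS = 3_000     # the cheaper model "thinks" longer per task

sticker_ratio = EXPENSIVE_PRICE / CHEAP_PRICE
effective_ratio = (cost_per_task(EXPENSIVE_PRICE, EXPENSIVE_TOKENS)
                   / cost_per_task(CHEAP_PRICE, CHEAP_TOKENS))

print(f"price-per-token ratio: {sticker_ratio:.0f}x")
print(f"cost-per-task ratio:   {effective_ratio:.0f}x")
```

With these placeholder inputs the per-token gap is 30x but the per-task gap is 20x, so comparing cost/token alone overstates the saving.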

It took Meta ~$60 million to train Llama 3.1. But how much money do you think they spent on engineering resources within the company? A team of ~200 researchers/engineers costs >$250 million a year for a Silicon Valley company to keep around, between compensation and benefits. Add to that all the GPUs they had to buy to build up infrastructure, the support staff and the building costs, and we're talking a billion dollars a year for a generative AI team. It is well known that OpenAI pays its researchers $1 million a year, so they're probably paying even more.
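Spelling out the arithmetic in that paragraph, using only the rough figures quoted above (the post gives them as approximations and lower bounds, so the results are ballpark as well):

```python
# Figures as quoted in the post; all are approximate.
training_run = 60e6    # ~$60M to train the model
team_cost = 250e6      # >$250M/year for the team
headcount = 200        # ~200 researchers/engineers
total_budget = 1e9     # ~$1B/year all-in for the generative AI team

per_head = team_cost / headcount             # implied fully loaded cost per person
training_share = training_run / total_budget # training run as a share of yearly spend

print(f"implied cost per head: ${per_head:,.0f}/year")
print(f"training run share of annual budget: {training_share:.0%}")
```

On these numbers the training run is a single-digit percentage of the yearly spend, which is the sense in which training cost is only a fraction of the total.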

That's the thing that DeepSeek was able to get around, not just by significantly reducing model training costs, but through superior value per unit of currency spent, which is shared by all Chinese companies relative to the West. The cost to train a model is just a fraction of the cost it takes to maintain a viable LLM product. It's everything else where the cost difference makes the most impact, and that's also why I think, longer term, China's real advantage won't be from algorithmic innovations - which will be quickly copied - it'll be from structural advantages, which are far more sustainable.
 