Artificial Intelligence thread

9dashline

Major
Registered Member
Yeah it could all be a stunt, but it doesn't really matter: either they can't afford to run anything above Opus 4.8, or they're not allowed to run anything above Opus 4.8, in both scenarios they're f*ed, because China can both afford to run it and are giving them out for free to anyone in the world.

If it is a compute problem and US models are being capped due to compute while Chinese models aren't, that would transcend irony and enter poetic territory
I had the feeling with Fable they finally hit a wall of deminishing returns... The thing was 10 trillion parameter, twice the cost of OPUS and by my tests, no where near the capability they hyped it up to be

Why else would both Dario and Altman come out in recent days to call for global pause on AI development

Then Anthropic got called out for making Fable secretly sabotaging anyone using it for AI development including lawful fine tuning etc....

Plus they were only doing this as a stunt until June 23... It was a limited PR stunt for their IPO and then Fable was going to be pulled from subscriptions and only available via extra credit spends or API at $50 per million output tokens...

Clearly this was unsustainable and the US AI bubble was already popping anyway so maybe the US gov did this as a way to give these hyperscalers a way out etc lol
 

Engineer

Major
If it is a compute problem and US models are being capped due to compute while Chinese models aren't, that would transcend irony and enter poetic territory
Fm1CIcZaYAMv96e.jpg
 

Some1Guy

Junior Member
Registered Member
I had the feeling with Fable they finally hit a wall of deminishing returns... The thing was 10 trillion parameter, twice the cost of OPUS and by my tests, no where near the capability they hyped it up to be

Why else would both Dario and Altman come out in recent days to call for global pause on AI development

Then Anthropic got called out for making Fable secretly sabotaging anyone using it for AI development including lawful fine tuning etc....

Plus they were only doing this as a stunt until June 23... It was a limited PR stunt for their IPO and then Fable was going to be pulled from subscriptions and only available via extra credit spends or API at $50 per million output tokens...

Clearly this was unsustainable and the US AI bubble was already popping anyway so maybe the US gov did this as a way to give these hyperscalers a way out etc lol
Considering that we're in a oil, LNG & Helium crisis due to the closure of the strait of Hormuz, it's not that crazy that they would pull the plug now.

Since Helium is essential for the production of computer chips and 40% of the supply was essentially taken offline of the supply of Helium and the fact that the US can't make up for all of the lost supply then there's bound to be a drop in the production of computer chips resulting in the slow down of data center rollout and than there's 20% of oil & 30% LNG cutoff which effects electricity generation which would again lead to a halt in data center rollout and lead to problems with powering the remaining data centers.

Than there's the problem with nuclear power and getting the reactor grade uranium where Russia has a monopoly, it also doesn't help that it takes a lot of time to build reactors in the US. There's a theory that the reason the US tried to steal the enriched uranium from Iran is that it was needed as fuel for reactors to power data centers, them failing might have lead them to steal enriched uranium from Venezuela.

Someone could probably do a more in depth analysis since i am not a expert in energy supply chains and this is just on the surface analysis.
 

bsdnf

Senior Member
Registered Member
I believe Anthorpic can obtain an exemption from the (extremely corrupt) Trump administration, or launch Opus5.

AI stocks are too important to the US stock market; in my view, this is blackmail by the Trump administration, and it won't last long.
 

iewgnem

Captain
Registered Member
I believe Anthorpic can obtain an exemption from the (extremely corrupt) Trump administration, or launch Opus5.

AI stocks are too important to the US stock market; in my view, this is blackmail by the Trump administration, and it won't last long.
Anthropic was going to pull Fable next week anyway, they're not getting directly screwed by pulling Fable a week early.

What they're getting f*ed by is going forward they won't be able to build any model better than Fable, nor can any other American company build any model better than Fable, without limiting them to USG customers only which means USG is now on the hook to pay for their entire datacenter buildout.

AI stock might be too important for US stock market, but so is oil supply, it's not about the intent, it's about the competency, or lack thereof.
 

dripblackcoffee

New Member
Registered Member
I had the feeling with Fable they finally hit a wall of deminishing returns... The thing was 10 trillion parameter, twice the cost of OPUS and by my tests, no where near the capability they hyped it up to be

Why else would both Dario and Altman come out in recent days to call for global pause on AI development

Then Anthropic got called out for making Fable secretly sabotaging anyone using it for AI development including lawful fine tuning etc....

Plus they were only doing this as a stunt until June 23... It was a limited PR stunt for their IPO and then Fable was going to be pulled from subscriptions and only available via extra credit spends or API at $50 per million output tokens...

Clearly this was unsustainable and the US AI bubble was already popping anyway so maybe the US gov did this as a way to give these hyperscalers a way out etc lol
10 trillion would be around the estimate of gpt 4.5 by the way, fable is in fact really good and anthropic did actually find something that made scaling up work well, but thats not to discount the fact that kimi 2.7 code and glm 5.2 is also really good. Its just that the talk of fable being bad or just marketing is pure cope, who knows what wouldve happened if they kept serving it, maybe they wouldve quantized it to make it cheaper and worse like they (allegedly) did to 4.6 and even then that model was really good no matter what. Also helium comes in grades and the biggest chip fabs (tsmc, intel, samsung) can secure helium from alternative sources and they also have recycling, they will be fine
 

tokenanalyst

Lieutenant General
Registered Member

Moore Threads announces open-sourcing of MusaCoder: the first domestically developed, full-featured GPU full-stack training code for large-scale models.​


Moore Threads announced the official release and open source of MusaCoder, a large-scale dedicated code model for generating low-level GPU operators.

Moore Threads stated that MusaCoder is the industry's first open-source large model that has completed the entire training and verification process based on a domestically produced GPU computing platform. Its complete post-training process was completed on the Kua'e Intelligent Computing Cluster built on MTT S5000.

According to reports, MusaCoder includes two parameter scales, 9B and 27B, and is mainly designed for GPU low-level operator generation tasks, with a focus on supporting the automatic generation of high-performance CUDA/MUSA native kernel code from PyTorch standard operators.

This capability can lower the barrier for developers to write low-level GPU operators by hand and improve the efficiency of code generation, verification and optimization in high-performance GPU computing scenarios.

In terms of performance, in the KernelBench test, the MusaCoder-27B-RL achieved Overall Pass@8 93.2% and Avg.@8 88.60%, surpassing mainstream state-of-the-art code models such as Claude Opus 4.7, DeepSeek-V4 Pro, GLM-5.1, and Kimi K2.6, reaching the current industry-leading level.

According to Moore Threads, MusaCoder's full-stack training and verification process, including SFT (supervised fine-tuning), RFT (rejection sampling fine-tuning), RL (reinforcement learning), asynchronous rollout, online compilation and execution verification, and reward calculation, is all completed using the Kua'e Intelligent Computing Cluster built on MTT S5000.​

Please, Log in or Register to view URLs content!
 
Top