Artificial Intelligence thread

bsdnf · May 21, 2026

9dashline said:
Hmmm Google couldn't use Gemini to code a CapCut themselves?

They haven't even fixed Antigravity.

HighGround · May 21, 2026

Weren’t people dooming about Qwen when Junyang Lin said he was gonna leave?

Feels like Qwen’s sped up and improved their releases…

tphuang · May 21, 2026

HighGround said:
Weren’t people dooming about Qwen when Junyang Lin said he was gonna leave?

Feels like Qwen’s sped up and improved their releases…

He was probably the hindrance.

meedicx said:
From the Qwen 3.7 max blog post, they described how the model optimized its own inference kernel achieving a 10x speedup on Alibaba's self-developed ZW-M890 PPU AI chips. This is another data point reinforcing my previous point that CUDA has no moat in the age of agentic coding. Ironically, the success of CUDA training AI coding models has destroyed its own moat.

https://twitter.com/i/web/status/2057655884793168280

My thoughts on this, given the commentary from a Qwen team member: it's a huge gain if Alibaba's own Zhenwu-890 provides the compute the model needs for self-improvement.

Also, this puts to rest the bullshit claim that domestic chips can't be used for training. If you can run AI models on them to improve kernels, then those models can also write kernels for them for CPT and post-training.

BlackWindMnt · May 22, 2026

9dashline said:
Hmmm Google couldn't use Gemini to code a CapCut themselves?

Gemini please analyze capcut bytecode and recreate it, ooh and make no mistakes.

mossen · May 22, 2026

The Pareto Frontier. Chart made with Claude.

5 of the 7 models on the Pareto frontier are Chinese. Looks pretty good. Qwen 3.7-Max is clearly the best non-US model publicly available and also very cost-effective for the intelligence you get.

Someone on Twitter said that V4 flash is now the new Gemini Flash given that Google no longer seems interested in releasing very cheap but strong models. That seems right. Compare V4 flash with Gemini 3 Flash on the chart. Same intelligence but crazy cost difference.

bsdnf · May 22, 2026

DeepSeek V4 is clearly not smart enough, but its extremely high KV-cache hit rate has really changed a lot: with enough harness and skill constraints, it can accomplish a lot of tasks.

bsdnf · May 22, 2026

DeepSeek: After the discount event ends on May 31st, the V4-Pro price will be permanently adjusted to one-quarter of the original price.

The original price is time-limited; the discounted price is permanent.

sunnymaxi · May 22, 2026

China’s AI start-up funding triples to US$16b in first quarter amid bets on LLMs.

Funding for China’s artificial-intelligence-related start-ups jumped nearly threefold year on year in the first quarter, as investors poured capital into large language models (LLMs) and embodied AI amid growing optimism over the country’s technology ecosystem.

AI-related start-ups secured more than 110 billion yuan (US$16.2 billion) in the first three months of the year, representing a 185 per cent surge from the same period last year, according to data released on Thursday by Beijing-based venture capital and private equity research firm Zero2IPO Research.

The AI boom has helped lift China’s broader private equity and venture capital market. Total investment activity reached 2,568 deals worth 234.4 billion yuan in the March quarter.

mossen · May 22, 2026

That makes sense. V4 flash is a great model but V4 Pro was overpriced compared to Kimi & MiMo. This resets everything.

Speaking of DeepSeek, their funding round just keeps getting bigger (it was $7 billion last week).

Wenfeng is probably the most idealistic and idiosyncratic founder in AI. He also mandates max 6-8 hours per day of work while keeping the organisation as flat as possible. Hard not to root for the company.

P.S. DeepSeek noted in their V4 model card that they are getting tons of Huawei inference compute by EOY. So if they are permacutting prices now, then that bodes well. They should be able to use that inference compute to massively scale up to meet demand.

meedicx · May 22, 2026

mossen said:
Someone on Twitter said that V4 flash is now the new Gemini Flash given that Google no longer seems interested in releasing very cheap but strong models. That seems right. Compare V4 flash with Gemini 3 Flash on the chart. Same intelligence but crazy cost difference.

A month after release, people are discovering the value of DeepSeek V4 Flash. Since its release, it has gained usage every week on OpenRouter and is now the top model by tokens (weekly chart).

I have already used over 1B tokens of V4 Flash this month through the official API (not through OpenRouter), and it has cost less than 100 yuan.
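
For a rough sense of scale, the arithmetic behind those two numbers can be sketched in Python. This assumes only the figures quoted in the post (over 1B tokens, under 100 yuan) and treats them as bounds. The function name `cost_per_million` is made up for illustration, and the result is a blended figure across input, cached and output tokens, not an official price.

```python
def cost_per_million(total_cost_yuan: float, total_tokens: float) -> float:
    """Blended cost in yuan per one million tokens."""
    return total_cost_yuan / (total_tokens / 1_000_000)


# Figures from the post, used as bounds: more than 1B tokens, less than 100 yuan.
# Spending at most 100 yuan on at least 1B tokens caps the blended rate.
upper_bound = cost_per_million(100.0, 1_000_000_000)

# prints: blended cost is below 0.10 yuan per million tokens
print(f"blended cost is below {upper_bound:.2f} yuan per million tokens")
```

Since the token count is a floor and the spend is a ceiling, the real blended rate is somewhere under that figure.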
