Artificial Intelligence thread

siegecrossbow · Jan 23, 2026

Sounds like a major breech of National Security. Why is no one talking about this???

SanWenYu · Jan 23, 2026

siegecrossbow said:
Sounds like a major breech of National Security. Why is no one talking about this???

They are probably afraid of being embarrassed by their funny pronunciation. "Jipu"?!

GulfLander · Jan 24, 2026

antiterror13 said:
Do you have figures for USA, EU, UK, Japan, SK, Canada, Australia?

My guess is USA have ~5,000, EU ~500, Japan ~500, UK ~100 EFLOPS

do you have link of those estimates?

PopularScience · Jan 24, 2026

AAAI 2026 outstanding paper award.
3/5 are Chinese.

Please, Log in or Register to view URLs content!

meedicx · Jan 26, 2026

The time has arrived for pre-Chinese New Year model releases

First up is the new Qwen3-Max-Thinking model from Alibaba with benchmarks on par with the best US models.

Please, Log in or Register to view URLs content!

Most notably, Qwen team seems to have discovered a new test-time scaling technique allowing it to significantly improve performance

bsdnf · Jan 26, 2026

Kimi K2.5 has begun A/B testing

bsdnf · Jan 27, 2026

bsdnf said:
Kimi K2.5 has begun A/B testing
View attachment 168682

well, it's not just a test, it's been officially released.

Eventine · Jan 27, 2026

Released here:

Please, Log in or Register to view URLs content!

Key Features

Native Multimodality: Pre-trained on vision–language tokens, K2.5 excels in visual knowledge, cross-modal reasoning, and agentic tool use grounded in visual inputs.

Coding with Vision: K2.5 generates code from visual specifications (UI designs, video workflows) and autonomously orchestrates tools for visual data processing.

Agent Swarm: K2.5 transitions from single-agent scaling to a self-directed, coordinated swarm-like execution scheme. It decomposes complex tasks into parallel sub-tasks executed by dynamically instantiated, domain-specific agents.

Looks like the Chinese labs outside of Deep Seek are mostly moving to native multimodal models, following Western labs. Of course, Deep Seek v4 could also be a multimodal model, we'll just have to see.

Everybody is rushing to release models before Chinese New Years. Alibaba, Minimax, and now Moon Shot. z.AI released recently as well. That just leaves Deep Seek. If they manage to crash the Western stock market again before Chinese New Years, that'd result in some real fire works. But regardless, it's great to see Chinese models are at least keeping up on the bench marks. The real test, however, will be ecosystem, context length, and continual learning in 2026 - three important bottle necks to current foundation model performance.

mossen · Jan 27, 2026

Having tried the new Kimi K2.5 and Qwen Max-Thinking models I can only say... the gap between Qwen and Kimi is getting larger, not smaller. Kimi is simply in another league. Qwen not only did poorly, it straight-up hallucinated on several of my questions.

K2 was already a good model but it is now clearly great with the 2.5 update. The reasoning performance is on par with Opus 4.5 and GPT5.2-high in my testing.

Of all the major Chinese labs, Moonshot is probably most deserving of being flooded with GPUs and investment. Too bad geopolitics will prevent it. The talent density of the team is astounding.

Hyper · Jan 27, 2026

Eventine said:
Released here:
Please, Log in or Register to view URLs content!

Looks like the Chinese labs outside of Deep Seek are mostly moving to native multimodal models, following Western labs. Of course, Deep Seek v4 could also be a multimodal model, we'll just have to see.

Everybody is rushing to release models before Chinese New Years. Alibaba, Minimax, and now Moon Shot. z.AI released recently as well. That just leaves Deep Seek. If they manage to crash the Western stock market again before Chinese New Years, that'd result in some real fire works. But regardless, it's great to see Chinese models are at least keeping up on the bench marks. The real test, however, will be ecosystem, context length, and continual learning in 2026 - three important bottle necks to current foundation model performance.

Agent Swarm is multi agent reinforcement learning right ?? Should be very compute demanding if so.

Artificial Intelligence thread

siegecrossbow

Field Marshall

SanWenYu

Major

GulfLander

Brigadier

PopularScience

Senior Member

meedicx

Junior Member

bsdnf

Senior Member

bsdnf

Senior Member

Eventine

Senior Member

Key Features

mossen

Senior Member

Hyper

Junior Member

Artificial Intelligence thread

Field Marshall

Major

Brigadier

Senior Member

Junior Member

Senior Member

Senior Member

Senior Member

Key Features​

Senior Member

Junior Member

Key Features