Artificial Intelligence thread

Eventine

Senior Member
Registered Member
Released here:
Please, Log in or Register to view URLs content!


Key Features​

  • Native Multimodality: Pre-trained on vision–language tokens, K2.5 excels in visual knowledge, cross-modal reasoning, and agentic tool use grounded in visual inputs.
  • Coding with Vision: K2.5 generates code from visual specifications (UI designs, video workflows) and autonomously orchestrates tools for visual data processing.
  • Agent Swarm: K2.5 transitions from single-agent scaling to a self-directed, coordinated swarm-like execution scheme. It decomposes complex tasks into parallel sub-tasks executed by dynamically instantiated, domain-specific agents.

Looks like the Chinese labs outside of Deep Seek are mostly moving to native multimodal models, following Western labs. Of course, Deep Seek v4 could also be a multimodal model, we'll just have to see.

Everybody is rushing to release models before Chinese New Years. Alibaba, Minimax, and now Moon Shot. z.AI released recently as well. That just leaves Deep Seek. If they manage to crash the Western stock market again before Chinese New Years, that'd result in some real fire works. But regardless, it's great to see Chinese models are at least keeping up on the bench marks. The real test, however, will be ecosystem, context length, and continual learning in 2026 - three important bottle necks to current foundation model performance.
 

mossen

Senior Member
Registered Member
Having tried the new Kimi K2.5 and Qwen Max-Thinking models I can only say... the gap between Qwen and Kimi is getting larger, not smaller. Kimi is simply in another league. Qwen not only did poorly, it straight-up hallucinated on several of my questions.

K2 was already a good model but it is now clearly great with the 2.5 update. The reasoning performance is on par with Opus 4.5 and GPT5.2-high in my testing.

Of all the major Chinese labs, Moonshot is probably most deserving of being flooded with GPUs and investment. Too bad geopolitics will prevent it. The talent density of the team is astounding.
 

Hyper

Junior Member
Registered Member
Released here:
Please, Log in or Register to view URLs content!




Looks like the Chinese labs outside of Deep Seek are mostly moving to native multimodal models, following Western labs. Of course, Deep Seek v4 could also be a multimodal model, we'll just have to see.

Everybody is rushing to release models before Chinese New Years. Alibaba, Minimax, and now Moon Shot. z.AI released recently as well. That just leaves Deep Seek. If they manage to crash the Western stock market again before Chinese New Years, that'd result in some real fire works. But regardless, it's great to see Chinese models are at least keeping up on the bench marks. The real test, however, will be ecosystem, context length, and continual learning in 2026 - three important bottle necks to current foundation model performance.
Agent Swarm is multi agent reinforcement learning right ?? Should be very compute demanding if so.
 
Top