Artificial Intelligence thread

BlackWindMnt

Captain
Registered Member
Not sure if you guys were aware, but some developer influencers are streaming a full week of game development 24/7, mostly by prompting Claude, on Twitch and YouTube. Just look for theprimetimeagen/theprimeagen on YouTube and Twitch.
They are all experienced senior developers, so this is a really interesting look at LLM (vibe) coding for all to see.
 

OptimusLion

Junior Member
Registered Member
Improvements in DeepSeek-V3-0324 (the full list is on the official account):

- Reasoning performance: The new V3 model draws on the reinforcement learning techniques used to train DeepSeek-R1, which greatly improves its performance on reasoning tasks; it scores higher than GPT-4.5 on mathematics and code-related evaluation sets.
- Front-end development
- Chinese writing upgrade: The new V3 model is further optimized on top of R1's writing quality, and the quality of medium- and long-form writing in particular is improved.
- Chinese search capability optimization: For report-generation instructions in online search scenarios, the new V3 model produces more detailed and accurate content, with a clearer and more polished layout.
- Others: tool calls, role-playing, Q&A chat

 

AI Scholar

Just Hatched
Registered Member
I don’t believe AI progress is slowing down at all... If you focus solely on China, you’ll see an exceptionally rapid trajectory of advancement. We’ve gone from relatively weak models in 2023 to Qwen2.5 approaching GPT-4’s level last year, and now the latest DeepSeek model has become the top non-reasoning model. What is slowing down is AI progress in the U.S., so I’d argue that China is likely to take the lead by the end of 2025. Once that happens, it won’t look like progress is stagnating anymore, because China will be the one leading all the benchmarks.
 

Eventine

Junior Member
Registered Member
Speaking of AI developments, Gemini 2.5 just dropped.

It claims to have caught up to the state of the art across the board, and it beats Claude 3.7 on certain benchmarks like AIME and Humanity's Last Exam. Not sure how it compares to the new DeepSeek, but it certainly seems to crush the old R1. Oh, and it features the Google-classic 1 million token context window, with a 2 million token window soon to ship. Though of course, we don't know yet how well it deals with those windows beyond just supporting them.

This IS a thinking model, but if Google sticks with their 1,500 free API requests a day, OpenAI and Anthropic are going to be crying themselves to sleep tonight.

As for DeepSeek, I'm looking forward to R2, which, based on the performance of V3.1, should be able to beat the new state of the art, but we'll see.

Competition seems to be heating up again.
 