Artificial Intelligence thread

tphuang

Lieutenant General
Staff member
Super Moderator
VIP Professional
Registered Member
it's actually illogical to continue to put foundational models together with reasoning models. You can't get high speed responses with reasoning models. Having good foundational model is really important.

The fact they can get such a great foundational model and put it on open source is a big deal. Since it's so much better than Llama 3.1.

I wish Alibaba team can put out a higher parameter version of Qwen for public use.
 

9dashline

Captain
Registered Member
Hopefully before 2024 is over they release deepseek r1 weights, and we get a surprise of only 16b

Now that we have open source better than sonnet 3.5 it will put pressure on claude anthropic to up the ante.
 
Last edited:
Top