Artificial Intelligence thread

9dashline · Dec 25, 2024

Looks like DeepSeek is dropping something today

9dashline · Dec 25, 2024

Looks like it's the best at coding, just under o1 high pro mode.

It beats all closed-source non-reasoning models.

Overbom · Dec 25, 2024

Huge. On LiveBench, DeepSeek V3 is the second-best non-reasoning model (after gemini-exp-1206) and the best open-source model.

Can't wait to see the reasoning variant of this.

tphuang · Dec 25, 2024

It's actually illogical to keep lumping foundational models together with reasoning models. You can't get high-speed responses from reasoning models, so having a good foundational model is really important.

The fact that they can build such a great foundational model and release it as open source is a big deal, since it's so much better than Llama 3.1.

I wish the Alibaba team would put out a higher-parameter version of Qwen for public use.

9dashline · Dec 25, 2024

Hopefully they release the DeepSeek R1 weights before 2024 is over, and we get the surprise of it being only 16B.

Now that we have an open-source model better than Sonnet 3.5, it will put pressure on Anthropic's Claude to up the ante.

tphuang · Dec 26, 2024

A huge investment by Xiaomi into AI is coming, and this will actually put AI in products rather than just more bullshitting chatbots.

https://twitter.com/i/web/status/1872162380450496860

9dashline · Dec 26, 2024

9dashline said:
Hopefully they release the DeepSeek R1 weights before 2024 is over, and we get the surprise of it being only 16B.

Now that we have an open-source model better than Sonnet 3.5, it will put pressure on Anthropic's Claude to up the ante.

Anthropic is in trouble... DeepSeek V3 is now miles ahead in terms of cost per intelligence, at just 28 cents per million tokens, while Claude's cheapest Haiku model is $5 per million output tokens.

V3 is better than Anthropic's top Claude model, yet the cheapest Claude model, already way worse than DeepSeek V3, costs multiples of what DeepSeek charges for V3.

I'm calling it now: Anthropic is finished, and Amazon's own LLM, the Nova series, weaker than Claude at debut, is now DOA.

I've been using them a lot this year but recently noticed they constantly switch to concise answers to save on costs... They were already in a bad spot before DeepSeek V3; now, with the new Google competition, it's game over.

9dashline · Dec 26, 2024

tphuang said:
A huge investment by Xiaomi into AI is coming, and this will actually put AI in products rather than just more bullshitting chatbots.

https://twitter.com/i/web/status/1872162380450496860

A couple of weeks ago LG released its own LLMs, claimed to be better than Qwen, which turned out to be a complete fraud...

I'm confident Xiaomi will do a lot better.

Overbom · Dec 26, 2024

9dashline said:
Anthropic is in trouble... DeepSeek V3 is now miles ahead in terms of cost per intelligence, at just 28 cents per million tokens, while Claude's cheapest Haiku model is $5 per million output tokens.

V3 is better than Anthropic's top Claude model, yet the cheapest Claude model, already way worse than DeepSeek V3, costs multiples of what DeepSeek charges for V3.

I'm calling it now: Anthropic is finished, and Amazon's own LLM, the Nova series, weaker than Claude at debut, is now DOA.

I've been using them a lot this year but recently noticed they constantly switch to concise answers to save on costs... They were already in a bad spot before DeepSeek V3; now, with the new Google competition, it's game over.

Tbf Anthropic hasn't released anything big for a while. I expect that in the next few months they will release new top models (LLM and reasoning?).

9dashline · Dec 26, 2024

DeepSeek needs to put out a reasoning-tuned version of V3 685B, surpass o1, and catch up with o3.
