Artificial Intelligence thread

9dashline

Captain
Registered Member
I know, but if you listen to his interviews it is clear that he is being pressured by shareholders to close-source Llama because of the exponential (?) increase in AI dev costs.

DeepSeek V3 just blew a huge hole in his argument that an infinitely growing multi-billion-dollar investment is necessary.

Similarly for Sam, you can see that OpenAI will get pressured.

In the West, Google seems best positioned going forward on inference and scaling up.
Yup, Google is probably almost gleeful, for now, that the likes of DeepSeek and Qwen are helping it clear out space from its competitors...
 

diadact

New Member
Registered Member
Did DeepSeek use data from Western sources for this, or something?
They didn't post-train it well, primarily due to resource constraints.
The next update (1 or 2 months) will have none of these problems and will crush every LLM available right now.
 

Overbom

Brigadier
Registered Member
They didn't post-train it well, primarily due to resource constraints.
The next update (1 or 2 months) will have none of these problems and will crush every LLM available right now.
Reasoning is the big one to wait for. In their paper they specifically mentioned how their R1 reasoning model helped them a lot in generating synthetic data.

If R1 was that helpful (and I don't rate it too highly, tbh), then I can only imagine the leap forward with an R2 reasoning model fine-tuned on a DeepSeek V3 LLM.
 

luminary

Senior Member
Registered Member

Overbom

Brigadier
Registered Member
an Indian OpenAI employee lol
I guess when you're outcompeted by a factor of 300x, the only thing left to do is cope.

Well, guess what: if you had done GPT o1 correctly, maybe you would have also gotten the o1 visa you want.


When you can't compete, just change the benchmarks:
Something is fishy with the results. Gemini 1.5 Flash and Grok-beta are actually hot garbage.
Also, no way Gemma 2 9B is better.


The whole thing smells. Smells bad.
 