This is insanity. It is trained with 10% of the compute it took to train Llama 405B.

From their paper, training took 2.788M H800 GPU hours. For a 2-month training run, that's roughly only 2,000 H800s.
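A quick sanity check on that figure (assuming the "2 months" means roughly 60 days; the Llama 405B comparison uses Meta's reported ~30.8M GPU-hours, which is an assumption here, not something stated in this thread):

```python
# Sanity check: DeepSeek's reported 2.788M H800 GPU-hours over a ~2-month run.
total_gpu_hours = 2.788e6        # figure quoted from the DeepSeek paper
run_hours = 60 * 24              # assumption: ~60-day training run
gpus = total_gpu_hours / run_hours
print(round(gpus))               # -> 1936, i.e. roughly 2,000 H800s

# Rough compute ratio vs. Llama 405B (assumed ~30.8M GPU-hours per Meta's model card)
llama_gpu_hours = 30.8e6
print(f"{total_gpu_hours / llama_gpu_hours:.0%}")  # on the order of 10%
```

Note this ignores differences between H800 and H100 per-GPU throughput, so the ratio is only a back-of-the-envelope figure.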
Some o3 commentary:
Basically, it seems the test creator is an OpenAI shill who intentionally omitted a few things that put the credibility of the test in jeopardy.
Hearing quite a few US analysts + engineers confused and coping this morning.
wait, this is that annihilation dude? lmao
Why doesn't the CCP force Huawei or Alibaba to give DeepSeek more GPUs? Seems like they are doing very well.

The CCP does not micromanage the nation at this level.
I had a Sora feeling about o3... it was supposed to be part of "shipmas", but not only was o3 not shipped, it was announced with only benchmarks and not even a live demonstration. The only reason Altman would do that is because he had something to hide and was in fake-it-till-you-make-it mode; otherwise he would be bragging with real live examples instead of just a rigged chart / fake benchmark. And they even told the ARC-AGI guy not to disclose how much compute was used or the hardware/cost, lol... probably millions of dollars for a fake dog and pony show.
And after I purchased o1 pro mode for $200, I really expected, wanted, and hoped that the pro version of Sora would be way better than Kling, but in all my tests it's worse than Kling...
Yes, I saw that, but it's still quite unbelievable to do this.