Artificial Intelligence thread

Engineer · Dec 31, 2024

Fatty said:
https://twitter.com/i/web/status/1873986764941255171

imo if China can corner the AI markets in developing countries by offering cheap alternatives, even if the models are not as good, it will be a very good thing.

Hopefully Chinese companies can also expand this business into countries like Russia that would want their LLMs and other AI agents to not be trained on western-biased sources.

That's nice and all, but the question is how to access it along with other Chinese LLMs for someone living in the West and depends on sevices in English.

Hyper · Dec 31, 2024

Only DeepSeek is profitable at low prices. Others are losing money. Source is DeepSeek CEO in an interview.

tokenanalyst · Dec 31, 2024

Hyper said:
Only DeepSeek is profitable at low prices. Others are losing money. Source is DeepSeek CEO in an interview.

While big AI companies are trying to create huge models consuming billions in GPUs benefiting Nvidia pockets DeepSeek has really made use of innovation to create these models for cheap using millions instead of billions, so cheap in fact that they can open source the model AND STILL make money, is incredible. If more efficient AI models become the norm Nvidia could be in deep trouble.

Overbom · Dec 31, 2024

Engineer said:
That's nice and all, but the question is how to access it along with other Chinese LLMs for someone living in the West and depends on sevices in English.

OpenRouter offers access to the popular DeepSeek and Qwen Chinese LLM at competitive prices

Engineer · Dec 31, 2024

Overbom said:
OpenRouter offers access to the popular DeepSeek and Qwen Chinese LLM at competitive prices

I should clarify that the question is about accessing Chinese LLMs directly on Chinese servers without going through middlemen in the West.

gpt · Dec 31, 2024

9dashline said:
by the anglo logic with NGAD, if deepseek v3 is what CCP allows to be shown to the world, just imagine what they actually have in secret.

does anyone really believe China would pin its AGI hopes solely on quant guys who only have 2000 gimped GPUs and a 5 mil training budget?

https://twitter.com/i/web/status/1873408599868747801

I think it's important to get a practical definition of AGI first: a highly autonomous system that outperforms humans on most economically valuable (ie. useful) tasks. What a lot of techbros and laypeople are thinking fall more into the category of ASI.

With that in mind, several firms (incl. chinese) have actually built tools internally that scale, not quite infinitely, but close enough and they seem to have reached above human performance on most tasks - unfortunately at the cost of being much more expensive than hiring a few thousand humans to do it.

I talked to some folks who did work around this last year and the consensus seems to be there is no limit to how smart you could get a swarm of agents using different base models at the bottom end, if you don't mind burning through GPU cycles . At the time even this was a completely open question. It's still the case that no one has build an interactive system that _really_ scales - even the startups and off the record conversations I've had with people in these companies say that they are still using Python across a single data center.

IMO AGI the way it is defined above is now no longer a dream but a question of if we want to:
1). Start building nuclear power plants like it's 1950
2). Wait and hope that Moore's law keeps applying to GPUs until the cost of something like o3 drops to something affordable

Broader AI community haven't fully understood potential of agent swarms but top firms are deep into it already, and yes, some chinese researchers are well aware of it. Future breakthroughs such as getting AI compute to consume 1% of the energy it does today would be extremely significant.

Overbom · Dec 31, 2024

gpt said:
Broader AI community haven't fully understood potential of agent swarms but top firms are deep into it already, and yes, some chinese researchers are well aware of it

AI community expects 2025 to be the year of the AI agents

In other news, Xiaomi has hired (poached?..) one of DeepSeek key founders

https://twitter.com/i/web/status/1873591219017945580

tphuang · Dec 31, 2024

https://twitter.com/i/web/status/1874206239481356579

CNBC people gets it finally. Once you open source and commoditize leading edge AI, these tech companies who built their valuation on frontier tech are really not worth much.

Fatty · Jan 1, 2025

https://twitter.com/i/web/status/1874196675130253512

this is a very good point I saw on X. It is not an exaggeration to say a lot of companies and funds have gone “all in” on generative AI. Just screwing some of them over can have a ripple effect on the US financial industry and overall economy

GulfLander · Jan 1, 2025

tphuang said:
https://twitter.com/i/web/status/1874206239481356579

CNBC people gets it finally. Once you open source and commoditize leading edge AI, these tech companies who built their valuation on frontier tech are really not worth much.

The woman interviewed by cnbc:

Please, Log in or Register to view URLs content!

Deirdre Bosa is the anchor of “TechCheck,” CNBC’s tech-industry franchise. Prior to this, she was anchor of “TechCheck” in its daily program format that ran from 2021 to 2023. Previously, Bosa was a technology correspondent and contributor reporting out of Vancouver, Canada, for the network. She joined CNBC in 2012, covering the markets and economies of London and Singapore and co-anchoring “Squawk Box Asia,” “Squawk Box Europe,” and “Worldwide Exchange.” Earlier, Bosa was an anchor and reporter for CCTV News International based in Beijing and a contributor to Fox Business News. She began her career working for multinational corporations Barrick Gold in Toronto and Rio Tinto in Shanghai.

Artificial Intelligence thread

Engineer

Major

Hyper

Junior Member

tokenanalyst

Lieutenant General

Overbom

Brigadier

Engineer

Major

gpt

Junior Member

Overbom

Brigadier

tphuang

General

Fatty

Junior Member

GulfLander

Brigadier