This, it's been my experience since the early 2000s that western forums are just.... dumber.I think this just show the superiority of Chinese netizens over Western netizens. Garbage in Garbage out.
This, it's been my experience since the early 2000s that western forums are just.... dumber.I think this just show the superiority of Chinese netizens over Western netizens. Garbage in Garbage out.
Seems India is quite happy with DeepSeek open free model. Since it now enables India to get into the AI game for little cost which was something that was almost insurmontable for them before this.India lauds Chinese AI lab DeepSeek, plans to host its models on local servers
The Chinese language itself definitely plays a role here, given how transformers and attention (basic building blocks of LLM) work. But the liveliness of available Chinese content, i.e., how creatively the Chinese people use the language, might be an even bigger factor.Some folks were discussing the linguistics differences between Chinese and English earlier. Here's a thread with several examples of Deepseek R1 being more eloquent in Chinese than English.
Also nice to see 红楼梦 referenced.
I am somewhat familiar with this space. Short answer is that hedge funds are very willing to pay for research to find edge in the markets.So I've been thinking, we know Deepseek is the side project of HiFly fund and the founder had to pitch the idea to his partners. But what was his pitch?
Is it 1: I want to spend our resources (idle or not), pay for the extra power bill, redirect our existing people and hire new talent, so we can develop the worlds best model and give it to everyone for free, because we are a hedge fund and charity is our thing.
Or is it 2: same as one, but add "Americans are in an AI bubble and this will allow our fund to make more money than ever by shorting it"
It's not creating the model to create competitive advantage that's interestingI am somewhat familiar with this space. Short answer is that hedge funds are very willing to pay for research to find edge in the markets.
Some of the best weather models, for example, are at certain hedge funds. They pay meteorologists and physicists top dollar to help them generate edge in the commodities markets.
Wall street has also been using NLP models on earnings data for decades, so it’s not like LLMs are that big of a leap from normal hedge fund research. Furthermore, I have read that high flyer already has been using AI/ML techniques in their strategies already, so it probably wasn’t that big of a sell, especially since idle GPUs are considered a loss due to depreciation.
The whole “make money by releasing model then short the markets” seems like a big reach to me.
Deepseek is literally everywhere now... I never seen any AI model get adopted this quick. This is the real AI Diffusion...