Artificial Intelligence thread

OptimusLion

Junior Member
Registered Member
Tsinghua University's open-source project breaks through the large-model compute bottleneck: a single RTX 4090 runs the full-strength DeepSeek-R1

It is very difficult for ordinary users to run the true full-strength version of DeepSeek-R1 on ordinary hardware, and the cost of renting a server is a strain even for developers.



tphuang

General
Staff member
Super Moderator
VIP Professional
Registered Member
you just need to have enough VRAM to hold the model to run it. inference speed is another story.
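The "enough VRAM to hold the model" point can be roughed out with simple arithmetic. A minimal sketch, using the published 671B total parameter count for DeepSeek-R1; the quantization levels are illustrative assumptions, and the numbers ignore KV cache and runtime overhead:

```python
# Back-of-envelope memory needed just to hold DeepSeek-R1's weights.
TOTAL_PARAMS = 671e9  # published total parameter count (MoE)

def weight_footprint_gb(params: float, bits_per_weight: int) -> float:
    """Weight-only footprint in GB; ignores KV cache, activations,
    and framework overhead."""
    return params * bits_per_weight / 8 / 1e9

for bits in (16, 8, 4):
    print(f"{bits}-bit: ~{weight_footprint_gb(TOTAL_PARAMS, bits):.0f} GB")
```

Even at 4-bit that is roughly 336 GB of weights, which is why "fitting the model" on consumer hardware means spilling most of it into system RAM, and why inference speed is a separate question.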
 

Proteus

New Member
Registered Member
Some have shown that you can run the full version on a $2000 rig with a lot of system RAM, it's just pretty slow. Looks like this rig also has a lot of system RAM, so the question is how much does a single 4090 speed up the process?
The post right above yours has the answer.
That's the streaming video test on 14 RTX 3090s.
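The speedup question can be roughed out with memory-bandwidth arithmetic: decode speed on a model this size is mostly bound by how fast weights can be streamed. A minimal sketch, using DeepSeek-R1's published ~37B active parameters per token; the 4-bit quantization and the spec-sheet bandwidth figures are illustrative assumptions, and the numbers are ceilings, not measured throughput:

```python
# Rough decode-speed ceiling for memory-bandwidth-bound inference.
ACTIVE_PARAMS = 37e9    # published active parameters per token (MoE)
BITS_PER_WEIGHT = 4     # assumed quantization

def tokens_per_sec(bandwidth_gb_s: float) -> float:
    """Upper bound assuming every active weight is read once per token."""
    bytes_per_token = ACTIVE_PARAMS * BITS_PER_WEIGHT / 8
    return bandwidth_gb_s * 1e9 / bytes_per_token

# Nominal spec-sheet bandwidths: dual-channel DDR5 vs. an RTX 4090.
for name, bw in [("DDR5 system RAM", 90), ("RTX 4090 GDDR6X", 1008)]:
    print(f"{name}: ~{tokens_per_sec(bw):.1f} tok/s ceiling")
```

In practice the gain depends on how much of the hot path (shared layers, frequently hit experts) actually fits in the 4090's 24 GB, since the rest still streams from system RAM at the slower rate.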
 

tphuang

General
Staff member
Super Moderator
VIP Professional
Registered Member

Sugon has added support for the DeepSeek models in its machines. It also supports V3, Janus Pro, Qwen 2.5, Llama, and ChatGLM, using a domestic x86 CPU and GPGPU.


WeChat is now testing with DeepSeek-R1. Again, this is going to be huge for AI search adoption with R1.


ByteDance's CEO said they did not move fast enough on reasoning models; DeepSeek moved faster and got there first.
 

huemens

Junior Member
Registered Member
A technological miracle! Tsinghua breaks through the large-model compute problem: a single RTX 4090 can run the full version of DeepSeek


I wish they would prioritize domestic hardware for these kinds of efforts rather than improving efficiency on yet more Nvidia products. Since no one other than Chinese researchers is trying to squeeze this much performance out of lower-end hardware, doing this on domestic products would go further toward leveling the playing field between domestic cards and much higher-end Nvidia cards. Kind of like how Huawei has matched the performance of Western 4nm chips by optimizing its designs on a 7nm node beyond anything Western chipmakers, with their access to leading-edge fabs, are even looking into.
 

Randomuser

Captain
Registered Member
It's striking to see so many companies in China adopting and customizing DeepSeek to fit their needs.

Maybe that's the real thing that matters: not how many benchmarks it can beat, but the open-source nature that lets the entire ecosystem adapt to the new technology. Something like this wouldn't happen with OpenAI, because they aren't going to give away their stuff for free, especially when those companies have so much money.
 