A lot of important stuff is posted in this thread. This is a great idea.
However, I would like to translate, because I feel many posters do not fully understand, therefore they do not appreciate it as much as they should.
Instead of laughing, some wonder what is this. So I would like translate a bit.
There was the CPU the central processing unit, the semiconductor chip for the computer, the brains that adds and subtracts, then stores the results into memory.
After the CPU, some people wanted to play games, so Jensen Huang delivered the GPU the graphics processing unit. This chip was not to replace the CPU chip, it was to help it, so we could play games, back in the day.
Then, as technology evolved, people realized that the GPU was great at training the LLM or the AI models.
That is where we are today.
Now this post by comrade tokenanaylst, shows a new design for a new chip, specifically designed to handle the tokens from Large Language Models.
If we had CPU, the GPU, then this maybe a LPU, whatever, you get the picture.
The GPU was to handle the graphics better so we can play games. Well, maybe not me, but I knew people who played a lot of computer games, they still do too.
What seems curious, as I really do not know much about this stuff even though I am trying to translate it, is that the LLM uses a lot of matrix calculations.
Look at that design of this chip. They design like matrix, to basically handle those LLM matrix calculations.
Whatever goes through the LLM calculation, what comes out is not created equal, so you have those weights. So with a more effecient way of calculating or processing those weights, the more effecient or better your model, aka your LLM, will be.
So this design of this chip, is to improve preformance of the LLM, and it could improve it radically. Maybe we can run even more advanced LLM on the simple cell phone. Like on the cell phone, with this chip. Eventually, that is what someone will want and drive it there.
That is why it is imperative that Jensen Huang and Nvidia has access to the China market. That way he knows what is going on. If someone is designing a chip like this, and you're not, then guess what? Now you're IBM or something.
Some easy reading. Blah blah blah.