Artificial Intelligence thread

tokenanalyst · Jan 25, 2025

Liu Qingfeng: iFLYTEK will be the national team for full-stack autonomous and controllable large models

Liu Qingfeng, Chairman of iFLYTEK, revealed at the company's annual meeting that in the face of an environment with limited computing power, high demands and fierce competition, iFLYTEK has, on the one hand, unswervingly promoted the upgrade of its autonomous and controllable general large model base, and on the other hand, promoted the implementation of large model applications through GBC linkage. iFLYTEK Spark currently has a complete set of solutions from top-level planning to implementation.

Liu Qingfeng clarified the strategic ecological niche of iFlytek Spark Big Model: adhere to independent control, meet the rigid needs of people's livelihood, meet national strategic needs, and enable scientific research and production. "Focus on independent and controllable base capabilities and key application tracks, and maintain strategic focus with a long-term spirit." He revealed that in the new stage of general artificial intelligence development, iFlytek will become a national team for full-stack independent and controllable big models.
Previously, after the release of iFlytek Spark 4.0 Turbo, it surpassed GPT-4o in mathematical and coding capabilities. Released by authoritative domestic and foreign organizations such as the Allen Institute for Artificial Intelligence and OpenAI, iFlytek Spark 4.0-Turbo surpassed GPT-4o in 9 of the 14 mainstream test sets covering different task types such as understanding and reasoning, comprehensive exams, mathematics and science, and coding, with a relative efficiency improvement of 50%.

According to the comparison of the results on the code generation HumanEval test set, iFlytek Spark 4.0 Turbo has a slight gap with GPT-4o in tasks such as Python, Java, and JavaScript, and exceeds GPT-4o in C++ capabilities. In real application scenarios, based on the iFLYCode-Eval-2.0 code practical scenario test set built by the National Key Laboratory of Cognitive Intelligence, iFlytek Spark 4.0 Turbo has surpassed GPT-4o in tasks such as code generation, code error detection, and unit testing. At the same time, the Spark Code 7B version was launched to meet the needs of extremely fast response tasks such as code generation and code completion, with the best results in the industry.

Please, Log in or Register to view URLs content!

tokenanalyst · Jan 25, 2025

Please, Log in or Register to view URLs content!

tphuang · Jan 25, 2025

tokenanalyst said:
View attachment 144379

Please, Log in or Register to view URLs content!

it's very interesting to me that they have Russian names in there. It would make sense for Russians to use Ascend NPUs.

although on the content of the paper itself,,I'm not actually sure what you get out of FP8

pbd456 · Jan 25, 2025

I am new to this thread. Finally download deepseek and use it to generate some backtracking program to find some (known) combinatorial structures. Pretty impressive that whatever i suggest, it figures out why i ask (such as better pruning, or generalisation to other structures) which result in a better c program that i can run on my phone.

Not really sure what i can do with it really as i stop writing papers for living.

Also, it is interesting to me that AIME was used as a benchmark as i personally did AIME over 30 years ago.

siegecrossbow · Jan 25, 2025

Randomuser said:
Been thinking.

With Deepseek coming out, doesn't that mean the whole AI thing hyped by techbros and H1B Indians is basically a huge ass scam?

Like why do we need all this money and immigration when some literal who guys back in China did it for just $5 million on some old GPUs?

Honestly Deepseek implications are not been discussed hard enough given this fact.

More like incompetence.

tokenanalyst · Jan 25, 2025

pbd456 said:
I am new to this thread. Finally download deepseek and use it to generate some backtracking program to find some (known) combinatorial structures. Pretty impressive that whatever i suggest, it figures out why i ask (such as better pruning, or generalisation to other structures) which result in a better c program that i can run on my phone.

Not really sure what i can do with it really as i stop writing papers for living.

Also, it is interesting to me that AIME was used as a benchmark as i personally did AIME over 30 years ago.

What hardware you using to run this model? Full or Quants?

pbd456 · Jan 25, 2025

tokenanalyst said:
What hardware you using to run this model? Full or Quants?

I m running on my less than 100 usd phone. I am not finding anything new at all. Just a test to see how well it codes. If you care, it found a cyclic bibd(73,4,1) using backtracking. Deepseek generated the c code. I install termux, and other packages to compile and run on my poco phone.
I used to find a lot of designs on computer, but lost interest.

pbd456 · Jan 25, 2025

I was just emailing my buddy (full prof in mat dept and we had over 20 papers together) that deepseek would do well on two tasks that i suck at. 1) write programs to generate structure (i know the algorithm but too lazy to write unless i have to) 2) write papers because my grammar sucks.

Anyways i quit my tenured professor job in spring 2020 during covid19.

9dashline · Jan 25, 2025

pbd456 said:
I m running on my less than 100 usd phone. I am not finding anything new at all. Just a test to see how well it codes. If you care, it found a cyclic bibd(73,4,1) using backtracking. Deepseek generated the c code. I install termux, and other packages to compile and run on my poco phone.
I used to find a lot of designs on computer, but lost interest.

R1 is competitor to o1, its not meant to be ran inference on phones...

Both R1 and o1 way stronger than Llama 405b

Yet Llama 405b requires 16x H100 GPUs to run... thats upwards of 1 million dollars

4 orders of magntitude your phone hardware cost

I suggest you get the deepseek.apk and use that instead

tphuang · Jan 25, 2025

https://twitter.com/i/web/status/1883228769701290440

inference cost for DeepSeek according to various host. Notice how DeepSeek's own server is so much lower?

https://twitter.com/i/web/status/1882963584042651978

And yes, people are already shifting R1 because you can do it offline.

Artificial Intelligence thread

tokenanalyst

Lieutenant General

Liu Qingfeng: iFLYTEK will be the national team for full-stack autonomous and controllable large models

tokenanalyst

Lieutenant General

tphuang

General

pbd456

Junior Member

siegecrossbow

Field Marshall

tokenanalyst

Lieutenant General

pbd456

Junior Member

pbd456

Junior Member

9dashline

Captain

tphuang

General

Artificial Intelligence thread

Lieutenant General

Liu Qingfeng: iFLYTEK will be the national team for full-stack autonomous and controllable large models​

Lieutenant General

General

Junior Member

Field Marshall

Lieutenant General

Junior Member

Junior Member

Captain

General

Liu Qingfeng: iFLYTEK will be the national team for full-stack autonomous and controllable large models