Artificial Intelligence thread

tokenanalyst

Brigadier
Registered Member

Liu Qingfeng: iFLYTEK will be the national team for full-stack autonomous and controllable large models


Liu Qingfeng, Chairman of iFLYTEK, said at the company's annual meeting that, facing limited computing power, high demands, and fierce competition, iFLYTEK has on the one hand steadily upgraded its autonomous and controllable general-purpose large model base, and on the other hand pushed large model applications into deployment through GBC linkage. iFLYTEK Spark now offers a complete set of solutions from top-level planning to implementation.

Liu Qingfeng set out the strategic niche of the iFLYTEK Spark large model: adhere to autonomy and controllability, meet the essential needs of people's livelihood, serve national strategic needs, and empower scientific research and production. "Focus on autonomous and controllable base capabilities and key application tracks, and maintain strategic focus with a long-term mindset." He said that in the new stage of general artificial intelligence development, iFLYTEK will become the national team for full-stack autonomous and controllable large models.

Previously, after its release, iFLYTEK Spark 4.0 Turbo surpassed GPT-4o in mathematical and coding capabilities. On 14 mainstream test sets published by authoritative domestic and international organizations such as the Allen Institute for Artificial Intelligence and OpenAI, covering task types such as understanding and reasoning, comprehensive exams, mathematics and science, and coding, iFLYTEK Spark 4.0 Turbo surpassed GPT-4o on 9, with a relative efficiency improvement of 50%.

According to a comparison of results on the HumanEval code generation test set, iFLYTEK Spark 4.0 Turbo trails GPT-4o slightly on Python, Java, and JavaScript tasks and exceeds it on C++. In real application scenarios, on the iFLYCode-Eval-2.0 practical coding test set built by the National Key Laboratory of Cognitive Intelligence, iFLYTEK Spark 4.0 Turbo surpassed GPT-4o in tasks such as code generation, code error detection, and unit testing. At the same time, the Spark Code 7B version was launched for tasks that need extremely fast responses, such as code generation and code completion, with the best results in the industry.


 

tphuang

Lieutenant General
Staff member
Super Moderator
VIP Professional
Registered Member

pbd456

Junior Member
Registered Member
I am new to this thread. I finally downloaded DeepSeek and used it to generate a backtracking program to find some (known) combinatorial structures. Pretty impressive: whatever I suggest, it figures out why I am asking (such as better pruning, or generalisation to other structures), which results in a better C program that I can run on my phone.

Not really sure what I can do with it, as I stopped writing papers for a living.

Also, it is interesting to me that AIME was used as a benchmark, as I personally took the AIME over 30 years ago.
 

siegecrossbow

General
Staff member
Super Moderator
Been thinking.

With Deepseek coming out, doesn't that mean the whole AI thing hyped by techbros and H1B Indians is basically a huge ass scam?

Like why do we need all this money and immigration when some literal who guys back in China did it for just $5 million on some old GPUs?

Honestly, DeepSeek's implications are not being discussed enough given this fact.

More like incompetence.
 

tokenanalyst

Brigadier
Registered Member
What hardware are you using to run this model? Full or quants?
 

pbd456

Junior Member
Registered Member
I'm running on my phone, which cost less than 100 USD. I am not finding anything new at all; it is just a test to see how well it codes. If you care, it found a cyclic BIBD(73,4,1) using backtracking. DeepSeek generated the C code. I installed Termux and other packages to compile and run it on my Poco phone.
I used to find a lot of designs by computer, but lost interest.
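
For readers unfamiliar with the object mentioned above: a cyclic BIBD(73,4,1) can be built from 6 base blocks in Z_73 whose pairwise differences cover every nonzero residue exactly once (a difference family). The sketch below is not the C program DeepSeek produced, which is not shown in the thread. It is a minimal Python illustration of one possible backtracking approach, and all function names are made up for this example.

```python
def block_differences(block, v):
    """All ordered differences (x - y) mod v between distinct points of a block."""
    return [(x - y) % v for x in block for y in block if x != y]


def is_difference_family(blocks, v):
    """True if every nonzero residue mod v occurs exactly once as a difference."""
    seen = sorted(d for b in blocks for d in block_differences(b, v))
    return seen == list(range(1, v))


def find_difference_family(v, k):
    """Backtracking search for base blocks of a cyclic (v, k, 1) design."""
    if (v - 1) % (k * (k - 1)) != 0:
        return None  # the number of base blocks would not be an integer
    n_blocks = (v - 1) // (k * (k - 1))
    used = [False] * v  # used[d] is True once difference d is covered
    blocks = []

    def extend(block):
        # Add points in increasing order; prune as soon as a difference repeats.
        if len(block) == k:
            blocks.append(tuple(block))
            if search():
                return True
            blocks.pop()
            return False
        for p in range(block[-1] + 1, v):
            new = []
            for q in block:
                new += [(p - q) % v, (q - p) % v]
            if len(set(new)) == len(new) and not any(used[d] for d in new):
                for d in new:
                    used[d] = True
                block.append(p)
                if extend(block):
                    return True
                block.pop()
                for d in new:
                    used[d] = False
        return False

    def search():
        if len(blocks) == n_blocks:
            return True
        # Some block must cover the smallest uncovered difference d;
        # after translation that block contains the points 0 and d.
        d = next(x for x in range(1, v) if not used[x])
        used[d] = used[v - d] = True
        if extend([0, d]):
            return True
        used[d] = used[v - d] = False
        return False

    return blocks if search() else None
```

For example, find_difference_family(13, 4) returns [(0, 1, 3, 9)], and is_difference_family can check any candidate independently. Calling find_difference_family(73, 4) targets the case from the post, though the running time for that case is not checked here.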
 

pbd456

Junior Member
Registered Member
I was just emailing my buddy (a full professor in the math department; we have over 20 papers together) that DeepSeek would do well on two tasks that I suck at: 1) writing programs to generate structures (I know the algorithm but am too lazy to write it unless I have to), and 2) writing papers, because my grammar sucks.

Anyway, I quit my tenured professor job in spring 2020 during COVID-19.
 

9dashline

Captain
Registered Member
R1 is a competitor to o1; it's not meant to run inference on phones...

Both R1 and o1 are way stronger than Llama 405B

Yet Llama 405B requires 16x H100 GPUs to run... that's upwards of 1 million dollars

4 orders of magnitude more than your phone hardware cost

I suggest you get the deepseek.apk and use that instead
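
A rough sanity check of the numbers above, under assumptions that are not stated in the post: 16-bit weights (2 bytes per parameter), 80 GB of memory per H100, and the post's own round figures of 1 million dollars and 100 dollars. It counts weights only and ignores the KV cache and activations, so it is a lower bound, not a sizing guide. The helper names are made up for this example.

```python
import math


def weight_size_gb(n_params, bytes_per_param=2):
    """Memory for the weights alone, in GB (assumes 16-bit weights by default)."""
    return n_params * bytes_per_param / 1e9


def min_gpus_for_weights(weight_gb, gpu_memory_gb=80):
    """Smallest GPU count whose combined memory holds the weights (80 GB cards assumed)."""
    return math.ceil(weight_gb / gpu_memory_gb)


def orders_of_magnitude(cost_a, cost_b):
    """Base-10 logarithm of the cost ratio."""
    return math.log10(cost_a / cost_b)


weights = weight_size_gb(405e9)                  # 405B parameters
print(weights)                                   # GB needed for weights
print(min_gpus_for_weights(weights))             # lower bound on 80 GB GPUs
print(orders_of_magnitude(1_000_000, 100))       # cluster cost vs. phone cost
```

Under these assumptions the weights come to 810 GB, which is more than a single 8-GPU node with 640 GB can hold, so the 16-GPU figure is plausible, and the cost ratio of 10,000 is indeed 4 orders of magnitude.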
 