Artificial Intelligence thread

tphuang

Lieutenant General
Staff member
Super Moderator
VIP Professional
Registered Member
Looks like the MiniCPM launch was overshadowed by the DeepSeek launch

A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

MiniCPM-o 2.6

MiniCPM-o 2.6 is the latest and most capable model in the MiniCPM-o series. The model is built in an end-to-end fashion based on SigLip-400M, Whisper-medium-300M, ChatTTS-200M, and Qwen2.5-7B with a total of 8B parameters. It exhibits a significant performance improvement over MiniCPM-V 2.6, and introduces new features for real-time speech conversation and multimodal live streaming. Notable features of MiniCPM-o 2.6 include:

- Leading Visual Capability. MiniCPM-o 2.6 achieves an average score of 70.2 on OpenCompass, a comprehensive evaluation across 8 popular benchmarks.
- State-of-the-art Speech Capability. MiniCPM-o 2.6 supports bilingual real-time speech conversation with configurable voices in English and Chinese.
- Strong Multimodal Live Streaming Capability. As a new feature, MiniCPM-o 2.6 can accept continuous video and audio streams independent of user queries, and supports real-time speech interaction.
- Strong OCR Capability and Others. Advancing the popular visual capabilities of the MiniCPM-V series, MiniCPM-o 2.6 can process images with any aspect ratio and up to 1.8 million pixels (e.g., 1344x1344).
- Superior Efficiency. In addition to its friendly size, MiniCPM-o 2.6 also shows state-of-the-art token density (i.e., number of pixels encoded into each visual token).
- Easy Usage. MiniCPM-o 2.6 can be easily used in various ways: (1) llama.cpp support for efficient CPU inference on local devices, (2) int4 and GGUF format quantized models in 16 sizes, (3) vLLM support for high-throughput and memory-efficient inference, (4) fine-tuning on new domains and tasks with LLaMA-Factory, (5) quick local WebUI demo setup with Gradio, and (6) an online web demo on a server.
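The token-density claim in the efficiency bullet can be made concrete. A minimal sketch of the metric, assuming an image at the stated 1.8-million-pixel limit (1344x1344) and a per-image visual-token budget of 640 — the budget is an assumption here, borrowed from the figure published for the predecessor MiniCPM-V 2.6:

```python
# Token density = pixels encoded into each visual token (higher = more
# image information per token fed to the LLM, so cheaper inference).
def token_density(width: int, height: int, num_visual_tokens: int) -> float:
    """Return pixels per visual token for one encoded image."""
    return (width * height) / num_visual_tokens

# Assumed numbers: 1344x1344 is from the post; 640 tokens is the
# MiniCPM-V 2.6 figure, used here only for illustration.
density = token_density(1344, 1344, 640)
print(round(density))  # -> 2822 pixels per visual token
```

Under these assumptions, one visual token carries roughly a 53x53-pixel patch worth of image, which is what lets an 8B model process large images cheaply on-device.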

There are in fact several releases out of China today that aren't from DeepSeek.

I already posted one, but ByteDance had more AI updates, and SenseTime also had a big update.

But who really cares about those at this point?

DeepSeek just wrecked the entire Silicon Valley AI ecosystem by giving power to the people and taking it away from these tech overlords.

What is the justification now for buying 1 million H100s or investing in huge data centers? You are going to be competing against people building distilled reasoning models off Llama, DeepSeek, Qwen and other open-source models.

What is the justification for Nvidia's continued valuation? Or for any of the other players like TSMC and SK Hynix?
 

Fatty

Junior Member
Registered Member
LLMs are just bells and whistles. Robotics and computer vision are what will actually generate real value, imo. DeepSeek is what's visible now, but I think companies like Unitree are what will actually destroy Silicon Valley.
 

tphuang

Lieutenant General
Staff member
Super Moderator
VIP Professional
Registered Member
you have no idea what you are talking about.
 