Artificial Intelligence thread

tphuang · Sep 25, 2025

siegecrossbow said:
Heard that despite great stats all around Qwen is pretty bad at programming tasks.

There is actually a pretty good way for you to verify these claims: you can just test it out on their website. I have recently switched from DeepSeek to Qwen for my work-related questions.

mossen · Sep 25, 2025

Important to be skeptical with doomer takes, even if risks are non-trivial.

Hinton has been pretty notorious for years but there is rarely any follow-up. Glad people are taking a more critical stance.

https://twitter.com/i/web/status/1971173766106587321

---

I tried the Qwen-3 MAX model. I asked it a very simple question about local traffic here in Sweden. It got basic things badly wrong. This is a 1 trillion parameter model.

I asked Claude Opus 4.1 (non-thinking) without search the same thing and it 1-shotted the task. The Qwen team still has some ways to go.

The task I gave was not advanced in any way. At that size, the model should have enough world knowledge to get it right. Even when I turned on search, Qwen 3 MAX couldn't complete the task, although it improved and got some aspects right.

In my casual testing, DeepSeek is still the best Chinese lab. Moonshot Kimi takes the number two slot.

tamsen_ikard · Sep 25, 2025

mossen said:
In 2016, Geoffrey Hinton warned students not to train as radiologists, the field was so ripe for AI automation.

Today, there are more new radiologist jobs than ever and radiologist wages are up 48%. Yet AI *has* exploded in the field.

So what happened? pic.twitter.com/0Q1XAXm5PL
Sam Bowman (@s8mb), September 25, 2025
https://twitter.com/i/web/status/1971173766106587321

This is probably what happens in every field: AI doesn't really kill jobs, it just makes them easier and lets you do things faster.

I think all this doom and gloom about AI killing jobs will turn out to be wrong in hindsight.

tphuang · Sep 25, 2025

I can tell you right now that AI is killing job demand for entry-level software programmers.

Hyper · Sep 25, 2025

tphuang said:
I can tell you right now that AI is killing job demand for entry level software programmers.

Yeah, Radiology is not a good comparison since healthcare has to be well regulated.

Eventine · Sep 25, 2025

Hyper said:
Yeah, Radiology is not a good comparison since healthcare has to be well regulated.

There’s also the issue of data. There’s only so much labeled radiology data out there and you can’t generate more without radiologists and clinical studies.

You can easily generate more programming data, because you can always just run the code to see if it works, so as long as you have compute, you can generate an infinite amount of training data for programming.
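To make the "run the code to see if it works" idea concrete, here is a minimal sketch in Python. Everything in it is an assumption for illustration: the candidate strings stand in for model-generated solutions, and the function names `passes_tests` and `build_dataset` are made up. It is not how any particular lab does it, and a real pipeline would need proper sandboxing.

```python
# Hypothetical candidates standing in for model-generated solutions
# to the toy task "write add(a, b)".
CANDIDATES = [
    "def add(a, b):\n    return a + b",   # correct
    "def add(a, b):\n    return a - b",   # runs, but gives wrong answers
    "def add(a, b):\n    return a +",     # does not even compile
]

# Test cases as (arguments, expected result) pairs.
TESTS = [((1, 2), 3), ((0, 0), 0), ((-1, 5), 4)]


def passes_tests(source, tests):
    """Execute one candidate and report whether it passes every test."""
    namespace = {}
    try:
        # NOTE: exec is unsandboxed here; only acceptable for trusted toy input.
        exec(source, namespace)
        func = namespace["add"]
        return all(func(*args) == expected for args, expected in tests)
    except Exception:
        # Syntax errors, missing functions and runtime errors all count as failure.
        return False


def build_dataset(candidates, tests):
    """Keep only the candidates whose behaviour was verified by execution."""
    return [src for src in candidates if passes_tests(src, tests)]


verified = build_dataset(CANDIDATES, TESTS)
print(len(verified), "of", len(CANDIDATES), "candidates kept")
```

The point is that the label (pass or fail) comes from the interpreter rather than from a human expert, which is what radiology data lacks. The tests still have to come from somewhere, so "infinite" is only as good as the checks you run.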

Hyper · Sep 25, 2025

Eventine said:
There’s also the issue of data. There’s only so much labeled radiology data out there and you can’t generate more without radiologists and clinical studies.

You can easily generate more programming data, because you can always just run the code to see if it works, so as long as you have compute, you can generate an infinite amount of training data for programming.

And also privacy concerns. Healthcare data is thoroughly protected.

daifo · Sep 25, 2025

siegecrossbow said:
Heard that despite great stats all around Qwen is pretty bad at programming tasks.

I haven't gone too deep with these AI tools, but Qwen has a very generous free CLI tool similar to Gemini CLI or Claude Code.

Eventine · Sep 25, 2025

mossen said:
Important to be skeptical with doomer takes, even if risks are non-trivial.

Hinton has been pretty notorious for years but there is rarely any follow-up. Glad people are taking a more critical stance.

https://twitter.com/i/web/status/1971173766106587321

---

I tried the Qwen-3 MAX model. I asked it a very simple question about local traffic here in Sweden. It got basic things badly wrong. This is a 1 trillion parameter model.

I asked Claude Opus 4.1 (non-thinking) without search the same thing and it 1-shotted the task. The Qwen team still has some ways to go.

The task I gave was not advanced in any way. At that size, they should have enough world knowledge to get it right. Even when I turned on search, Qwen 3 MAX couldn't complete the task even if it improved and got some aspects right.

In my casual testing, Deepseek is still the best Chinese lab. Moonshot Kimi takes the number two slot.

I tend to agree that no Chinese model is yet at the level of Claude at agentic programming; likely there's some ingredient that is still missing.

But at the same time, I don't think Chinese models targeting the domestic market will need to worry about competing with Anthropic in the near future:

Basically ANY company that is >=50% owned by a company based in China is banned from using Anthropic API services (and since Claude is entirely closed source, there is no way for anyone else to host it). This goes further than China's own ban because you can't bypass it via corporate VPNs, side contracts, offshore hosting, etc. The only way around it is to have a personal subscription and use a VPN, but with how expensive Claude is, I don't think that's going to be common.

Hyper · Sep 25, 2025

9dashline said:
Looks like Alibaba is cooking

One reason I think startups ultimately have no place is that they have no staying power at all. Alibaba and ByteDance have other sources of revenue and infrastructure that startups don't have. They can keep spending and keep releasing new models until the startups run out of money. And since startups already use AliCloud to reach customers, Alibaba wins either way.
