Distillation has always been a pretty dumb cope since if it were just a matter of data, Gemini or Grok would at minimum be just as good as open models because you can distill open models to your hearts content, and yet even with half a million GPUs Elon's Collosus couldn't even put Grok on the chart.Isn’t it funny how Anthropic accuses China of distilling its AI to make their own and yet here you have Mythos that Anthropic has walled off to only a select few and yet GLM 5.2 surfaces that some in the Western world says beats it in some measures. I thought they walled it off so China couldn’t get it from them? China already had it without them and didn’t advertise it until Anthropic started waving it around like gun to tell everyone they had it as a threat.
Anthropic like to talk about distillation because Chinese labs publish the majority of public research, research that Anthropic implement into their own models, so they're forced into trying to differentiate themselves with data instead of tech.
And the thing is in this context even discussing Mythos at all is a false dichotomy, because the actual model you need to compare Mythos to are also classified Chinese model that only Beijing and Chinese institutions can use. The only model that you should compare GLM5.2 is Opus 4.8.