apparently they also have a long version with 10 million token context windowjust api though, not open weights afaik
apparently they also have a long version with 10 million token context windowjust api though, not open weights afaik
well, they've had big office in Silicon Valley for a while now, especially ByteDance.Chinese tech firms building AI teams in Silicon Valley?
this will be open weights soon?
go Deepseek, some pretty strong claims here. Btw, o1-preview to me is the golden standard in reasoning, but it is so expensive (because it generates so many tokens and it's cost per 1m token is 6x that of gpt-4o). So, it would be good if there is a little competition here.
I would assume that weights will be available, but my understanding with o1 type is that there are additional reasoning steps. So is just having weights enough to replicate the model?this will be open weights soon?
yes should be, see:I would assume that weights will be available, but my understanding with o1 type is that there are additional reasoning steps. So is just having weights enough to replicate the model?