Move over, DeepSeek. Seattle-based nonprofit AI lab Ai2 has released a benchmark-topping model called Tulu3-405B.
DeepSeek released the model code and pre-trained weights for DeepSeek-R1 but not its training data. Ai2 is taking a more open approach.
With DeepSeek-R1 matching OpenAI’s o1, the o3 release seems inevitable, but only because OpenAI had already planned it that way.
Alibaba claims that its Qwen2.5-Max artificial intelligence model outperforms rival models from OpenAI, Meta and DeepSeek.