News
LAION and Intel have released Empathic-Insight, a suite of models and datasets that can analyze facial images and audio files across 40 emotion categories, covering not only emotional but also ...
OpenAI has significantly updated ChatGPT's search feature: it now handles longer contexts, better follows instructions, answers complex questions with several parallel searches, and allows users to ...
To reach the performance of FineWeb-Edu, other datasets like C4 or Dolma need up to 10 times more training data. This again shows the effectiveness of focusing on high quality educational data, ...
Anthropic is adding two new features to its AI assistant Claude: an agent-based research feature that performs multiple sequential searches on its own, and a Google Workspace integration that provides ...
The team, led by Mehrdad Farajtabar, created a new evaluation tool called GSM-Symbolic. This tool builds on the GSM8K mathematical reasoning dataset and adds symbolic templates to test AI models more ...
Seedream 3.0 ranks ahead of GPT-4o in image quality benchmarks: In benchmarks such as the Artificial Analysis Arena, where users compare outputs from different models, Seedream 3.0 initially ranked first ...
Earlier plans for a dedicated OpenAI chip factory with TSMC are on hold. TSMC CEO C.C. Wei called OpenAI CEO Sam Altman's proposals "too aggressive," citing concerns about facility utilization rates.
OpenAI has introduced a new family of language models—GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano—exclusively for use via its API. According to the company, these models are targeted at professional ...
OpenAI has released GPT-4.5 as a "research preview," describing it as their largest and best model for chat. The new model is initially available to ChatGPT Pro users and developers, with Plus and ...
To address this challenge, DeepMind is exploring methods that allow AI systems to evaluate their own outputs. One approach is AI debate, in which models provide feedback on each other’s answers, ...
Researchers at Arizona State University have evaluated the planning capabilities of OpenAI's new AI model o1 using the PlanBench benchmark. o1 showed significant progress compared to traditional large ...
The neuroscientist Jean-Rémi King leads the Brain & AI team in Meta’s AI division. In an interview with The Decoder, he discusses the connection between AI and neuroscience, the challenges of ...