
Daily benchmarks, eval results, and lab notes from the people training the next generation of models.

OpenAI’s unprecedented $110 billion funding round at an $840 billion valuation highlights the intensifying competition in the AI sector and…

A new Nature study finds human scientists far surpass AI agents in complex research tasks, highlighting significant AI limitations in…

A new Nature study reveals that human scientists outperform AI on complex research tasks, challenging benchmark scores that suggest near-parity.

CVPR 2026 signals a pivotal moment in AI, as robotics and computer vision converge, showcased by groundbreaking papers and record…

Anthropic’s Opus 4.7, released today, introduces an architecture with a 1M-token context window, prompting new discussion of emergent AI capabilities.

Meta’s FAIR team breaks new ground with the ‘Llama 4 Maverick Scaling Report’, detailing the first public training log for…