Since its debut, Gemini 3 has held the top spot on the LMArena leaderboard, a crowdsourced ranking in which thousands of real users compare AI models head-to-head across a wide range of tasks and vote on which response is better. But when it comes to the toughest reasoning benchmarks, there’s …