The Test That Broke GPT-5: Why ARC-AGI-2 Proves We're Nowhere Near Human-Level AI

GPT-5 Pro scores 18.3% on the new benchmark. The previous version? 70.2%. Francois Chollet's test exposes what AI still can't do — and it's not what you'd expect.

---

Related Reading

- Which AI Hallucinates the Least? We Tested GPT-5, Claude, Gemini, and Llama on 10,000 Facts. - Llama 4 Beats GPT-5 on Coding and Math. Open-Source Just Won. - Frontier Models Are Now Improving Themselves. Researchers Aren't Sure How to Feel. - You Can Now See AI's Actual Reasoning. It's More Alien Than Expected. - Scientists Used AI to Discover a New Antibiotic That Kills Drug-Resistant Bacteria