Claude Opus 4 Sets New Record on Agentic Coding: 72% on SWE-Bench Verified
Anthropic's latest model autonomously fixes real GitHub issues better than any AI before. Developers report it can now handle multi-file refactors that took hours.
In-depth coverage, analysis, and updates on SWE-Bench in AI and tech. 1 articles on AI Pulse.
Anthropic's latest model autonomously fixes real GitHub issues better than any AI before. Developers report it can now handle multi-file refactors that took hours.