Anthropic launched Claude Opus 4.5, setting a new benchmark for AI coding and agentic capabilities
The model scored 80.9% on the SWE-bench Verified software engineering benchmark, outperforming competing models
Opus 4.5 scored higher than any human candidate on Anthropic's internal performance engineering take-home exam



