evals.report

Claude Opus 4.6 thinking

Anthropic · Claude Opus. Released Feb 5, 2026.


Benchmark results (2)

Benchmark      Category         Score   Metric                 Status    Date
LMArena        Chat preference  1499    source-defined rating  Official  May 27, 2026
SWE-bench Pro  Coding           51.90%  % resolved             Official  May 30, 2026