Claude Mythos Preview
Anthropic · Claude. Released Apr 7, 2026.
Benchmark results (9)
| Benchmark | Category | Score | Metric | Status | Date |
|---|---|---|---|---|---|
| SWE-bench Verified | Coding | 93.9% | % resolved | Unverified | Apr 7, 2026 |
| SWE-bench Pro | Coding | 77.8% | % resolved | Unverified | Apr 7, 2026 |
| GPQA Diamond | Reasoning | 94.6% | accuracy | Unverified | Apr 7, 2026 |
| Humanity's Last Exam | Reasoning | 56.8% | accuracy | Unverified | Apr 7, 2026 |
| OSWorld | Agents | 79.6% | task success rate | Unverified | Apr 7, 2026 |
| GAIA: A Benchmark for General AI Assistants | Agents | 52.3% | accuracy | Unverified | Apr 7, 2026 |
| METR Task-Completion Time Horizons | Agents | 1044.8 min | 50% time horizon | Official | Apr 7, 2026 |
| CharXiv | Multimodal | 93.2% | accuracy | Unverified | Apr 7, 2026 |
| SWE-bench Multimodal | Coding | 59.0% | % resolved | Verified | Apr 7, 2026 |