Claude Code + Claude Opus 4.8

Agent systems · Agent.

Claude Code + Claude Opus 4.8 is a model from Agent systems in the Agent family. evals.report tracks 2 reported Claude Code + Claude Opus 4.8 benchmark scores across Terminal-Bench 2.1, SWE-Marathon — each shown with its benchmark, metric, source status, and date, and never combined into a single ranking.

2 results

Benchmark results 2

Compare this model

Benchmark	Category	Score	Metric	Status	Date
Terminal-Bench 2.1	Agents	78.9%	task success	Verified	—	Details
SWE-Marathon	Agents	26.0%	resolution rate (pass@1)	Official	—	Details