Terminus 2 + GPT-5.5

Agent systems · Agent.

Terminus 2 + GPT-5.5 is a model from Agent systems in the Agent family. evals.report tracks 2 reported benchmark scores for it, on Terminal-Bench 2.1 and SWE-Marathon. Each score is shown with its benchmark, metric, source status, and date, and scores are never combined into a single ranking.

Benchmark results (2)

Benchmark	Category	Score	Metric	Status	Date
Terminal-Bench 2.1	Agents	78.2%	task success	Verified	—
SWE-Marathon	Agents	6.0%	resolution rate (pass@1)	Official	—