Cognition
Maker of Devin and the SWE-family coding models (Windsurf/Devin); creator of the FrontierCode benchmark.
Models: 1
Progress by benchmark
Single benchmark only
This view shows only FrontierCode, measured by its weighted score (Diamond). Other benchmarks use different metrics and are not directly comparable.
Progress matrix
Scores are not normalised across benchmarks: each column uses its own metric, so compare values within a column, not across columns.