evals.report
BenchmarksSourcesLabsCompareRun guides

LiveCodeBench Pro

A live competitive-programming benchmark that rates LLMs with a Codeforces-style Elo on fresh contest problems.

CodingCodeforces EloHigher is better

What this benchmark measures

A live competitive-programming benchmark that rates LLMs with a Codeforces-style Elo on fresh contest problems.

Rows on this page are sourced from public benchmark artifacts, leaderboard exports, or source-linked model reports. Each row keeps benchmark version, source model name, and available run details attached to the score.

The metric shown here is Codeforces Elo. It should be interpreted within LiveCodeBench Pro and the LiveCodeBench Pro source context, not compared as part of a site-wide ranking.

What to be careful about

Elo is contest-relative and not comparable to pass@1 coding benchmarks; keep it separate.

No composite ranking
evals.report never combines benchmarks. Codeforces Elo on LiveCodeBench Pro is its own number — don’t average it with other metrics.