evals.report
BenchmarksSourcesLabsCompareRun guides

LiveCodeBench Pro

A live competitive-programming benchmark that rates LLMs with a Codeforces-style Elo on fresh contest problems.

CodingCodeforces EloHigher is better

Problems and tooling are published; ratings are computed from live Codeforces-style contests.

Benchmark
LiveCodeBench Pro
Dataset
Not provided
Metric
Codeforces Elo

1Expected output

Use the official source links for current output format, submission steps, and benchmark-specific result files.

2Submit results

Keep source URL, source model name, benchmark version, harness, and run context attached to any reported score.

Gotchas

Elo is contest-relative and not comparable to pass@1 coding benchmarks; keep it separate.
Do not mix this benchmark's metric with unrelated benchmark metrics.