Run guidesReasoning
Run LiveBench
The same run guide is also available from the benchmark detail page.
Reasoningscore
Official repo includes run_livebench.py, scoring utilities, and download_leaderboard.py.
1Expected output
Use the official source links for current output format, submission steps, and benchmark-specific result files.
2Submit results
Keep source URL, source model name, benchmark version, harness, and run context attached to any reported score.
Gotchas
Show the LiveBench global average as a source-scoped LiveBench metric only; do not mix with unrelated benchmarks.
Do not mix this benchmark's metric with unrelated benchmark metrics.