Question 1

What is LiveCodeBench Pro?

Accepted Answer

LiveCodeBench Pro is a live competitive-programming benchmark that rates LLMs with a Codeforces-style Elo on fresh contest problems.

Question 2

What does Codeforces Elo mean on LiveCodeBench Pro?

Accepted Answer

LiveCodeBench Pro reports each model's result as a Codeforces Elo rating, and a higher rating is better. Scores are shown only within LiveCodeBench Pro and are never averaged with other benchmarks.
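
To get a feel for what a rating gap means, here is a minimal sketch of the textbook Elo expected-score formula. This is an assumption for illustration only: Codeforces uses its own Elo-like variant, and this page does not describe how LiveCodeBench Pro derives its ratings. The function name `expected_score` is hypothetical.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score of A against B under the standard Elo logistic model.

    Assumption: textbook Elo with a 400-point scale, not necessarily the
    exact computation used by Codeforces or LiveCodeBench Pro.
    """
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


if __name__ == "__main__":
    # Equal ratings give an even expectation; a 400-point lead gives
    # roughly a 10-to-1 expectation under this model.
    print(expected_score(1500, 1500))
    print(round(expected_score(1900, 1500), 3))
```

Under this model only the difference between two ratings matters, which is one reason Elo values are read relative to each other and not as percentages.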

Question 3

What is the top reported LiveCodeBench Pro score?

Accepted Answer

Gemini 3 Deep Think has the top reported score on LiveCodeBench Pro: 3298 (Codeforces Elo).

Question 4

Why do LiveCodeBench Pro scores differ across runs?

Accepted Answer

Differences in harness, scaffold, reasoning effort, and prompt setup all change results, so two runs of the same model can produce different scores. evals.report keeps each score with its run context so the differences stay visible.
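
One way to picture "a score kept with its run context" is a record that stores the setup next to the number, with runs grouped per model instead of averaged. This is a hypothetical sketch, not evals.report's actual schema: the `RunScore` fields and the `group_by_model` helper are assumptions, and the demo values are invented placeholders, not reported results.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class RunScore:
    """One reported score plus the context it was produced in (hypothetical fields)."""
    model: str
    score: int
    harness: str
    reasoning_effort: str
    status: str


def group_by_model(runs):
    """Group runs by model, keeping every run separate (no averaging)."""
    grouped = defaultdict(list)
    for run in runs:
        grouped[run.model].append(run)
    return dict(grouped)


if __name__ == "__main__":
    # Placeholder data for illustration only.
    runs = [
        RunScore("example-model", 1500, "harness-a", "high", "Official"),
        RunScore("example-model", 1620, "harness-b", "high", "Unverified"),
    ]
    for model, model_runs in group_by_model(runs).items():
        for run in model_runs:
            print(model, run.score, run.harness, run.status)
```

Keeping both runs as separate records means the difference between them can be traced back to the harness or effort setting.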

Question 5

Does evals.report rank models across benchmarks?

Accepted Answer

No. LiveCodeBench Pro scores are shown within their own metric; evals.report never combines benchmarks into a composite ranking or a single "best model".

Model	Lab	Score (Codeforces Elo)	Source model	Status	Date
Gemini 3 Deep Think	Google DeepMind	3298	Gemini 3 Deep Think	Official	Dec 4, 2025
DeepSeek V4 Pro (Open)	DeepSeek	3206	DeepSeek V4 Pro Max	Unverified	Apr 24, 2026
Gemini 3.1 Pro Preview	Google DeepMind	2887	Gemini 3.1 Pro Preview	Official	Feb 19, 2026
Gemini 3 Pro	Google DeepMind	2439	Gemini 3 Pro Preview	Official	Nov 18, 2025
GPT-5.2	OpenAI	2393	GPT-5.2-high (2025-12-11)	Official	Dec 11, 2025
Gemini 3 Flash	Google DeepMind	2316	Gemini 3 Flash Preview	Official	Dec 17, 2025
GPT-5.1	OpenAI	2269	GPT-5.1-high (2025-11-13)	Official	Nov 12, 2025
GPT-5	OpenAI	2176	GPT-5-high (2025-08-07)	Official	Aug 7, 2025
o4-mini	OpenAI	2092	o4-mini-high (2025-04-16)	Official	Apr 16, 2025
Gemini 2.5 Pro	Google DeepMind	1769	Gemini 2.5 Pro	Official	Mar 25, 2025
Qwen3 235B A22B Instruct 2507 (Open)	Alibaba / Qwen	1673	Qwen3-235B-A22B-Thinking-2507	Official	Jul 21, 2025
Claude Sonnet 4.5	Anthropic	1412	Claude 4.5 Sonnet Thinking	Official	Sep 29, 2025
GPT-OSS-120B (Open)	OpenAI	1299	GPT OSS 120B	Official	Aug 5, 2025
Gemini 2.5 Flash	Google DeepMind	1288	Gemini 2.5 Flash	Official	Apr 17, 2025
DeepSeek R1 (Open)	DeepSeek	1284	DeepSeek R1 (2025-05-28)	Official	Jan 20, 2025
Qwen3 Max	Alibaba / Qwen	1226	Qwen3-Max	Official	Sep 5, 2025
DeepSeek V3 0324 (Open)	DeepSeek	1124	DeepSeek V3 (2025-03-24)	Official	Mar 24, 2025
o3	OpenAI	1010	o3-high (2025-04-16)	Official	Apr 16, 2025
GPT-4.1	OpenAI	606	GPT-4.1	Official	Apr 14, 2025
Claude 3.5 Sonnet	Anthropic	572	Claude 3.5 Sonnet	Official	Jun 20, 2024
Llama 4 Maverick (Open)	Meta	528	Llama 4 Maverick	Official	Apr 5, 2025
GPT-4o	OpenAI	210	GPT-4o (2024-11-20)	Official	May 13, 2024