evals.report
BenchmarksLabsCompareRun guides

Gemini 3 Pro

Google DeepMind · Gemini. Released Nov 18, 2025.

12 results

Benchmark results 12

Compare this model
BenchmarkCategoryScoreMetricStatusDate
FrontierMathReasoning37.6%accuracyOfficialMay 30, 2026Details
Berkeley Function Calling LeaderboardTool use72.51%accuracyOfficialApr 12, 2026Details
LMArenaChat preference1479source-defined ratingOfficialMay 27, 2026Details
LiveBenchReasoning73.39%scoreOfficialJan 8, 2026Details
LiveCodeBench ProCoding2439Codeforces EloOfficialMay 31, 2026Details
SWE-bench ProCoding43.30%% resolvedOfficialMay 30, 2026Details
GPQA DiamondReasoning92.6%accuracyOfficialMay 30, 2026Details
SWE-bench VerifiedCoding72.9%% resolvedOfficialMay 30, 2026Details
Humanity's Last ExamReasoning38.3%accuracyOfficialMay 31, 2026Details
MMMU-ProMultimodal81.0%accuracyOfficialApr 8, 2026Details
AIME (OTIS Mock)Reasoning91.4%accuracyOfficialMay 30, 2026Details
SimpleQA VerifiedOther72.9%accuracyOfficialMay 30, 2026Details