evals.report
BenchmarksLabsCompareRun guides

Gemini 3.1 Pro Preview

Google DeepMind · Gemini. Released Mar 5, 2026.

14 results

Benchmark results 14

Compare this model
BenchmarkCategoryScoreMetricStatusDate
FrontierMathReasoning36.9%accuracyOfficialMay 30, 2026Details
DeepSWECoding9.88%% resolvedOfficialDetails
ARC-AGI-3Reasoning0.42%accuracyOfficialMay 19, 2026Details
ARC-AGI-2Reasoning77.08%accuracyOfficialMay 19, 2026Details
LMArenaChat preference1481source-defined ratingOfficialMay 27, 2026Details
LiveBenchReasoning79.93%scoreOfficialJan 8, 2026Details
LiveCodeBench ProCoding2887Codeforces EloOfficialMay 31, 2026Details
SWE-bench ProCoding46.10%% resolvedOfficialMay 30, 2026Details
GPQA DiamondReasoning94.1%accuracyOfficialMay 30, 2026Details
SWE-bench VerifiedCoding75.6%% resolvedOfficialMay 30, 2026Details
Humanity's Last ExamReasoning45.9%accuracyOfficialMay 31, 2026Details
MMMU-ProMultimodal80.5%accuracyOfficialApr 8, 2026Details
AIME (OTIS Mock)Reasoning95.6%accuracyOfficialMay 30, 2026Details
SimpleQA VerifiedOther77.3%accuracyOfficialMay 30, 2026Details