evals.report
BenchmarksLabsCompareRun guides

Gemini 3 Flash

Google DeepMind · Gemini. Released Feb 17, 2026.

9 results

Benchmark results 9

Compare this model
BenchmarkCategoryScoreMetricStatusDate
FrontierMathReasoning35.64%accuracyOfficialMay 30, 2026Details
DeepSWECoding5.16%% resolvedOfficialDetails
LMArenaChat preference1466source-defined ratingOfficialMay 27, 2026Details
LiveCodeBench ProCoding2316Codeforces EloOfficialMay 31, 2026Details
SWE-bench ProCoding34.63%% resolvedOfficialMay 30, 2026Details
SWE-bench VerifiedCoding75.4%% resolvedOfficialMay 30, 2026Details
Humanity's Last ExamReasoning36.6%accuracyOfficialMay 31, 2026Details
AIME (OTIS Mock)Reasoning92.8%accuracyOfficialMay 30, 2026Details
SimpleQA VerifiedOther67.4%accuracyOfficialMay 30, 2026Details