evals.report
BenchmarksLabsCompareRun guides

Gemini 3.5 Flash

Google DeepMind · Gemini. Released Apr 20, 2026.

9 results

Benchmark results 9

Compare this model
BenchmarkCategoryScoreMetricStatusDate
FrontierMathReasoning38.97%accuracyOfficialMay 30, 2026Details
DeepSWECoding28.32%% resolvedOfficialDetails
ARC-AGI-2Reasoning72.08%accuracyOfficialMay 19, 2026Details
LMArenaChat preference1482source-defined ratingOfficialMay 27, 2026Details
LiveBenchReasoning75.02%scoreOfficialJan 8, 2026Details
GPQA DiamondReasoning92.8%accuracyOfficialMay 30, 2026Details
Humanity's Last ExamReasoning42.5%accuracyOfficialMay 31, 2026Details
AIME (OTIS Mock)Reasoning95.6%accuracyOfficialMay 30, 2026Details
SimpleQA VerifiedOther68.4%accuracyOfficialMay 30, 2026Details