evals.report
BenchmarksLabsCompareRun guides

GPT-5.1

OpenAI · GPT. Released Nov 13, 2025.

8 results

Benchmark results 8

Compare this model
BenchmarkCategoryScoreMetricStatusDate
FrontierMathReasoning31.03%accuracyOfficialMay 30, 2026Details
LiveCodeBench ProCoding2269Codeforces EloOfficialMay 31, 2026Details
GPQA DiamondReasoning87.6%accuracyOfficialMay 30, 2026Details
SWE-bench VerifiedCoding68.0%% resolvedOfficialMay 30, 2026Details
Humanity's Last ExamReasoning27.2%accuracyOfficialMay 31, 2026Details
MMMU-ProMultimodal79.0%accuracyOfficialApr 8, 2026Details
AIME (OTIS Mock)Reasoning88.6%accuracyOfficialMay 30, 2026Details
SimpleQA VerifiedOther48.9%accuracyOfficialMay 30, 2026Details