evals.report
BenchmarksLabsCompareRun guides

GPT-5.4

OpenAI · GPT. Released Mar 5, 2026.

7 results

Benchmark results 7

Compare this model
BenchmarkCategoryScoreMetricStatusDate
FrontierMathReasoning47.6%accuracyOfficialMay 30, 2026Details
DeepSWECoding55.53%% resolvedOfficialDetails
ARC-AGI-3Reasoning0.21%accuracyOfficialMay 19, 2026Details
LMArenaChat preference1472source-defined ratingOfficialMay 27, 2026Details
SWE-bench VerifiedCoding76.9%% resolvedOfficialMay 30, 2026Details
Humanity's Last ExamReasoning40.28%accuracyOfficialMay 31, 2026Details
MMMU-ProMultimodal82.1%accuracyOfficialApr 8, 2026Details