evals.report

Kimi K2.5

Moonshot AI · Kimi. Released Jan 1, 2026.

Benchmark results (4)

Benchmark           Category   Score  Metric      Status    Date
FrontierMath        Reasoning  27.9%  accuracy    Official  May 30, 2026
GPQA Diamond        Reasoning  87.6%  accuracy    Official  May 30, 2026
SWE-bench Verified  Coding     73.8%  % resolved  Official  May 30, 2026
AIME (OTIS Mock)    Reasoning  92.2%  accuracy    Official  May 30, 2026