evals.report
BenchmarksLabsCompareRun guides

Kimi K2.6

Moonshot AI · Kimi. Released Apr 1, 2026.

7 results

Benchmark results 7

Compare this model
BenchmarkCategoryScoreMetricStatusDate
FrontierMathReasoning38.97%accuracyOfficialMay 30, 2026Details
DeepSWECoding23.89%% resolvedOfficialDetails
LiveBenchReasoning72.17%scoreOfficialJan 8, 2026Details
GPQA DiamondReasoning90.8%accuracyOfficialMay 30, 2026Details
SWE-bench VerifiedCoding76.7%% resolvedOfficialMay 30, 2026Details
Humanity's Last ExamReasoning29.9%accuracyOfficialMay 31, 2026Details
AIME (OTIS Mock)Reasoning96.1%accuracyOfficialMay 30, 2026Details