evals.report
BenchmarksLabsCompareRun guides

Kimi K2 Instruct

Moonshot AI · Kimi. Released Jul 11, 2025.

2 results

Benchmark results 2

Compare this model
BenchmarkCategoryScoreMetricStatusDate
Berkeley Function Calling LeaderboardTool use59.06%accuracyOfficialApr 12, 2026Details
SWE-bench ProCoding27.67%% resolvedOfficialMay 30, 2026Details