evals.report

GLM-5.2

Z.ai · GLM. Released Jun 16, 2026.

GLM-5.2 is a model in Z.ai's GLM family, released Jun 16, 2026. evals.report tracks 11 reported GLM-5.2 benchmark scores across Humanity's Last Exam, AIME 2026, GPQA Diamond, MathArena HMMT February 2026, SWE-bench Pro, DeepSWE, Terminal-Bench 2.1, MCP Atlas, SWE-Marathon, FrontierSWE, and PostTrainBench. Each score is shown with its benchmark, metric, source status, and date, and the scores are never combined into a single ranking.

Benchmark results (11)

| Benchmark | Category | Score | Metric | Status | Date |
|---|---|---|---|---|---|
| Humanity's Last Exam | Reasoning | 40.5% | accuracy | Verified | Jun 16, 2026 |
| AIME 2026 | Reasoning | 99.2% | accuracy | Verified | Jun 16, 2026 |
| GPQA Diamond | Reasoning | 91.2% | accuracy | Verified | Jun 16, 2026 |
| MathArena HMMT February 2026 | Reasoning | 92.5% | accuracy | Verified | Jun 16, 2026 |
| SWE-bench Pro | Coding | 62.1% | % resolved | Verified | Jun 16, 2026 |
| DeepSWE | Coding | 46.2% | % resolved | Verified | Jun 16, 2026 |
| Terminal-Bench 2.1 | Agents | 81.0% | task success | Verified | Jun 16, 2026 |
| MCP Atlas | Tool use | 76.8% | pass rate | Verified | Jun 16, 2026 |
| SWE-Marathon | Agents | 13.0% | resolution rate (pass@1) | Verified | Jun 16, 2026 |
| FrontierSWE | Agents | 74% | dominance score | Official | Jun 16, 2026 |
| PostTrainBench | Agents | 34.3% | weighted average score | Verified | Jun 16, 2026 |
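For readers who want to work with these rows programmatically, the following is a minimal sketch, not an evals.report API. The `Result` type and the `by_category` and `with_status` helpers are hypothetical names introduced here for illustration, and the data is copied by hand from the table above. In keeping with the page's approach, the sketch only groups and filters rows. It does not average scores, because the metrics differ from benchmark to benchmark.

```python
from collections import defaultdict
from typing import NamedTuple


class Result(NamedTuple):
    """One reported score (hypothetical record type, not a site schema)."""
    benchmark: str
    category: str
    score: float  # percentage as reported on the page
    metric: str
    status: str


# Rows transcribed from the table above; all are dated Jun 16, 2026.
RESULTS = [
    Result("Humanity's Last Exam", "Reasoning", 40.5, "accuracy", "Verified"),
    Result("AIME 2026", "Reasoning", 99.2, "accuracy", "Verified"),
    Result("GPQA Diamond", "Reasoning", 91.2, "accuracy", "Verified"),
    Result("MathArena HMMT February 2026", "Reasoning", 92.5, "accuracy", "Verified"),
    Result("SWE-bench Pro", "Coding", 62.1, "% resolved", "Verified"),
    Result("DeepSWE", "Coding", 46.2, "% resolved", "Verified"),
    Result("Terminal-Bench 2.1", "Agents", 81.0, "task success", "Verified"),
    Result("MCP Atlas", "Tool use", 76.8, "pass rate", "Verified"),
    Result("SWE-Marathon", "Agents", 13.0, "resolution rate (pass@1)", "Verified"),
    Result("FrontierSWE", "Agents", 74.0, "dominance score", "Official"),
    Result("PostTrainBench", "Agents", 34.3, "weighted average score", "Verified"),
]


def by_category(results):
    """Group rows by category without combining their scores."""
    groups = defaultdict(list)
    for r in results:
        groups[r.category].append(r)
    return dict(groups)


def with_status(results, status):
    """Keep only rows whose source status matches, e.g. 'Verified'."""
    return [r for r in results if r.status == status]


if __name__ == "__main__":
    for category, rows in by_category(RESULTS).items():
        print(category)
        for r in rows:
            print(f"  {r.benchmark}: {r.score}% ({r.metric}, {r.status})")
```

Keeping each row as its own record preserves the metric and source status alongside the score, so a number such as a dominance score is never mistaken for an accuracy.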