GLM-5.2
Z.ai · GLM. Released Jun 16, 2026.
GLM-5.2 is a model from Z.ai in the GLM family, released Jun 16, 2026. evals.report tracks 11 reported GLM-5.2 benchmark scores: Humanity's Last Exam, AIME 2026, GPQA Diamond, MathArena HMMT February 2026, SWE-bench Pro, DeepSWE, Terminal-Bench 2.1, MCP Atlas, SWE-Marathon, FrontierSWE, and PostTrainBench. Each score is shown with its benchmark, metric, source status, and date, and scores are never combined into a single ranking.
Benchmark results (11)
| Benchmark | Category | Score | Metric | Status | Date |
|---|---|---|---|---|---|
| Humanity's Last Exam | Reasoning | 40.5% | accuracy | Verified | Jun 16, 2026 |
| AIME 2026 | Reasoning | 99.2% | accuracy | Verified | Jun 16, 2026 |
| GPQA Diamond | Reasoning | 91.2% | accuracy | Verified | Jun 16, 2026 |
| MathArena HMMT February 2026 | Reasoning | 92.5% | accuracy | Verified | Jun 16, 2026 |
| SWE-bench Pro | Coding | 62.1% | % resolved | Verified | Jun 16, 2026 |
| DeepSWE | Coding | 46.2% | % resolved | Verified | Jun 16, 2026 |
| Terminal-Bench 2.1 | Agents | 81.0% | task success | Verified | Jun 16, 2026 |
| MCP Atlas | Tool use | 76.8% | pass rate | Verified | Jun 16, 2026 |
| SWE-Marathon | Agents | 13.0% | resolution rate (pass@1) | Verified | Jun 16, 2026 |
| FrontierSWE | Agents | 74% | dominance score | Official | Jun 16, 2026 |
| PostTrainBench | Agents | 34.3% | weighted average score | Verified | Jun 16, 2026 |
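
For readers who want to work with these rows programmatically, the following is a minimal sketch of one way to hold them in Python. The `Result` type and the helper names `by_category` and `not_verified` are hypothetical and are not part of any evals.report API. The values are copied from the table above. In keeping with the page's approach, the sketch groups and filters results but does not average scores, since the metrics differ from benchmark to benchmark.

```python
from collections import defaultdict
from typing import NamedTuple


class Result(NamedTuple):
    """One table row (hypothetical type, for illustration only)."""
    benchmark: str
    category: str
    score: float  # percentage, as reported in the table
    metric: str
    status: str


# Rows copied from the table above; every row is dated Jun 16, 2026.
RESULTS = [
    Result("Humanity's Last Exam", "Reasoning", 40.5, "accuracy", "Verified"),
    Result("AIME 2026", "Reasoning", 99.2, "accuracy", "Verified"),
    Result("GPQA Diamond", "Reasoning", 91.2, "accuracy", "Verified"),
    Result("MathArena HMMT February 2026", "Reasoning", 92.5, "accuracy", "Verified"),
    Result("SWE-bench Pro", "Coding", 62.1, "% resolved", "Verified"),
    Result("DeepSWE", "Coding", 46.2, "% resolved", "Verified"),
    Result("Terminal-Bench 2.1", "Agents", 81.0, "task success", "Verified"),
    Result("MCP Atlas", "Tool use", 76.8, "pass rate", "Verified"),
    Result("SWE-Marathon", "Agents", 13.0, "resolution rate (pass@1)", "Verified"),
    Result("FrontierSWE", "Agents", 74.0, "dominance score", "Official"),
    Result("PostTrainBench", "Agents", 34.3, "weighted average score", "Verified"),
]


def by_category(results):
    """Group rows by category without aggregating their scores."""
    groups = defaultdict(list)
    for r in results:
        groups[r.category].append(r)
    return dict(groups)


def not_verified(results):
    """Return benchmark names whose status is anything other than 'Verified'."""
    return [r.benchmark for r in results if r.status != "Verified"]


if __name__ == "__main__":
    for category, rows in by_category(RESULTS).items():
        print(category)
        for r in rows:
            print(f"  {r.benchmark}: {r.score}% ({r.metric}, {r.status})")
```

Keeping the metric and status next to each score makes it harder to compare numbers that measure different things, for example a dominance score against a pass@1 resolution rate.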