LabsAlibaba / Qwen
Models 7
Qwen 3 Coder 480B
Qwen · qwen3-coder-480b
2025-07-22
1 results
Qwen3 235B A22B Instruct 2507
Qwen · qwen3-235b-a22b-instruct-2507
2025-07-25
4 results
Qwen3 Max
Qwen · qwen3-max
2025-09-23
1 results
Qwen3.5 Max Preview
Qwen · qwen3.5-max-preview
2026-03-01
1 results
Qwen 3.6 Plus
Qwen · qwen 3.6 plus
2026-03-31
3 results
Qwen 3.6 Max Preview
Qwen · qwen 3.6 max
2026-04-20
3 results
Qwen3.7 Max Preview
Qwen · qwen3.7-max-preview
2026-05-01
2 results
Progress by benchmark
Show progress on
Qwen 3 Coder 480B
Jul 22, 2025
—
Qwen3 235B A22B Instruct 2507
Jul 25, 2025
—
Qwen3 Max
Sep 23, 2025
—
Qwen3.5 Max Preview
Mar 1, 2026
—
Qwen 3.6 Plus
Mar 31, 2026
—
Qwen 3.6 Max Preview
Apr 20, 2026
—
Qwen3.7 Max Preview
May 1, 2026
—
Single benchmark only
This view shows SWE-bench Verified (% resolved) only. Other benchmarks use different metrics and are not directly comparable.
Progress matrix
| Model | SWE-bench Verified % resolved | GPQA Diamond accuracy | LiveCodeBench Pro Codeforces Elo | Berkeley Function Calling Leaderboard accuracy | LiveBench score | Terminal-Bench 2.1 task success | SWE-bench Pro % resolved | DeepSWE % resolved | Humanity's Last Exam accuracy | MMMU-Pro accuracy | LMArena source-defined rating | ARC-AGI-3 accuracy | ARC-AGI-2 accuracy | FrontierMath accuracy | AIME (OTIS Mock) accuracy | SimpleQA Verified accuracy |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Qwen 3 Coder 480B Qwen | — | — | — | — | — | — | 38.70% | — | — | — | — | — | — | — | — | — |
| Qwen3 235B A22B Instruct 2507 Qwen | — | — | 1673 | 52.15% | — | — | 21.41% | — | — | — | — | — | — | — | — | 50.1% |
| Qwen3 Max Qwen | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | 67.5% |
| Qwen3.5 Max Preview Qwen | — | — | — | — | — | — | — | — | — | — | 1470 | — | — | — | — | — |
| Qwen 3.6 Plus Qwen | — | 87.4% | — | — | — | — | — | — | — | — | — | — | — | — | 90.6% | 49.1% |
| Qwen 3.6 Max Preview Qwen | — | 89.1% | — | — | — | — | — | — | — | — | — | — | — | — | 91.1% | 56.9% |
| Qwen3.7 Max Preview Qwen | — | — | — | — | 74.29% | — | — | — | — | — | 1474 | — | — | — | — | — |
Scores are not normalised across benchmarks. Each column uses its own metric. Compare columns independently.