evals.report

OLMo 3.1-Think 32B

Allen Institute for AI · OLMo 3. Released Dec 12, 2025.

Benchmark results (2)

Benchmark     | Category        | Score | Metric   | Status     | Date
GPQA Diamond  | Reasoning       | 57.5% | accuracy | Unverified | Dec 12, 2025
Design Arena  | Chat preference | 1029  | Elo      | Verified   | Dec 12, 2025