evals.report
BenchmarksLabsCompareRun guides
BenchmarksReasoning

MMLU-ProX

A multilingual extension of MMLU-Pro spanning 29 typologically diverse languages with 11,829 parallel reasoning-focused multiple-choice questions (10 answer choices) per language, measuring LLM reasoning and knowledge across linguistic and cultural boundaries.

ReasoningaccuracyHigher is better

No run guide for this benchmark yet.