evals.report

Video-MMMU

A multi-discipline benchmark that evaluates how well large multimodal models acquire and apply knowledge from expert-level professional videos. It spans six disciplines and three cognitive stages (Perception, Comprehension, Adaptation), and is scored by question-answering accuracy.

Multimodal | accuracy | Higher is better
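
As a rough illustration of how question-answering accuracy could be tallied per cognitive stage, here is a minimal sketch. The record layout `(stage, predicted, gold)` and the function name `accuracy_by_stage` are assumptions made for this example and are not taken from the benchmark's own tooling; exact string matching is also an assumption, since the official scoring rules are not described here.

```python
from collections import defaultdict


def accuracy_by_stage(records):
    """Compute per-stage and overall accuracy.

    `records` is an iterable of (stage, predicted, gold) tuples.
    This layout is hypothetical, chosen only for illustration.
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for stage, predicted, gold in records:
        total[stage] += 1
        # Assumption: an answer counts as correct on exact match.
        if predicted == gold:
            correct[stage] += 1

    per_stage = {stage: correct[stage] / total[stage] for stage in total}
    n = sum(total.values())
    overall = sum(correct.values()) / n if n else 0.0
    return per_stage, overall


# Toy data, invented for the example (not real benchmark results).
records = [
    ("Perception", "A", "A"),
    ("Perception", "B", "C"),
    ("Comprehension", "D", "D"),
    ("Adaptation", "A", "B"),
]
per_stage, overall = accuracy_by_stage(records)
print(per_stage, overall)
```

Reporting the three stages separately alongside the overall figure keeps a strong Perception score from masking a weak Adaptation score.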

No run guide for this benchmark yet.