AI Benchmark
Benchmarks
Models
Cost
Compare
About
Benchmarks
All
Coding
Japanese
Knowledge
Math
Overall
Reasoning
Vision
Vision
MMMU-Pro
Massive Multi-discipline Multimodal Understanding (harder version).
Metrics: Accuracy (%)
Paper
Dataset