Benchmarks

All Coding Japanese Knowledge Math Overall Reasoning Vision
Vision

MMMU-Pro

Massive Multi-discipline Multimodal Understanding (harder version).

Metrics: Accuracy (%)