FrontierMath

Math

Cutting-edge mathematics problems (Tiers 1-3).

Metrics
Accuracy (%)

How to Run

Request access from Epoch AI, then follow their evaluation protocol

Leaderboard

Rank Model Provider Parameters Score
1 GPT-5.2 OpenAI Unknown 40.3%