Arena Leaderboard

Elo rankings from human & LLM judge votes

Battle Now
Judge
RankModelCompanyEloWin%W / L / TBattles
1Nanonets OCR3nanonets114479.5%16W / 3L / 3T22
2Nanonets OCR2+nanonets112375.0%18W / 5L / 3T26
3GPT-5 Miniopenai103857.9%8W / 5L / 6T19
4GPT-5.2openai102857.4%12W / 8L / 7T27
5Gemini 2.5 Flash · Thinkinggoogle102250.0%6W / 6L / 4T16
6Claude Sonnet 4.6anthropic101452.3%8W / 7L / 7T22
7GPT-5.4openai100946.9%3W / 4L / 9T16
8GPT-5.4 · Low Reasoningopenai100547.8%8W / 9L / 6T23
9Gemini 3.1 Progoogle99146.2%4W / 5L / 4T13
10Gemini 2.5 Progoogle98647.4%6W / 7L / 6T19
11Claude Opus 4.6 · Low Thinkinganthropic98243.8%4W / 7L / 13T24
12Claude Sonnet 4.6 · Thinkinganthropic97648.4%11W / 12L / 9T32
13Claude Opus 4.6anthropic94640.9%4W / 8L / 10T22
14GPT-4.1openai94535.0%4W / 10L / 6T20
15Gemini 2.5 Flashgoogle94538.2%4W / 8L / 5T17
16GPT-5.4 · Medium Reasoningopenai94343.8%7W / 10L / 7T24
17Gemini 3 Flashgoogle90325.0%2W / 11L / 5T18