GPT-5.4

OpenAI400K context2026-03-05
Overall Rank
#3
of 27 models
Overall Score
81.0
avg across benchmarks
Best Task
Text Extraction
91.1
Weakest Task
Visual QA
78.2
Benchmark Performance
OlmOCR Benchv1.0
8/27
| Overall | ArXiv Math | H&F | Long/Tiny | Multi-Col | Old Scans | Scans Math | Tables |
|---|---|---|---|---|---|---|---|
| 73.4 | 83.1 | 20.1 | 82.6 | 83.7 | 43.9 | 82.3 | 91.1 |
OmniDocBenchv1.5
10/27
| Overall | Text Edit↓ | CDM↑ | TEDS↑ | TEDS-S↑ | Read Order↓ |
|---|---|---|---|---|---|
| 85.3 | 0.089 | 83.4 | 81.3 | 86.7 | 0.077 |
IDP Core Benchv1.0
2/27
| Overall | KIE | OCR | Table | VQA |
|---|---|---|---|---|
| 84.4 | 85.7 | 69.1 | 94.8 | 78.2 |
Capability Profile
Strength Analysis
Auto-generated from benchmark scores
Strengths
- Text Extraction91.1
- Table Understanding89.1
Weaknesses
- Visual QA78.2
- Formula83.4