Qwen3.5-4B

Alibaba2025-09-01
Overall Rank
#13
of 27 models
Overall Score
72.5
avg across benchmarks
Best Task
Key Information Extraction
86.0
Weakest Task
Text Extraction
70.8
Benchmark Performance
OlmOCR Benchv1.0
7/27
| Overall | ArXiv Math | H&F | Long/Tiny | Multi-Col | Old Scans | Scans Math | Tables |
|---|---|---|---|---|---|---|---|
| 75.4 | 86.7 | 47.2 | 83.9 | 79.2 | 41.1 | 81.9 | 85.0 |
OmniDocBenchv1.5
20/27
| Overall | Text Edit↓ | CDM↑ | TEDS↑ | TEDS-S↑ | Read Order↓ |
|---|---|---|---|---|---|
| 67.6 | 0.292 | 71.5 | 60.4 | 64.6 | 0.106 |
IDP Core Benchv1.0
13/27
| Overall | KIE | OCR | Table | VQA |
|---|---|---|---|---|
| 74.5 | 86.0 | 64.7 | 76.7 | 72.4 |
Capability Profile
Strength Analysis
Auto-generated from benchmark scores
Strengths
- Key Information Extraction86.0
- Layout & Order84.3
Weaknesses
- Text Extraction70.8
- Formula71.5