Qwen3.5-2B

Alibaba2025-09-01
Overall Rank
#20
of 27 models
Overall Score
62.6
avg across benchmarks
Best Task
Key Information Extraction
78.5
Weakest Task
Text Extraction
37.9
Benchmark Performance
OlmOCR Benchv1.0
9/27
| Overall | ArXiv Math | H&F | Long/Tiny | Multi-Col | Old Scans | Scans Math | Tables |
|---|---|---|---|---|---|---|---|
| 71.9 | 82.1 | 49.6 | 77.1 | 74.7 | 38.0 | 75.3 | 80.7 |
OmniDocBenchv1.5
22/27
| Overall | Text Edit↓ | CDM↑ | TEDS↑ | TEDS-S↑ | Read Order↓ |
|---|---|---|---|---|---|
| 48.7 | 0.621 | 62.9 | 45.3 | 48.2 | 0.401 |
IDP Core Benchv1.0
19/27
| Overall | KIE | OCR | Table | VQA |
|---|---|---|---|---|
| 67.1 | 78.5 | 56.2 | 72.4 | 59.8 |
Capability Profile
Strength Analysis
Auto-generated from benchmark scores
Strengths
- Key Information Extraction78.5
- Layout & Order67.3
Weaknesses
- Text Extraction37.9
- Visual QA59.8