Back to Leaderboard

Qwen3.5-2B

Alibaba
Alibaba2025-09-01
Overall Rank
#20
of 27 models
Overall Score
62.6
avg across benchmarks
Best Task
Key Information Extraction
78.5
Weakest Task
Text Extraction
37.9

Benchmark Performance

OlmOCR Benchv1.0
9/27
OverallArXiv MathH&FLong/TinyMulti-ColOld ScansScans MathTables
71.982.149.677.174.738.075.380.7
OmniDocBenchv1.5
22/27
OverallText Edit↓CDM↑TEDS↑TEDS-S↑Read Order↓
48.70.62162.945.348.20.401
IDP Core Benchv1.0
19/27
OverallKIEOCRTableVQA
67.178.556.272.459.8

Capability Profile

Strength Analysis

Auto-generated from benchmark scores

Strengths

  • Key Information Extraction78.5
  • Layout & Order67.3

Weaknesses

  • Text Extraction37.9
  • Visual QA59.8