Back to Leaderboard

Qwen3.5-4B

Alibaba
Alibaba2025-09-01
Overall Rank
#13
of 27 models
Overall Score
72.5
avg across benchmarks
Best Task
Key Information Extraction
86.0
Weakest Task
Text Extraction
70.8

Benchmark Performance

OlmOCR Benchv1.0
7/27
OverallArXiv MathH&FLong/TinyMulti-ColOld ScansScans MathTables
75.486.747.283.979.241.181.985.0
OmniDocBenchv1.5
20/27
OverallText Edit↓CDM↑TEDS↑TEDS-S↑Read Order↓
67.60.29271.560.464.60.106
IDP Core Benchv1.0
13/27
OverallKIEOCRTableVQA
74.586.064.776.772.4

Capability Profile

Strength Analysis

Auto-generated from benchmark scores

Strengths

  • Key Information Extraction86.0
  • Layout & Order84.3

Weaknesses

  • Text Extraction70.8
  • Formula71.5