
Qwen3.5-2B

Alibaba · 2025-09-01

Overall Rank

#20

of 27 models

Overall Score

62.6

avg across benchmarks

Best Task

Key Information Extraction

78.5

Weakest Task

Text Extraction

37.9

Benchmark Performance

OlmOCR Bench v1.0

9/27

Overall	ArXiv Math	H&F	Long/Tiny	Multi-Col	Old Scans	Scans Math	Tables
71.9	82.1	49.6	77.1	74.7	38.0	75.3	80.7

OmniDocBench v1.5

22/27

Overall	Text Edit↓	CDM↑	TEDS↑	TEDS-S↑	Read Order↓
48.7	0.621	62.9	45.3	48.2	0.401
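
The arrows mark direction: Text Edit and Read Order are edit distances (lower is better), while CDM, TEDS, and TEDS-S are scores (higher is better). The OmniDocBench Overall of 48.7 is consistent with averaging the inverted text edit distance with CDM and TEDS. A minimal sketch, assuming that composite; the function name `omnidoc_overall` is hypothetical and not part of any benchmark tooling:

```python
def omnidoc_overall(text_edit: float, cdm: float, teds: float) -> float:
    """Assumed composite: mean of inverted text edit distance, CDM, and TEDS."""
    # Edit distance lies in [0, 1] and lower is better, so flip it onto a
    # 0-100 "higher is better" scale before averaging.
    text_score = (1.0 - text_edit) * 100.0
    return (text_score + cdm + teds) / 3.0


# Values from the OmniDocBench row for this model.
print(round(omnidoc_overall(text_edit=0.621, cdm=62.9, teds=45.3), 1))
```

Under this assumption the text component alone is (1 - 0.621) × 100 = 37.9, which matches the Text Extraction score reported on this page.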

IDP Core Bench v1.0

19/27

Overall	KIE	OCR	Table	VQA
67.1	78.5	56.2	72.4	59.8
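
The Overall Score of 62.6 is labelled "avg across benchmarks", and it agrees with an unweighted mean of the three per-benchmark Overall values listed above. A short sketch of that arithmetic, assuming equal weights; the names `benchmark_overall` and `overall_score` are illustrative only:

```python
# Per-benchmark Overall scores as listed on this page.
benchmark_overall = {
    "OlmOCR Bench": 71.9,
    "OmniDocBench": 48.7,
    "IDP Core Bench": 67.1,
}


def overall_score(scores: dict[str, float]) -> float:
    """Unweighted mean rounded to one decimal (assumed aggregation)."""
    return round(sum(scores.values()) / len(scores), 1)


print(overall_score(benchmark_overall))
```

The per-benchmark Overall values themselves are not necessarily plain means of the columns shown, so the sketch only covers the top-level aggregate.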

Capability Profile

Strength Analysis

Auto-generated from benchmark scores

Strengths

Key Information Extraction	78.5
Layout & Order	67.3

Weaknesses

Text Extraction	37.9
Visual QA	59.8
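
The page does not say how the capability scores are derived. As one possible reading, the Layout & Order value of 67.3 can be reproduced by averaging the OlmOCR Multi-Col score with the inverted OmniDocBench Read Order distance, and the strengths and weaknesses match a simple top-two / bottom-two split. The sketch below is a guess that happens to fit these numbers, not the leaderboard's documented method; `layout_and_order` and `split_profile` are hypothetical helpers:

```python
def layout_and_order(multi_col: float, read_order_edit: float) -> float:
    # Assumption: mean of Multi-Col and (1 - read-order edit distance) * 100.
    return round((multi_col + (1.0 - read_order_edit) * 100.0) / 2.0, 1)


capabilities = {
    "Key Information Extraction": 78.5,               # IDP Core Bench KIE
    "Layout & Order": layout_and_order(74.7, 0.401),  # assumed derivation
    "Text Extraction": 37.9,                          # as listed on the page
    "Visual QA": 59.8,                                # IDP Core Bench VQA
}


def split_profile(scores: dict[str, float], k: int = 2):
    """Return (strengths, weaknesses): the k highest and the k lowest scores."""
    ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
    strengths = ranked[:k]
    weaknesses = sorted(ranked[-k:], key=lambda kv: kv[1])  # weakest first
    return strengths, weaknesses


strengths, weaknesses = split_profile(capabilities)
print(strengths)
print(weaknesses)
```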