Back to Leaderboard

GPT-4.1

OpenAI
OpenAI1048K context$13.6/1K pages2025-04-14
Overall Rank
#13
of 23 models
Overall Score
70.0
avg across benchmarks
Best Task
Key Information Extraction
87.1
Weakest Task
Visual QA
63.0

Benchmark Performance

OlmOCR Benchv1.0
19/23
OverallMathTablePresentAbsentOrder
55.560.059.147.334.959.4
OmniDocBenchv1.5
11/23
OverallText Edit↓CDM↑TEDS↑TEDS-S↑Read Order↓
79.90.16782.274.083.80.115
IDP Core Benchv1.0
9/23
OverallKIEOCRTableVQA
74.787.175.673.163.0

Capability Profile

Strength Analysis

Auto-generated from benchmark scores

Strengths

  • Key Information Extraction87.1
  • Text Extraction83.3

Weaknesses

  • Visual QA63.0
  • Table Understanding68.7