dots.ocr

dots.ocr is a single vision-language model that jointly learns layout detection, text recognition, table and formula parsing, and reading order, replacing the usual multi-stage OCR pipeline. Built on a compact 1.7B-parameter LLM, it reaches state-of-the-art results on OmniDocBench and performs strongly on internal multilingual benchmarks, supporting over 100 languages. The project also introduces XDocParse, a 126-language benchmark on which dots.ocr sets a strong baseline, showing that a unified VLM can rival or beat specialized detectors.
Released: July 30, 2025

Overview

dots.ocr is a 1.7B-parameter vision-language model for multilingual document layout parsing. It unifies layout detection, OCR, and reading order in one model and achieves state-of-the-art results on OmniDocBench.

About Rednote HiLab

rednote (Xiaohongshu) is a Chinese social commerce platform where users share lifestyle content through photos, videos, and live streams, discover products, and make purchases. It operates as a hybrid social network and e-commerce hub serving over 350 million monthly active users, primarily in China.

Industry: Social Networking Platforms
Location: Shanghai, CN

Last updated: April 6, 2026