dots.ocr

dots.ocr

dots.ocr is a single vision-language model that jointly learns layout detection, text recognition, tables, formulas and reading order instead of using multi stage OCR pipelines. Built on a compact 1.7B LLM, it reaches SOTA on OmniDocBench and strong performance on multilingual internal benchmarks, supporting over 100 languages. The project also introduces XDocParse, a 126 language benchmark where dots.ocr sets a strong baseline, showing that a unified VLM can rival or beat specialized detectors.

Overview

dots.ocr is a 1.7B parameter vision-language model for multilingual document layout parsing, unifying layout detection, OCR and reading order in one model, and achieving state-of-the-art results on OmniDocBench.

🏭Manufacturing

About Rednote HiLab

rednote (Xiaohongshu) is a Chinese social commerce platform where users share lifestyle content through photos, videos, and live streams, discover products, and make purchases. It operates as a hybrid social network and e-commerce hub serving over 350 million monthly active users, primarily in China.

Industry: Social Networking Platforms

Location: Shanghai, CN

Website: www.xiaohongshu.com

View Company Profile

Tools using dots.ocr

No tools found for this model yet.

Last updated: April 6, 2026

Go to section

Search

Overview

About Rednote HiLab

Tools using dots.ocr

Related Models

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: