OCR and document AI

enterprise OCR

Alibaba Cloud OCR

阿里云

Alibaba Cloud OCR is a OCR and document AI product from 阿里云, focused on enterprise OCR with tags such as OCR, API, China ecosystem.

closed-source / platformfree trial / usage-basedAPI
OCRAPIChina ecosystem
table and receipt OCR

Amazon Textract

AWS

Amazon Textract is a OCR and document AI product from AWS, focused on table and receipt OCR with tags such as OCR, Document AI, Table extraction.

closed-source / platformfree trial / usage-basedAPI
OCRDocument AITable extraction
enterprise OCR

Azure Document Intelligence

Microsoft

Azure Document Intelligence is a OCR and document AI product from Microsoft, focused on enterprise OCR with tags such as OCR, Document AI, Table extraction.

closed-source / platformfree trial / usage-basedAPI
OCRDocument AITable extraction
cloud OCR

Baidu OCR

百度智能云

Baidu OCR is a OCR and document AI product from 百度智能云, focused on cloud OCR with tags such as OCR, API, China ecosystem.

closed-source / platformfree trial / usage-basedAPI
OCRAPIChina ecosystem
document structuring

Docling

Docling

Docling is a OCR and document AI product from Docling, focused on document structuring with tags such as OCR, Document AI, Open source.

open-source / self-hostedopen source / self-hostedAPI
OCRDocument AIOpen source
enterprise document AI

Google Document AI

Google

Google Document AI is a OCR and document AI product from Google, focused on enterprise document AI with tags such as OCR, Document AI, Table extraction.

closed-source / platformfree trial / usage-basedAPI
OCRDocument AITable extraction
document parsing API

LlamaParse

LlamaIndex

LlamaParse is a OCR and document AI product from LlamaIndex, focused on document parsing API with tags such as OCR, Document AI, API.

closed-source / platformfree trial / usage-basedAPI
OCRDocument AIAPI
formula OCR

Mathpix

Mathpix

Mathpix is a OCR and document AI product from Mathpix, focused on formula OCR with tags such as OCR, Document AI, API.

closed-source / platformfree trial / usage-basedAPI
OCRDocument AIAPI
open-source document parsing

MinerU

OpenDataLab

MinerU is a OCR and document AI product from OpenDataLab, focused on open-source document parsing with tags such as OCR, Document AI, Open source.

open-source / self-hostedopen source / self-hostedAPI
OCRDocument AIOpen source
document parsing

Mistral OCR

Mistral AI

Mistral OCR is a OCR and document AI product from Mistral AI, focused on document parsing with tags such as OCR, Document AI, API.

closed-source / platformfree trial / usage-basedAPI
OCRDocument AIAPI
open-source OCR

PaddleOCR

百度飞桨

PaddleOCR is a OCR and document AI product from 百度飞桨, focused on open-source OCR with tags such as OCR, Open source, China ecosystem.

open-source / self-hostedopen source / self-hostedAPI
OCROpen sourceChina ecosystem
OCR API

Tencent Cloud OCR

腾讯云

Tencent Cloud OCR is a OCR and document AI product from 腾讯云, focused on OCR API with tags such as OCR, API, China ecosystem.

closed-source / platformfree trial / usage-basedAPI
OCRAPIChina ecosystem
unstructured document parsing

Unstructured

Unstructured

Unstructured is a OCR and document AI product from Unstructured, focused on unstructured document parsing with tags such as OCR, Document AI, Workflow.

closed-source / platformfree trial / usage-basedAPI
OCRDocument AIWorkflow
Selection guide

How to choose OCR and document AI

  • Separate scanned docs, invoices, tables, contracts, and PDF Q&A first because engines excel at different document types.
  • Recognition tasks should prioritize OCR accuracy and layout recovery, while understanding tasks should prioritize extraction and QA stability.
  • Batch processing must check API support, queues, page limits, and retry behavior.
  • For contracts or financial documents, then check private deployment, permissions, and audit trails.

What matters first on OCR and document AI category pages?

Start with official access, pricing model, API support, open/closed status, and common use cases.