PaddlePaddle/PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

71,802 stars · 9,918 forks · Python

7-day Growth Rate: A+

+24/day

Top 4.9%

Daily compound star growth rate over 7 days

30-day Growth Rate: A+

+1/day

Top 4.9%

Daily compound star growth rate over 30 days
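The page does not say how the "daily compound star growth rate" is calculated. A minimal sketch, assuming the standard compound-growth formula (end count divided by start count, raised to 1/days, minus 1), might look like this. The function name and the star counts in the usage lines are hypothetical and are not taken from the page.

```python
def daily_compound_growth_rate(stars_start: int, stars_end: int, days: int) -> float:
    """Return the constant daily rate r such that
    stars_start * (1 + r) ** days == stars_end.

    Assumption: this is the textbook compound-growth formula, not
    necessarily the exact formula used to produce the grades above.
    """
    if stars_start <= 0:
        raise ValueError("stars_start must be positive")
    if days <= 0:
        raise ValueError("days must be positive")
    return (stars_end / stars_start) ** (1.0 / days) - 1.0


# Hypothetical counts, for illustration only.
rate_7d = daily_compound_growth_rate(stars_start=1_000, stars_end=1_100, days=7)
print(f"7-day daily compound growth rate: {rate_7d:.4%}")
```

Under this assumption, a compound rate normalizes growth by repository size, so the same number of new stars per day yields a lower rate for a larger repository.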

Acceleration: S

Steady

Top 0.0%

Is the growth rate speeding up or slowing down?

Originality: B-

3/100

Top 37.0%

Stars earned relative to the number of similar repos in the same category

Topics

ai4science, chineseocr, document-parsing, document-translation, kie, ocr, paddleocr-vl, pdf-extractor-rag, pdf-parser, pdf2markdown, pp-ocr, pp-structure, rag

Data as of 2026-03-08