PaddlePaddle/PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

71,802 stars9,918 forksPythonView on GitHub ↗
7-day Growth RateA+
+24/day
Top 4.9%

Daily compound star growth rate over 7 days

30-day Growth RateA+
+1/day
Top 4.9%

Daily compound star growth rate over 30 days

AccelerationS
Steady
Top 0.0%

Is the growth rate speeding up or slowing down?

OriginalityB-
3/100
Top 37.0%

Stars earned relative to the number of similar repos in the same category

Topics

ai4sciencechineseocrdocument-parsingdocument-translationkieocrpaddleocr-vlpdf-extractor-ragpdf-parserpdf2markdownpp-ocrpp-structurerag

Data as of 2026-03-08