{"name":"PaddleOCR: A Powerful OCR Toolkit for Structured Document Data","description":"PaddleOCR is an industry-leading, production-ready OCR and document AI engine that transforms any PDF or image document into structured, AI-friendly data. It offers end-to-end solutions from text extraction to intelligent document understanding, supporting over 100 languages with high accuracy and efficiency.","github":"https://github.com/PaddlePaddle/PaddleOCR","url":"https://osrepos.com/repo/paddlepaddle-paddleocr","source":"osrepos.com","sourceDescription":"This repository profile is provided by osrepos.com, an open source repository discovery platform.","repositoryProfile":"https://osrepos.com/repo/paddlepaddle-paddleocr","generatedFor":"open source discovery and AI-assisted research","markdown":"https://osrepos.com/repo/paddlepaddle-paddleocr.md","json":"https://osrepos.com/repo/paddlepaddle-paddleocr.json","topics":["ocr","document-parsing","ai","python","machine-learning","multilingual-ocr","pdf-processing","information-extraction"],"keywords":["ocr","document-parsing","ai","python","machine-learning","multilingual-ocr","pdf-processing","information-extraction"],"stars":null,"summary":"PaddleOCR is an industry-leading, production-ready OCR and document AI engine that transforms any PDF or image document into structured, AI-friendly data. It offers end-to-end solutions from text extraction to intelligent document understanding, supporting over 100 languages with high accuracy and efficiency.","content":"## Introduction\nPaddleOCR is an industry-leading, production-ready Optical Character Recognition (OCR) and document AI engine developed by PaddlePaddle. It provides comprehensive, end-to-end solutions, transforming any PDF or image document into structured, AI-friendly data like JSON and Markdown. With support for over 100 languages, PaddleOCR bridges the gap between raw visual documents and advanced Large Language Models (LLMs), making it a powerful and lightweight toolkit for various AI applications. The project boasts over 72,000 stars on GitHub, highlighting its widespread adoption and impact in the AI community. Recent advancements include PaddleOCR-VL-1.5 for real-world document parsing and text spotting, and PP-OCRv5 for universal scene text recognition.\n\n## Installation\nTo get started with PaddleOCR, you first need to install PaddlePaddle. Refer to the [PaddlePaddle Installation Guide](https://www.paddlepaddle.org.cn/en/install/quick?docurl=/documentation/docs/en/develop/install/pip/linux-pip_en.html) for detailed instructions. Once PaddlePaddle is installed, you can install the PaddleOCR toolkit using pip:\n\nbash\n# If you only want to use the basic text recognition feature (returns text position coordinates and content), including the PP-OCR series\npython -m pip install paddleocr\n\n\nFor full functionality, including document parsing, understanding, and translation, you can install with the `[all]` dependency group:\n\nbash\n# If you want to use all features such as document parsing, document understanding, document translation, key information extraction, etc.\npython -m pip install \"paddleocr[all]\"\n\n\nPaddleOCR also supports installing partial optional features by specifying other dependency groups like `doc-parser` for document parsing, `ie` for information extraction, and `trans` for document translation.\n\n## Examples\nPaddleOCR offers both command-line interface (CLI) and API for inference.\n\n### CLI Examples:\n\nbash\n# Run PP-OCRv5 inference\npaddleocr ocr -i https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_ocr_002.png --use_doc_orientation_classify False --use_doc_unwarping False --use_textline_orientation False  \n\n# Run PP-StructureV3 inference\npaddleocr pp_structurev3 -i https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/pp_structure_v3_demo.png --use_doc_orientation_classify False --use_doc_unwarping False\n\n# Run PaddleOCR-VL inference\npaddleocr doc_parser -i https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/paddleocr_vl_demo.png\n\n\n### API Example (PP-OCRv5):\n\npython\n# Initialize PaddleOCR instance\nfrom paddleocr import PaddleOCR\nocr = PaddleOCR(\n    use_doc_orientation_classify=False,\n    use_doc_unwarping=False,\n    use_textline_orientation=False)\n\n# Run OCR inference on a sample image \nresult = ocr.predict(\n    input=\"https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_ocr_002.png\")\n\n# Visualize the results and save the JSON results\nfor res in result:\n    res.print()\n    res.save_to_img(\"output\")\n    res.save_to_json(\"output\")\n\n\n## Why Use PaddleOCR\nPaddleOCR stands out as a premier solution for intelligent document applications in the AI era due to several compelling reasons:\n*   **Industry-Leading Accuracy:** It consistently achieves state-of-the-art performance in various OCR and document parsing benchmarks, including complex real-world scenarios.\n*   **Multilingual Support:** With robust support for over 100 languages, it caters to global applications and diverse linguistic needs.\n*   **Comprehensive Functionality:** Beyond basic text recognition, it offers advanced features like document parsing (PP-StructureV3), intelligent information extraction (PP-ChatOCRv4), and document translation (PP-DocTranslation).\n*   **Production-Ready and Efficient:** Designed for practical deployment, PaddleOCR is lightweight, resource-efficient, and supports high-performance inference across various hardware, including CPU, GPU, XPU, and NPU.\n*   **Strong Community and Integrations:** Integrated into leading projects like MinerU, RAGFlow, and pathway, it benefits from an active community and extensive documentation.\n\n## Links\n*   **GitHub Repository:** [PaddlePaddle/PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)\n*   **Official Website:** [PaddleOCR Official Website](https://www.paddleocr.com)\n*   **PaddleOCR 3.0 Technical Report:** [arXiv](https://arxiv.org/abs/2507.05595)\n*   **PaddleOCR-VL Technical Report:** [arXiv](https://arxiv.org/abs/2510.14528)\n*   **HuggingFace Demo (PaddleOCR-VL-1.5):** [PaddleOCR-VL-1.5 Online Demo](https://huggingface.co/spaces/PaddlePaddle/PaddleOCR-VL-1.5_Online_Demo)\n*   **ModelScope Demo (PaddleOCR-VL-1.5):** [PaddleOCR-VL-1.5 Online Demo](https://www.modelscope.cn/studios/PaddlePaddle/PaddleOCR-VL-1.5_Online_Demo)\n*   **Documentation:** [PaddleOCR Documentation](https://paddlepaddle.github.io/PaddleOCR/latest/en/)","metrics":{"detailViews":6,"githubClicks":6},"dates":{"published":null,"modified":"2026-03-14T00:37:41.000Z"}}