Tagged articles

PaddleOCR

20 articles · Page 1 of 1

Jul 5, 2026 · Artificial Intelligence

Why PaddleOCR Is the Must‑Use Open‑Source OCR Tool

PaddleOCR, an open‑source OCR library from Baidu, offers high‑precision multilingual text extraction, lightweight models, and a modern pipeline, with benchmarks showing superior accuracy and speed over Tesseract and EasyOCR, and provides detailed installation, usage, and Java integration guides for developers.

BenchmarkDeep LearningJava Integration

0 likes · 23 min read

Why PaddleOCR Is the Must‑Use Open‑Source OCR Tool

Python Crawling & Data Mining

Jun 7, 2026 · Artificial Intelligence

Python OCR Table Extraction: Boost Accuracy from 95% to 99% with Batch Processing

The article explains why generic OCR struggles with structured tables, proposes a partition‑based fixed‑region recognition method using PaddleOCR, provides a complete Python script for batch processing, and demonstrates how this approach consistently achieves over 99% accuracy.

Batch ProcessingOCRPaddleOCR

0 likes · 4 min read

Python OCR Table Extraction: Boost Accuracy from 95% to 99% with Batch Processing

SuanNi

Apr 30, 2026 · Artificial Intelligence

Deploy a 24/7 Document Recognition Toolbox with the PaddleOCR Image on the Cloud

This guide explains how to use Baidu's open‑source PaddleOCR engine—its full OCR and layout analysis pipeline, multi‑language support, and output formats—to set up a continuously running document recognition service on the 算网 GPU cloud platform, including environment preparation, model configuration, and inference execution.

Document processingGPUMagicMind

0 likes · 6 min read

Deploy a 24/7 Document Recognition Toolbox with the PaddleOCR Image on the Cloud

AI Explorer

Mar 30, 2026 · Industry Insights

AI Tech Daily March 30 2026: PaddleOCR tops GitHub, New Large Models, and Security Advances

The March 30, 2026 AI Tech Daily highlights PaddleOCR surpassing Tesseract with 73.3K stars, GigaWorld's GigaBrain models leading global rankings, Eli Lilly's 2.75 billion‑RMB AI drug‑discovery investment, Doubao joining top‑tier LLMs, Stanford's InfoTok video tokenization breakthrough, Claude's rapid vulnerability exploit, a new Claude model beating Opus 4.6, Node.js core members petitioning to ban AI‑assisted coding, Kimi K2.5 reaching $1 billion monthly revenue, a startup's tiny model delivering 15× inference speed, and Claude's 15 KB engine boosting rendering speed 1,200×.

AI drug discoveryClaudeGigaBrain

0 likes · 6 min read

AI Tech Daily March 30 2026: PaddleOCR tops GitHub, New Large Models, and Security Advances

Old Zhang's AI Learning

Jan 30, 2026 · Artificial Intelligence

PaddleOCR‑VL‑1.5: 0.9B Model Beats Billion‑Parameter OCR Models with 94.5% Accuracy

PaddleOCR‑VL‑1.5, the latest Baidu release, uses only 0.9 B parameters to achieve 94.5% accuracy on OmniDocBench v1.5, surpassing larger open‑source and commercial OCR models, while offering multi‑task, multi‑language support, lightweight deployment, and detailed performance benchmarks.

DeepSeek-OCRGPU inferenceOCR

0 likes · 9 min read

PaddleOCR‑VL‑1.5: 0.9B Model Beats Billion‑Parameter OCR Models with 94.5% Accuracy

Old Meng AI Explorer

Dec 26, 2025 · Artificial Intelligence

How PaddleOCR Boosts Text Extraction Efficiency 10×: A Hands‑On Review

PaddleOCR, Baidu’s open‑source OCR engine, delivers high‑accuracy multilingual text extraction from images, PDFs, and handwritten notes, offering offline operation, free commercial use, and specialized models for invoices, IDs, and tables, enabling users to automate document processing and increase productivity up to tenfold.

.aiDocument AutomationMultilingual

0 likes · 9 min read

How PaddleOCR Boosts Text Extraction Efficiency 10×: A Hands‑On Review

Old Meng AI Explorer

Oct 30, 2025 · Artificial Intelligence

How PaddleOCR Turns Handwritten Notes and PDFs into Editable Text in Seconds

This article explains how PaddleOCR, an open‑source OCR engine from Baidu, achieves high‑accuracy text extraction from handwritten notes, scanned PDFs, invoices, IDs and multilingual documents, offering offline cross‑platform support, free commercial use, and step‑by‑step guidance for rapid deployment.

AutomationDocument processingOCR

0 likes · 10 min read

How PaddleOCR Turns Handwritten Notes and PDFs into Editable Text in Seconds

Fun with Large Models

Oct 26, 2025 · Artificial Intelligence

From Deep Learning to Large‑Model OCR: Which Model Leads the Pack?

This article traces OCR's evolution from early CNN‑LSTM systems to modern multimodal VLMs, analyzes leading open‑source models such as DeepSeek‑OCR, PaddleOCR, and MonkeyOCR, and offers practical guidance for long‑document, academic, and edge‑computing scenarios.

DeepSeek-OCRMonkeyOCRMultimodal AI

0 likes · 15 min read

From Deep Learning to Large‑Model OCR: Which Model Leads the Pack?

Sohu Tech Products

Sep 17, 2025 · Artificial Intelligence

Choosing the Right Python OCR Library: pytesseract, cnocr, or PaddleOCR?

This article compares three popular Python OCR frameworks—pytesseract, cnocr, and PaddleOCR—examining their installation ease, Chinese recognition ability, model size, accuracy, and unique features, and provides practical code examples to help developers pick the best tool for their needs.

Image processingOCRPaddleOCR

0 likes · 5 min read

Choosing the Right Python OCR Library: pytesseract, cnocr, or PaddleOCR?

Baidu Geek Talk

Jul 9, 2025 · Artificial Intelligence

PaddleOCR 3.1 Unveils Multilingual PP‑OCRv5, Document Translation, and MCP Server Integration

PaddleOCR 3.1 introduces three major upgrades—a multilingual PP‑OCRv5 model supporting 37 languages with over 30% accuracy gain, a PP‑DocTranslation pipeline for high‑quality multi‑language document translation, and MCP server support for flexible AI application integration—accompanied by detailed CLI usage, demo scenarios, and open‑source resources.

.aiMCPMultilingual

0 likes · 11 min read

PaddleOCR 3.1 Unveils Multilingual PP‑OCRv5, Document Translation, and MCP Server Integration

Baidu Geek Talk

Dec 11, 2024 · Artificial Intelligence

How AI Cuts Essay Grading Time by 6×: Inside the Smart Writing Platform

This article examines how an AI‑powered essay‑grading platform combines PaddleOCR and Baidu's Wenxin large model to automate scoring, generate personalized feedback, and reduce teachers' grading workload by over six times, while improving student learning outcomes across hundreds of Chinese schools.

.aiEducation TechnologyPaddleOCR

0 likes · 11 min read

How AI Cuts Essay Grading Time by 6×: Inside the Smart Writing Platform

Baidu Tech Salon

Dec 6, 2024 · Industry Insights

How AI Cut Chinese Essay Grading Time by Sixfold: A Real‑World Case Study

Chinese teachers spend hours grading essays, but an AI‑powered platform that combines Baidu's PaddleOCR and the Wenxin large model now automates scoring, feedback and analytics, delivering six‑times faster results and personalized guidance for both teachers and students across hundreds of schools.

Case StudyEducational TechnologyPaddleOCR

0 likes · 11 min read

How AI Cut Chinese Essay Grading Time by Sixfold: A Real‑World Case Study

Sohu Tech Products

Dec 27, 2023 · Artificial Intelligence

OCR-Based Video Review System: Technology Selection, Optimization, and Model Fine-Tuning

An OCR‑based video review system using PaddleOCR’s DB detector and SVTR recognizer, combined with multi‑level frame deduplication, message‑queue task decoupling, Redis prioritization, and dynamic thread‑pool scheduling, was fine‑tuned on 5 000 samples to cut daily frames from 794 million to 3.6 million, achieving automated detection of over 230 abnormal videos per day and replacing three manual reviewers, with future plans for GPU acceleration and cross‑instance GRPC dispatch.

.aiOCRPaddleOCR

0 likes · 20 min read

OCR-Based Video Review System: Technology Selection, Optimization, and Model Fine-Tuning

Baidu Geek Talk

Oct 17, 2022 · Artificial Intelligence

OCR Technology: PaddleOCR and Paddle.js Integration

The article explains OCR fundamentals and details how Baidu’s open‑source PaddleOCR suite can be converted and run in browsers via the @paddlejs‑models/ocr SDK, describing model initialization, detection and CRNN‑based recognition pipelines, and presenting benchmark results that show the newer ch_PP‑OCRv2 model achieving higher accuracy and faster inference than the mobile variant.

.aiOCRPaddle.js

0 likes · 9 min read

OCR Technology: PaddleOCR and Paddle.js Integration

Baidu Geek Talk

Apr 28, 2022 · Artificial Intelligence

How AI Powers Financial Form Automation and Insurance Q&A: Open‑Source Solutions

This article presents open‑source AI solutions for financial form recognition and insurance smart Q&A, detailing the challenges, model choices, optimization strategies, performance results, and deployment methods using PaddleOCR, PaddleNLP, LayoutXLM, RocketQA and SimCSE.

.aiFinTechForm Recognition

0 likes · 10 min read

How AI Powers Financial Form Automation and Insurance Q&A: Open‑Source Solutions

Baidu App Technology

Dec 7, 2021 · Artificial Intelligence

Paddle.js OCR SDK: Text Recognition in Web Browsers

Paddle.js OCR SDK brings Baidu’s lightweight PaddleOCR models to web browsers, offering init() and recognize() APIs that load the ch_PP-OCRv2 detection (DB) and recognition (CRNN with bidirectional LSTM) models in parallel, achieving 258 ms detection, 60 ms recognition, 0.52 F‑score, and a combined size under 12 MB.

.aiOCRPaddle.js

0 likes · 7 min read

Paddle.js OCR SDK: Text Recognition in Web Browsers

Baidu Geek Talk

Sep 8, 2021 · Artificial Intelligence

How PP‑OCRv2 Boosts OCR Speed and Accuracy with Five Key Innovations

The article provides a comprehensive technical overview of PaddleOCR's PP‑OCRv2, detailing its five major algorithmic enhancements, performance improvements over previous versions, historical milestones, core capabilities, and links to the open‑source repositories for developers interested in state‑of‑the‑art OCR solutions.

Knowledge DistillationModel OptimizationOCR

0 likes · 10 min read

How PP‑OCRv2 Boosts OCR Speed and Accuracy with Five Key Innovations

Baidu Geek Talk

Aug 4, 2021 · Artificial Intelligence

PaddleOCR v2.2 Release: PP-Structure for Document Layout Analysis and Table Recognition

PaddleOCR v2.2 launches PP‑Structure, a Python‑installable toolkit that combines PP‑YOLO v2 layout analysis (classifying text, title, table, image, list) with RARE‑based table recognition to extract structured content and export editable Excel files, while supporting custom training and simple command‑line use.

.aiDeep LearningExcel Export

0 likes · 8 min read

PaddleOCR v2.2 Release: PP-Structure for Document Layout Analysis and Table Recognition

iQIYI Technical Product Team

Mar 26, 2021 · Artificial Intelligence

Insights into OCR Technology at iQIYI: Development, Challenges, and Applications

iQIYI’s OCR journey, explained by researcher Harlon, covers the evolution from separate detection and recognition pipelines to end‑to‑end models, key algorithms like CTPN, DB and CRNN, large‑scale simulated training, diverse video‑text applications, and future goals such as mobile deployment and tighter NLP integration.

.aiDeep LearningOCR

0 likes · 21 min read

Insights into OCR Technology at iQIYI: Development, Challenges, and Applications

DataFunTalk

Sep 23, 2020 · Artificial Intelligence

PaddleOCR: 2020’s Outstanding Open‑Source OCR Suite with a 3.5 MB Ultra‑Light Model

PaddleOCR, the 2020 breakthrough in open‑source OCR, offers ultra‑light 3.5 MB multilingual models, high F1‑score performance across diverse scenarios, easy installation via pip, comprehensive documentation, custom training support, and deployment options for both server and mobile platforms, all backed by detailed benchmarks and code examples.

OCROpen-sourcePaddleOCR

0 likes · 8 min read

PaddleOCR: 2020’s Outstanding Open‑Source OCR Suite with a 3.5 MB Ultra‑Light Model