Tagged articles
18 articles
SuanNi
Apr 30, 2026 · Artificial Intelligence

Deploy a 24/7 Document Recognition Toolbox with the PaddleOCR Image on the Cloud

This guide explains how to use Baidu's open‑source PaddleOCR engine—its full OCR and layout analysis pipeline, multi‑language support, and output formats—to set up a continuously running document recognition service on the 算网 GPU cloud platform, including environment preparation, model configuration, and inference execution.

Document Processing · GPU · MagicMind
0 likes · 6 min read
AI Explorer
Mar 30, 2026 · Industry Insights

AI Tech Daily March 30 2026: PaddleOCR tops GitHub, New Large Models, and Security Advances

The March 30, 2026 AI Tech Daily highlights PaddleOCR surpassing Tesseract with 73.3K stars, GigaWorld's GigaBrain models leading global rankings, Eli Lilly's 2.75 billion‑RMB AI drug‑discovery investment, Doubao joining top‑tier LLMs, Stanford's InfoTok video tokenization breakthrough, Claude's rapid vulnerability exploit, a new Claude model beating Opus 4.6, Node.js core members petitioning to ban AI‑assisted coding, Kimi K2.5 reaching $1 billion monthly revenue, a startup's tiny model delivering 15× inference speed, and Claude's 15 KB engine boosting rendering speed 1,200×.

AI drug discovery · Claude · GigaBrain
0 likes · 6 min read
Old Meng AI Explorer
Dec 26, 2025 · Artificial Intelligence

How PaddleOCR Boosts Text Extraction Efficiency 10×: A Hands‑On Review

PaddleOCR, Baidu’s open‑source OCR engine, delivers high‑accuracy multilingual text extraction from images, PDFs, and handwritten notes, offering offline operation, free commercial use, and specialized models for invoices, IDs, and tables, enabling users to automate document processing and increase productivity up to tenfold.

Document Automation · OCR · PaddleOCR
0 likes · 9 min read
Old Meng AI Explorer
Oct 30, 2025 · Artificial Intelligence

How PaddleOCR Turns Handwritten Notes and PDFs into Editable Text in Seconds

This article explains how PaddleOCR, an open‑source OCR engine from Baidu, achieves high‑accuracy text extraction from handwritten notes, scanned PDFs, invoices, IDs and multilingual documents, offering offline cross‑platform support, free commercial use, and step‑by‑step guidance for rapid deployment.

Document Processing · OCR · PaddleOCR
0 likes · 10 min read
Fun with Large Models
Oct 26, 2025 · Artificial Intelligence

From Deep Learning to Large‑Model OCR: Which Model Leads the Pack?

This article traces OCR's evolution from early CNN‑LSTM systems to modern multimodal VLMs, analyzes leading open‑source models such as DeepSeek‑OCR, PaddleOCR, and MonkeyOCR, and offers practical guidance for long‑document, academic, and edge‑computing scenarios.

MonkeyOCR · Multimodal AI · OCR
0 likes · 15 min read
Sohu Tech Products
Sep 17, 2025 · Artificial Intelligence

Choosing the Right Python OCR Library: pytesseract, cnocr, or PaddleOCR?

This article compares three popular Python OCR frameworks—pytesseract, cnocr, and PaddleOCR—examining their installation ease, Chinese recognition ability, model size, accuracy, and unique features, and provides practical code examples to help developers pick the best tool for their needs.

Image Processing · OCR · PaddleOCR
0 likes · 5 min read
Baidu Geek Talk
Jul 9, 2025 · Artificial Intelligence

PaddleOCR 3.1 Unveils Multilingual PP‑OCRv5, Document Translation, and MCP Server Integration

PaddleOCR 3.1 introduces three major upgrades—a multilingual PP‑OCRv5 model supporting 37 languages with over 30% accuracy gain, a PP‑DocTranslation pipeline for high‑quality multi‑language document translation, and MCP server support for flexible AI application integration—accompanied by detailed CLI usage, demo scenarios, and open‑source resources.

Computer Vision · MCP · OCR
0 likes · 11 min read
Baidu Geek Talk
Dec 11, 2024 · Artificial Intelligence

How AI Cuts Essay Grading Time by 6×: Inside the Smart Writing Platform

This article examines how an AI‑powered essay‑grading platform combines PaddleOCR and Baidu's Wenxin large model to automate scoring, generate personalized feedback, and reduce teachers' grading workload by over six times, while improving student learning outcomes across hundreds of Chinese schools.

Education Technology · PaddleOCR · Product Review
0 likes · 11 min read
Baidu Tech Salon
Dec 6, 2024 · Industry Insights

How AI Cut Chinese Essay Grading Time by Sixfold: A Real‑World Case Study

Chinese teachers spend hours grading essays, but an AI‑powered platform that combines Baidu's PaddleOCR and the Wenxin large model now automates scoring, feedback and analytics, delivering six‑times faster results and personalized guidance for both teachers and students across hundreds of schools.

Educational Technology · PaddleOCR · Wenxin model
0 likes · 11 min read
Sohu Tech Products
Dec 27, 2023 · Artificial Intelligence

OCR-Based Video Review System: Technology Selection, Optimization, and Model Fine-Tuning

This article describes an OCR‑based video review system built on PaddleOCR’s DB detector and SVTR recognizer, combined with multi‑level frame deduplication, message‑queue task decoupling, Redis prioritization, and dynamic thread‑pool scheduling. With models fine‑tuned on 5,000 samples, the system cut daily frames from 794 million to 3.6 million, automatically detects over 230 abnormal videos per day, and replaced three manual reviewers. Future plans include GPU acceleration and cross‑instance gRPC dispatch.

Fine-tuning · Model Selection · OCR
0 likes · 20 min read
Baidu Geek Talk
Oct 17, 2022 · Artificial Intelligence

OCR Technology: PaddleOCR and Paddle.js Integration

The article explains OCR fundamentals and details how Baidu’s open‑source PaddleOCR suite can be converted and run in browsers via the @paddlejs‑models/ocr SDK, describing model initialization, detection and CRNN‑based recognition pipelines, and presenting benchmark results that show the newer ch_PP‑OCRv2 model achieving higher accuracy and faster inference than the mobile variant.

Computer Vision · OCR · Paddle.js
0 likes · 9 min read
Baidu App Technology
Dec 7, 2021 · Artificial Intelligence

Paddle.js OCR SDK: Text Recognition in Web Browsers

Paddle.js OCR SDK brings Baidu’s lightweight PaddleOCR models to web browsers, offering init() and recognize() APIs that load the ch_PP-OCRv2 detection (DB) and recognition (CRNN with bidirectional LSTM) models in parallel, achieving 258 ms detection, 60 ms recognition, 0.52 F‑score, and a combined size under 12 MB.

OCR · Paddle.js · PaddleOCR
0 likes · 7 min read
Baidu Geek Talk
Sep 8, 2021 · Artificial Intelligence

How PP‑OCRv2 Boosts OCR Speed and Accuracy with Five Key Innovations

The article provides a comprehensive technical overview of PaddleOCR's PP‑OCRv2, detailing its five major algorithmic enhancements, performance improvements over previous versions, historical milestones, core capabilities, and links to the open‑source repositories for developers interested in state‑of‑the‑art OCR solutions.

Computer Vision · Model Optimization · OCR
0 likes · 10 min read
Baidu Geek Talk
Aug 4, 2021 · Artificial Intelligence

PaddleOCR v2.2 Release: PP-Structure for Document Layout Analysis and Table Recognition

PaddleOCR v2.2 launches PP‑Structure, a Python‑installable toolkit that combines PP‑YOLO v2 layout analysis (classifying text, title, table, image, list) with RARE‑based table recognition to extract structured content and export editable Excel files, while supporting custom training and simple command‑line use.

Deep Learning · PP-Structure · PaddleOCR
0 likes · 8 min read
iQIYI Technical Product Team
Mar 26, 2021 · Artificial Intelligence

Insights into OCR Technology at iQIYI: Development, Challenges, and Applications

iQIYI’s OCR journey, explained by researcher Harlon, covers the evolution from separate detection and recognition pipelines to end‑to‑end models, key algorithms like CTPN, DB and CRNN, large‑scale simulated training, diverse video‑text applications, and future goals such as mobile deployment and tighter NLP integration.

Computer Vision · Deep Learning · OCR
0 likes · 21 min read
DataFunTalk
Sep 23, 2020 · Artificial Intelligence

PaddleOCR: 2020’s Outstanding Open‑Source OCR Suite with a 3.5 MB Ultra‑Light Model

PaddleOCR, the 2020 breakthrough in open‑source OCR, offers ultra‑light 3.5 MB multilingual models, high F1‑score performance across diverse scenarios, easy installation via pip, comprehensive documentation, custom training support, and deployment options for both server and mobile platforms, all backed by detailed benchmarks and code examples.

OCR · PaddleOCR · Python
0 likes · 8 min read