Tagged articles

16 articles

Page 1 of 1

Oct 30, 2025 · Artificial Intelligence

How PaddleOCR Turns Handwritten Notes and PDFs into Editable Text in Seconds

This article explains how PaddleOCR, an open‑source OCR engine from Baidu, achieves high‑accuracy text extraction from handwritten notes, scanned PDFs, invoices, IDs and multilingual documents, offering offline cross‑platform support, free commercial use, and step‑by‑step guidance for rapid deployment.

Document ProcessingOCROpen-source

0 likes · 10 min read

How PaddleOCR Turns Handwritten Notes and PDFs into Editable Text in Seconds

Rare Earth Juejin Tech Community

Aug 12, 2023 · Artificial Intelligence

An Introduction to OCR: Concepts, History, Applications, Datasets, and Technical Workflow

This article provides a comprehensive overview of Optical Character Recognition (OCR), covering its definition, historical development, classification, real‑world applications, technical pipeline, common challenges, mitigation strategies, popular datasets, model performance comparisons, and leading open‑source platforms.

Computer VisionDatasetsDeep Learning

0 likes · 16 min read

An Introduction to OCR: Concepts, History, Applications, Datasets, and Technical Workflow

Shopee Tech Team

Nov 10, 2022 · Artificial Intelligence

ShopeeVideo OCR: Multi-language Text Recognition System for E-commerce Video

ShopeeVideo OCR is a multi‑language text‑recognition system for Southeast Asian e‑commerce videos that unifies detection, Transformer‑based recognition, layout analysis, and large‑scale synthetic data generation to handle Indonesian, Filipino, English, Vietnamese, Thai and Chinese scripts, delivering industry‑leading accuracy and winning thirteen ICDAR first‑place awards.

Computer VisionDeep LearningMulti-language OCR

0 likes · 15 min read

ShopeeVideo OCR: Multi-language Text Recognition System for E-commerce Video

Baidu Geek Talk

Oct 17, 2022 · Artificial Intelligence

OCR Technology: PaddleOCR and Paddle.js Integration

The article explains OCR fundamentals and details how Baidu’s open‑source PaddleOCR suite can be converted and run in browsers via the @paddlejs‑models/ocr SDK, describing model initialization, detection and CRNN‑based recognition pipelines, and presenting benchmark results that show the newer ch_PP‑OCRv2 model achieving higher accuracy and faster inference than the mobile variant.

AIComputer VisionOCR

0 likes · 9 min read

OCR Technology: PaddleOCR and Paddle.js Integration

DataFunSummit

Sep 6, 2022 · Artificial Intelligence

Recent Advances in Self‑Supervised Learning for Text Recognition (OCR)

This article reviews recent progress in applying self‑supervised learning to OCR text recognition, covering mainstream model architectures, key considerations for self‑supervised tasks on text images, and detailed analyses of representative papers such as SeqCLR, SimAN, and DiG, highlighting their designs, experiments, and results.

Computer VisionOCRcontrastive learning

0 likes · 20 min read

Recent Advances in Self‑Supervised Learning for Text Recognition (OCR)

Laiye Technology Team

Aug 15, 2022 · Artificial Intelligence

Recent Advances in Self‑Supervised Learning for Text Recognition

This article reviews recent self‑supervised learning approaches for optical character recognition, covering mainstream OCR model architectures, key factors for applying contrastive and masked image modeling methods to text images, and detailed analyses of representative works such as SeqCLR, SimAN, and DiG, including their designs and experimental results.

OCRcontrastive learningmasked image modeling

0 likes · 19 min read

Recent Advances in Self‑Supervised Learning for Text Recognition

Baidu App Technology

Dec 7, 2021 · Artificial Intelligence

Paddle.js OCR SDK: Text Recognition in Web Browsers

Paddle.js OCR SDK brings Baidu’s lightweight PaddleOCR models to web browsers, offering init() and recognize() APIs that load the ch_PP-OCRv2 detection (DB) and recognition (CRNN with bidirectional LSTM) models in parallel, achieving 258 ms detection, 60 ms recognition, 0.52 F‑score, and a combined size under 12 MB.

AIOCRPaddle.js

0 likes · 7 min read

Paddle.js OCR SDK: Text Recognition in Web Browsers

Cyber Elephant Tech Team

Oct 14, 2021 · Artificial Intelligence

Mastering OCR: From Traditional Techniques to Deep Learning Solutions

This article provides a comprehensive overview of Optical Character Recognition, covering its traditional applications, the evolution to deep learning methods, key datasets, popular tools, and practical strategies for tackling diverse OCR challenges in real-world scenarios.

CRNNComputer VisionDatasets

0 likes · 18 min read

Mastering OCR: From Traditional Techniques to Deep Learning Solutions

Baidu Geek Talk

Aug 4, 2021 · Artificial Intelligence

PaddleOCR v2.2 Release: PP-Structure for Document Layout Analysis and Table Recognition

PaddleOCR v2.2 launches PP‑Structure, a Python‑installable toolkit that combines PP‑YOLO v2 layout analysis (classifying text, title, table, image, list) with RARE‑based table recognition to extract structured content and export editable Excel files, while supporting custom training and simple command‑line use.

AIDeep LearningPP-Structure

0 likes · 8 min read

PaddleOCR v2.2 Release: PP-Structure for Document Layout Analysis and Table Recognition

TiPaiPai Technical Team

Aug 2, 2021 · Artificial Intelligence

How Attention Boosts Text Recognition: From CNN‑Seq2Seq to Multi‑Scale Models

This article explains how attention mechanisms are applied to text recognition, covering the basic CNN‑Seq2Seq‑Attention architecture, multi‑scale attention extensions, and a 2D attentional irregular scene text recognizer with detailed network components, training loss, and experimental results.

CNNComputer VisionDeep Learning

0 likes · 8 min read

How Attention Boosts Text Recognition: From CNN‑Seq2Seq to Multi‑Scale Models

TiPaiPai Technical Team

Jun 18, 2021 · Artificial Intelligence

Mastering Text Recognition: Encoder & Decoder Strategies Explained

This article reviews modern text‑recognition systems, detailing how encoders such as CNN, CNN‑BiLSTM, and Transformer‑based models extract visual features, and how decoders like Position Attention, Transformer decoders, and RNN Seq2Seq align variable‑length text, while also discussing CTC loss and practical design choices.

CNNCTCDecoder

0 likes · 9 min read

Mastering Text Recognition: Encoder & Decoder Strategies Explained

iQIYI Technical Product Team

Mar 26, 2021 · Artificial Intelligence

Insights into OCR Technology at iQIYI: Development, Challenges, and Applications

iQIYI’s OCR journey, explained by researcher Harlon, covers the evolution from separate detection and recognition pipelines to end‑to‑end models, key algorithms like CTPN, DB and CRNN, large‑scale simulated training, diverse video‑text applications, and future goals such as mobile deployment and tighter NLP integration.

AIComputer VisionDeep Learning

0 likes · 21 min read

Insights into OCR Technology at iQIYI: Development, Challenges, and Applications

Tencent Cloud Developer

Mar 4, 2021 · Artificial Intelligence

WeChat OCR: Implementation of Image Text Extraction Feature

WeChat’s 8.0 update introduced an OCR pipeline that first quickly detects text in images, classifies the image type, applies a lightweight multi‑language detection network and a MobileNetV3‑based DBNet recognizer with a multi‑task CTC/Attention model, then merges results via a rule‑based layout analyzer to deliver accurate, well‑formatted extracted text across diverse languages and document types.

Computer VisionDBNetDeep Learning

0 likes · 13 min read

WeChat OCR: Implementation of Image Text Extraction Feature

Python Crawling & Data Mining

Dec 10, 2020 · Artificial Intelligence

How to Build a Python OCR & Image Converter with Baidu API and Pillow

Learn step‑by‑step how to use Baidu’s OCR service to extract text from images and employ the Pillow library to convert image formats in Python, including code snippets, API parameter details, and practical tips for handling local and online files.

Baidu APIOCRimage-processing

0 likes · 7 min read

How to Build a Python OCR & Image Converter with Baidu API and Pillow

Tencent Cloud Developer

Jun 5, 2019 · Artificial Intelligence

Tencent Cloud OCR Technology: Principles, Challenges, and Industry Applications

Tencent Cloud OCR leverages deep‑learning‑based text detection and recognition, including Compact Inception and multi‑layer RNN refinements, to overcome challenges such as complex backgrounds, low resolution, and multilingual layouts, delivering over 90% accuracy for ID cards, bank cards, business licenses, handwritten text, and powering fast, cost‑saving applications in logistics, QQ, and WeChat Work.

Deep LearningImage ProcessingOCR

0 likes · 7 min read

Tencent Cloud OCR Technology: Principles, Challenges, and Industry Applications

Ctrip Technology

Feb 28, 2019 · Artificial Intelligence

OCR Techniques and Solutions for Ctrip Business: Deep Learning Based Text Detection and Recognition

This article presents an overview of computer‑vision based OCR in Ctrip's operations, detailing deep‑learning text detection methods for controlled and uncontrolled scenarios, sequence‑based recognition models, training strategies with synthetic data, and performance results, while discussing current challenges and future improvements.

AIComputer VisionCtrip

0 likes · 11 min read

OCR Techniques and Solutions for Ctrip Business: Deep Learning Based Text Detection and Recognition