Tag

text recognition


Rare Earth Juejin Tech Community
Aug 12, 2023 · Artificial Intelligence

An Introduction to OCR: Concepts, History, Applications, Datasets, and Technical Workflow

This article provides a comprehensive overview of Optical Character Recognition (OCR), covering its definition, historical development, classification, real‑world applications, technical pipeline, common challenges, mitigation strategies, popular datasets, model performance comparisons, and leading open‑source platforms.

OCR · Optical Character Recognition · computer vision
0 likes · 16 min read
Shopee Tech Team
Nov 10, 2022 · Artificial Intelligence

ShopeeVideo OCR: Multi-language Text Recognition System for E-commerce Video

ShopeeVideo OCR is a multi‑language text‑recognition system for Southeast Asian e‑commerce videos. It unifies detection, Transformer‑based recognition, layout analysis, and large‑scale synthetic data generation to handle Indonesian, Filipino, English, Vietnamese, Thai, and Chinese text. The article reports industry‑leading accuracy and thirteen first‑place ICDAR awards.

Multi-language OCR · OCR · Optical Character Recognition
0 likes · 15 min read
Baidu Geek Talk
Oct 17, 2022 · Artificial Intelligence

OCR Technology: PaddleOCR and Paddle.js Integration

The article explains OCR fundamentals and details how Baidu’s open‑source PaddleOCR suite can be converted and run in browsers via the @paddlejs‑models/ocr SDK, describing model initialization, detection and CRNN‑based recognition pipelines, and presenting benchmark results that show the newer ch_PP‑OCRv2 model achieving higher accuracy and faster inference than the mobile variant.

AI · OCR · Paddle.js
0 likes · 9 min read
DataFunSummit
Sep 6, 2022 · Artificial Intelligence

Recent Advances in Self‑Supervised Learning for Text Recognition (OCR)

This article reviews recent progress in applying self‑supervised learning to OCR text recognition, covering mainstream model architectures, key considerations for self‑supervised tasks on text images, and detailed analyses of representative papers such as SeqCLR, SimAN, and DiG, highlighting their designs, experiments, and results.

OCR · computer vision · contrastive learning
0 likes · 20 min read
Laiye Technology Team
Aug 15, 2022 · Artificial Intelligence

Recent Advances in Self‑Supervised Learning for Text Recognition

This article reviews recent self‑supervised learning approaches for optical character recognition, covering mainstream OCR model architectures, key factors for applying contrastive and masked image modeling methods to text images, and detailed analyses of representative works such as SeqCLR, SimAN, and DiG, including their designs and experimental results.

OCR · computer vision · contrastive learning
0 likes · 19 min read
Baidu App Technology
Dec 7, 2021 · Artificial Intelligence

Paddle.js OCR SDK: Text Recognition in Web Browsers

Paddle.js OCR SDK brings Baidu’s lightweight PaddleOCR models to web browsers, offering init() and recognize() APIs that load the ch_PP-OCRv2 detection (DB) and recognition (CRNN with bidirectional LSTM) models in parallel, achieving 258 ms detection, 60 ms recognition, 0.52 F‑score, and a combined size under 12 MB.

AI · Machine Learning · OCR
0 likes · 7 min read
Cyber Elephant Tech Team
Oct 14, 2021 · Artificial Intelligence

Mastering OCR: From Traditional Techniques to Deep Learning Solutions

This article provides a comprehensive overview of Optical Character Recognition, covering its traditional applications, the evolution to deep learning methods, key datasets, popular tools, and practical strategies for tackling diverse OCR challenges in real-world scenarios.

CRNN · EAST · OCR
0 likes · 18 min read
Baidu Geek Talk
Aug 4, 2021 · Artificial Intelligence

PaddleOCR v2.2 Release: PP-Structure for Document Layout Analysis and Table Recognition

PaddleOCR v2.2 launches PP‑Structure, a Python‑installable toolkit that combines PP‑YOLO v2 layout analysis (classifying text, title, table, image, list) with RARE‑based table recognition to extract structured content and export editable Excel files, while supporting custom training and simple command‑line use.

AI · Excel Export · PP-Structure
0 likes · 8 min read
iQIYI Technical Product Team
Mar 26, 2021 · Artificial Intelligence

Insights into OCR Technology at iQIYI: Development, Challenges, and Applications

iQIYI’s OCR journey, explained by researcher Harlon, covers the evolution from separate detection and recognition pipelines to end‑to‑end models, key algorithms like CTPN, DB and CRNN, large‑scale simulated training, diverse video‑text applications, and future goals such as mobile deployment and tighter NLP integration.

AI · OCR · PaddleOCR
0 likes · 21 min read
Tencent Cloud Developer
Mar 4, 2021 · Artificial Intelligence

WeChat OCR: Implementation of Image Text Extraction Feature

WeChat’s 8.0 update introduced an OCR pipeline that first quickly detects whether an image contains text and classifies the image type. It then applies a lightweight, MobileNetV3‑based DBNet network for multi‑language text detection and a multi‑task CTC/Attention model for recognition, and merges the results via a rule‑based layout analyzer to deliver accurate, well‑formatted extracted text across diverse languages and document types.

DBNet · OCR · Optical Character Recognition
0 likes · 13 min read
Tencent Cloud Developer
Jun 5, 2019 · Artificial Intelligence

Tencent Cloud OCR Technology: Principles, Challenges, and Industry Applications

Tencent Cloud OCR leverages deep‑learning‑based text detection and recognition, including Compact Inception and multi‑layer RNN refinements, to overcome challenges such as complex backgrounds, low resolution, and multilingual layouts. It delivers over 90% accuracy on ID cards, bank cards, business licenses, and handwritten text, and powers fast, cost‑saving applications in logistics, QQ, and WeChat Work.

Machine Learning · OCR · Optical Character Recognition
0 likes · 7 min read
Ctrip Technology
Feb 28, 2019 · Artificial Intelligence

OCR Techniques and Solutions for Ctrip Business: Deep Learning Based Text Detection and Recognition

This article presents an overview of computer‑vision‑based OCR in Ctrip's operations, detailing deep‑learning text detection methods for controlled and uncontrolled scenarios, sequence‑based recognition models, training strategies with synthetic data, and performance results, and discusses current challenges and future improvements.

AI · Ctrip · OCR
0 likes · 11 min read