Tag

PaddleOCR

0 views collected around this technical thread.

Sohu Tech Products
Sohu Tech Products
Dec 27, 2023 · Artificial Intelligence

OCR-Based Video Review System: Technology Selection, Optimization, and Model Fine-Tuning

An OCR‑based video review system using PaddleOCR’s DB detector and SVTR recognizer, combined with multi‑level frame deduplication, message‑queue task decoupling, Redis prioritization, and dynamic thread‑pool scheduling, was fine‑tuned on 5 000 samples to cut daily frames from 794 million to 3.6 million, achieving automated detection of over 230 abnormal videos per day and replacing three manual reviewers, with future plans for GPU acceleration and cross‑instance GRPC dispatch.

AIFine-tuningMultithreading
0 likes · 20 min read
OCR-Based Video Review System: Technology Selection, Optimization, and Model Fine-Tuning
Baidu Geek Talk
Baidu Geek Talk
Oct 17, 2022 · Artificial Intelligence

OCR Technology: PaddleOCR and Paddle.js Integration

The article explains OCR fundamentals and details how Baidu’s open‑source PaddleOCR suite can be converted and run in browsers via the @paddlejs‑models/ocr SDK, describing model initialization, detection and CRNN‑based recognition pipelines, and presenting benchmark results that show the newer ch_PP‑OCRv2 model achieving higher accuracy and faster inference than the mobile variant.

AIOCRPaddle.js
0 likes · 9 min read
OCR Technology: PaddleOCR and Paddle.js Integration
Baidu App Technology
Baidu App Technology
Dec 7, 2021 · Artificial Intelligence

Paddle.js OCR SDK: Text Recognition in Web Browsers

Paddle.js OCR SDK brings Baidu’s lightweight PaddleOCR models to web browsers, offering init() and recognize() APIs that load the ch_PP-OCRv2 detection (DB) and recognition (CRNN with bidirectional LSTM) models in parallel, achieving 258 ms detection, 60 ms recognition, 0.52 F‑score, and a combined size under 12 MB.

AIMachine LearningOCR
0 likes · 7 min read
Paddle.js OCR SDK: Text Recognition in Web Browsers
Baidu Geek Talk
Baidu Geek Talk
Aug 4, 2021 · Artificial Intelligence

PaddleOCR v2.2 Release: PP-Structure for Document Layout Analysis and Table Recognition

PaddleOCR v2.2 launches PP‑Structure, a Python‑installable toolkit that combines PP‑YOLO v2 layout analysis (classifying text, title, table, image, list) with RARE‑based table recognition to extract structured content and export editable Excel files, while supporting custom training and simple command‑line use.

AIExcel ExportPP-Structure
0 likes · 8 min read
PaddleOCR v2.2 Release: PP-Structure for Document Layout Analysis and Table Recognition
iQIYI Technical Product Team
iQIYI Technical Product Team
Mar 26, 2021 · Artificial Intelligence

Insights into OCR Technology at iQIYI: Development, Challenges, and Applications

iQIYI’s OCR journey, explained by researcher Harlon, covers the evolution from separate detection and recognition pipelines to end‑to‑end models, key algorithms like CTPN, DB and CRNN, large‑scale simulated training, diverse video‑text applications, and future goals such as mobile deployment and tighter NLP integration.

AIOCRPaddleOCR
0 likes · 21 min read
Insights into OCR Technology at iQIYI: Development, Challenges, and Applications
DataFunTalk
DataFunTalk
Sep 23, 2020 · Artificial Intelligence

PaddleOCR: 2020’s Outstanding Open‑Source OCR Suite with a 3.5 MB Ultra‑Light Model

PaddleOCR, the 2020 breakthrough in open‑source OCR, offers ultra‑light 3.5 MB multilingual models, high F1‑score performance across diverse scenarios, easy installation via pip, comprehensive documentation, custom training support, and deployment options for both server and mobile platforms, all backed by detailed benchmarks and code examples.

OCRPaddleOCRcomputer vision
0 likes · 8 min read
PaddleOCR: 2020’s Outstanding Open‑Source OCR Suite with a 3.5 MB Ultra‑Light Model