Tagged articles
5 articles
Old Zhang's AI Learning
Feb 8, 2026 · Artificial Intelligence

Choosing the Best OCR Large Model: DeepSeek‑OCR‑2, HunyuanOCR, PaddleOCR‑VL‑1.5, and GLM‑OCR Compared

This article compares four OCR large models in technical detail: DeepSeek‑OCR‑2, HunyuanOCR, PaddleOCR‑VL‑1.5, and GLM‑OCR. It covers their architectures, parameter sizes, release dates, licensing, core features, strengths, weaknesses, benchmark scores, multilingual support, deployment requirements, and recommended use cases, to help readers select the model that best fits their needs.

DeepSeek-OCR 2 · GLM-OCR · HunyuanOCR
17 min read
Alibaba Cloud Big Data AI Platform
Mar 12, 2024 · Artificial Intelligence

AAAI‑2024 Highlights: Alibaba Cloud’s Deep Tabular Learning & Multi‑Modal Fusion

Alibaba Cloud’s AI platform PAI showcased four cutting‑edge papers at AAAI‑2024—introducing AMFormer for deep tabular learning via arithmetic feature interaction, MuLTI for efficient video‑language understanding, M2SD for few‑shot class‑incremental learning, and M2Doc for multi‑modal document layout analysis—demonstrating the platform’s growing impact on artificial‑intelligence research.

Deep Learning · Few‑Shot Learning · Multimodal AI
9 min read
Laiye Technology Team
Sep 28, 2022 · Artificial Intelligence

Checkbox Detection and State Classification Using YOLOv5

This article describes a solution for detecting checkboxes in document images and determining whether each one is checked or unchecked. It combines YOLOv5 object detection, synthetic and semi‑synthetic data generation, specialized post‑processing, and association logic to handle varied shapes, positions, and markings.

YOLOv5 · checkbox detection · data synthesis
13 min read
Alibaba Terminal Technology
Sep 23, 2021 · Artificial Intelligence

Real‑Time Document Corner Detection on Mobile: Heatmap‑Based Keypoint Algorithms Explained

This article reviews the end‑to‑end pipeline for real‑time document corner detection on mobile devices, breaks down the keypoint detection workflow into image processing, encoding, network modeling and decoding, compares heatmap‑based and fully‑connected approaches, introduces a differentiable DSNT decoding method with unbiased coordinate transformations, and presents experimental results and conclusions on its effectiveness and limitations.

DSNT · Mobile AI · document-analysis
15 min read
Tencent Cloud Developer
Aug 10, 2018 · Artificial Intelligence

Overview of OCR Technology and Its Applications on Tencent Cloud

The talk outlines OCR’s evolution from early postal-code readers to modern deep‑learning models, explains Tencent Cloud’s fast, accurate services for printed and handwritten text—including table‑structured and general OCR—and showcases real‑world applications such as ID cards, business cards, license plates, checks, and medical documents while highlighting ongoing challenges and future enhancements.

Cloud Services · Deep Learning · OCR
19 min read