Tagged articles
289 articles
Page 3 of 3
Kuaishou Large Model
Kuaishou Large Model
Feb 25, 2021 · Artificial Intelligence

How Kuaishou’s AI‑Powered Beauty Engine Transforms Real‑Time Video

This article details Kuaishou Y‑tech’s Gorgeous beauty platform, covering traditional smoothing, advanced skin‑tone effects, AI‑driven blemish removal, clarity enhancement, local facial tuning, and the UNet‑based GorgeousGAN that delivers one‑click high‑definition beauty for live‑stream and short‑video applications.

AI beautyComputer VisionDeep Learning
0 likes · 13 min read
How Kuaishou’s AI‑Powered Beauty Engine Transforms Real‑Time Video
php Courses
php Courses
Feb 4, 2021 · Information Security

Analyzing and Decoding CAPTCHA Images Using PHP

This article explains how to extract RGB values from a CAPTCHA image with PHP, convert the pixel data into binary patterns, map those patterns to digits using a predefined dictionary, and achieve 100% recognition accuracy, illustrating a practical backend security technique.

BackendImage ProcessingOCR
0 likes · 4 min read
Analyzing and Decoding CAPTCHA Images Using PHP
Kuaishou Large Model
Kuaishou Large Model
Jan 28, 2021 · Artificial Intelligence

How Portrait Deformation Powers Modern Beauty Filters: Algorithms Explained

This article explores the core portrait deformation techniques behind today’s beauty and body‑shaping effects—covering affine transforms, Moving Least Squares, triangulation, liquify, offset, 3D mesh, and deep‑learning approaches—detailing their principles, implementations, and visual results in live‑streaming and short‑video apps.

AIComputer VisionImage Processing
0 likes · 13 min read
How Portrait Deformation Powers Modern Beauty Filters: Algorithms Explained
Programmer DD
Programmer DD
Dec 22, 2020 · Artificial Intelligence

How AI Can Unmask Pixelated Text: Inside the Depix Project

The article explains how the open‑source Depix tool uses AI to decode pixelated or censored text, outlines its algorithm based on De Bruijn sequences, provides usage instructions, and also highlights the related PULSE AI up‑sampling technique for restoring low‑resolution images.

AIDepixImage Processing
0 likes · 7 min read
How AI Can Unmask Pixelated Text: Inside the Depix Project
MaGe Linux Operations
MaGe Linux Operations
Dec 16, 2020 · Artificial Intelligence

How to Build an Automated Minesweeper Bot with Python and Win32 API

This guide walks through setting up a Python environment, installing required libraries, capturing the Minesweeper window, extracting tile colors, converting the board into a matrix, and implementing a basic algorithm that flags mines and clicks safe squares, all automated via Win32 API calls.

AutomationGame BotImage Processing
0 likes · 10 min read
How to Build an Automated Minesweeper Bot with Python and Win32 API
php Courses
php Courses
Nov 17, 2020 · Backend Development

Using Intervention Image Package in Laravel for Image Manipulation

This article introduces the Laravel-compatible Intervention Image package, explains how to install it via Composer, configure the service provider and alias, and demonstrates basic image operations such as resizing, saving, and resource destruction, including a known issue with overwriting files.

BackendComposerImage Processing
0 likes · 2 min read
Using Intervention Image Package in Laravel for Image Manipulation
Tencent Cloud Developer
Tencent Cloud Developer
Sep 14, 2020 · Mobile Development

Memory Optimization Techniques for Image-Intensive iOS Applications

To prevent crashes, forced‑out‑of‑memory terminations, and battery drain in image‑heavy iOS apps, developers should avoid retaining unused UIImages, use efficient scaling with UIGraphicsImageRenderer, employ autorelease pools, stream thumbnails via ImageIO, and downsample to view size, cutting peak memory usage from ~600 MB to ~221 MB.

Image ProcessingMemory OptimizationSwift
0 likes · 10 min read
Memory Optimization Techniques for Image-Intensive iOS Applications
Programmer DD
Programmer DD
Sep 4, 2020 · Artificial Intelligence

How to Build a Java Spring Boot License Plate Recognition System with OpenCV

An open-source Java Spring Boot project demonstrates license plate detection and recognition using OpenCV, supporting multiple plate colors, with SVM and ANN training, detailed architecture, feature list, installation guide, and visual processing steps, offering a beginner-friendly tutorial for image recognition enthusiasts.

Image ProcessingOpenCVSpring Boot
0 likes · 7 min read
How to Build a Java Spring Boot License Plate Recognition System with OpenCV
Programmer DD
Programmer DD
Aug 12, 2020 · Artificial Intelligence

Turn Photos and Videos into Cartoons with the Open‑Source Cartoonize AI

Cartoonize is an open‑source web application that leverages a white‑box GAN model to convert images and short videos into high‑quality cartoon style, offering easy Docker or virtualenv installation, detailed usage instructions, and insights into the underlying research paper.

AIDockerGAN
0 likes · 9 min read
Turn Photos and Videos into Cartoons with the Open‑Source Cartoonize AI
Java Captain
Java Captain
Aug 2, 2020 · Artificial Intelligence

Java Spring Boot License Plate Recognition and Training System (Open‑Source)

This article introduces an open‑source Java Spring Boot project that implements a license‑plate detection and recognition system with training capabilities, detailing its features, architecture, supported plate types, software requirements, processing steps, installation guide, and reference resources.

Deep LearningImage ProcessingJava
0 likes · 8 min read
Java Spring Boot License Plate Recognition and Training System (Open‑Source)
Top Architect
Top Architect
Aug 2, 2020 · Backend Development

Generating QR Codes in Java with ZXing and Maven

This tutorial demonstrates how to create a Maven Java project, add the ZXing library, implement utility classes for QR code generation and decoding, and run a sample application that produces a QR code image and reads its content back, complete with code examples and configuration details.

BackendImage ProcessingJava
0 likes · 9 min read
Generating QR Codes in Java with ZXing and Maven
Python Crawling & Data Mining
Python Crawling & Data Mining
Jul 5, 2020 · Artificial Intelligence

Unlock Python OpenCV: Load, Resize, Threshold, and Transform Images

This tutorial demonstrates how to use Python's OpenCV library to read and display images, perform resizing, convert to HSV, apply binary thresholding, execute erosion and dilation, and replace specific pixel colors, providing step-by-step code examples and visual results for each operation.

Image ProcessingOpenCVPixel Manipulation
0 likes · 8 min read
Unlock Python OpenCV: Load, Resize, Threshold, and Transform Images
Sohu Tech Products
Sohu Tech Products
Jun 10, 2020 · Mobile Development

Implementing Invisible LSB Watermark on iOS Images

This article explains how to embed and retrieve an invisible numeric watermark in iOS screenshots by modifying the least‑significant bit of the RGB pixel data, provides Swift code examples for encoding and decoding, discusses practical usage scenarios, and notes limitations and possible improvements.

Image ProcessingLSBSteganography
0 likes · 9 min read
Implementing Invisible LSB Watermark on iOS Images
Laravel Tech Community
Laravel Tech Community
Jun 4, 2020 · Backend Development

Using Intervention Image Package in Laravel: Installation, Configuration, and Basic Usage

This article introduces the Laravel-specific Intervention Image package, explains how to install it via Composer, configure the driver (GD or Imagick), demonstrates basic image manipulation such as resizing, watermarking, and saving, and highlights additional features like uploading, caching, filtering, and dynamic processing.

Image ProcessingIntervention ImageTutorial
0 likes · 4 min read
Using Intervention Image Package in Laravel: Installation, Configuration, and Basic Usage
iQIYI Technical Product Team
iQIYI Technical Product Team
Apr 3, 2020 · Artificial Intelligence

iCartoonFace Challenge: Cartoon Face Detection and Recognition Competition

The iCartoonFace Challenge invites participants to develop efficient algorithms for detecting and recognizing cartoon faces using large, meticulously annotated datasets—50,000 images for detection and nearly 390,000 for recognition—while meeting strict model size and latency limits and submitting detailed methods and code.

AI competitionCartoon Face RecognitionComputer Vision
0 likes · 6 min read
iCartoonFace Challenge: Cartoon Face Detection and Recognition Competition
21CTO
21CTO
Mar 22, 2020 · Fundamentals

How a Wuhan Student Used Python to Paint a Blooming Cherry Blossom from Pixels

A Wuhan University student turned the phrase “武汉加油” into thousands of colored pixels using Python, creating a time‑lapse cherry blossom animation that celebrates spring and supports the city, with step‑by‑step code for image generation, video assembly, and pixel‑level color handling.

Image ProcessingOpenCVPIL
0 likes · 6 min read
How a Wuhan Student Used Python to Paint a Blooming Cherry Blossom from Pixels
Taobao Frontend Technology
Taobao Frontend Technology
Dec 5, 2019 · Frontend Development

From UI Sketch to Code: Frontend Intelligence Generates 79% of Double‑11 Modules

This article explains how Alibaba's Front‑End Intelligent project automatically converts UI design images into production‑ready code, covering layout analysis, background and foreground processing, a fusion of traditional image algorithms with deep‑learning detection, GAN‑based complex‑background extraction, experimental results and real‑world deployment.

AutomationCode GenerationGAN
0 likes · 21 min read
From UI Sketch to Code: Frontend Intelligence Generates 79% of Double‑11 Modules
Alibaba Terminal Technology
Alibaba Terminal Technology
Dec 5, 2019 · Frontend Development

How Frontend Code Is Automatically Generated: Inside Alibaba’s AI‑Powered D2C Pipeline

This article explains Alibaba's front‑end intelligent project that automatically generated 79.34% of the Double‑11 UI code, detailing why images are used as input, the layered image‑processing pipeline, background and foreground analysis, traditional versus deep‑learning methods, fusion techniques, evaluation results, and real‑world deployments.

Code GenerationImage ProcessingLayout Analysis
0 likes · 20 min read
How Frontend Code Is Automatically Generated: Inside Alibaba’s AI‑Powered D2C Pipeline
Architecture Digest
Architecture Digest
Nov 8, 2019 · Backend Development

Designing the Architecture for a Photo ID Application: Identifying Complexity and Choosing Practical Solutions

Backend developer Xiao Ming designs an architecture for a photo‑ID app by identifying complexity in image storage and processing, prioritizing solutions such as local‑disk abstraction and Baidu AI services, and applying the principles of suitability, simplicity, and evolutionary design to produce a practical system.

Image Processingarchitecturestorage
0 likes · 6 min read
Designing the Architecture for a Photo ID Application: Identifying Complexity and Choosing Practical Solutions
Xianyu Technology
Xianyu Technology
Jul 9, 2019 · Artificial Intelligence

Complex Background Content Extraction Using Detection and GAN Networks

The proposed UI2CODE pipeline first recalls UI elements with an object detector, then uses gradient cues to separate simple from complex regions and applies an SRGAN to restore foreground details in challenging backgrounds, achieving higher precision, recall, and localization than GrabCut and Deeplab, though it demands extensive multi‑scale training data.

AIGANImage Processing
0 likes · 4 min read
Complex Background Content Extraction Using Detection and GAN Networks
Tencent Cloud Developer
Tencent Cloud Developer
Jun 5, 2019 · Artificial Intelligence

Tencent Cloud OCR Technology: Principles, Challenges, and Industry Applications

Tencent Cloud OCR leverages deep‑learning‑based text detection and recognition, including Compact Inception and multi‑layer RNN refinements, to overcome challenges such as complex backgrounds, low resolution, and multilingual layouts, delivering over 90% accuracy for ID cards, bank cards, business licenses, handwritten text, and powering fast, cost‑saving applications in logistics, QQ, and WeChat Work.

Deep LearningImage ProcessingOCR
0 likes · 7 min read
Tencent Cloud OCR Technology: Principles, Challenges, and Industry Applications
HomeTech
HomeTech
Apr 18, 2019 · Artificial Intelligence

An Overview of Image Processing Techniques and Common Tools for Beginners

This article provides a concise introduction to image processing, covering its hierarchical structure, fundamental techniques such as classification, detection, segmentation, geometric transformation, and the most widely used libraries and deep‑learning frameworks for newcomers.

Computer VisionImage ClassificationImage Processing
0 likes · 9 min read
An Overview of Image Processing Techniques and Common Tools for Beginners
MaGe Linux Operations
MaGe Linux Operations
Apr 9, 2019 · Artificial Intelligence

How to Build and Crack Image Captchas with Python and Tesserocr

This tutorial explains the types of captchas, demonstrates how to generate image captchas using the Claptcha library, outlines preprocessing steps such as grayscale conversion, binarization, and denoising, and shows how to recognize them with the Tesserocr OCR engine, including handling noise and interference lines.

CaptchaImage ProcessingOCR
0 likes · 7 min read
How to Build and Crack Image Captchas with Python and Tesserocr
Youzan Coder
Youzan Coder
Mar 29, 2019 · Artificial Intelligence

Image Binarization for Receipt Printing: Algorithms and Implementation

The article outlines a receipt‑printer image binarization pipeline that converts color logos and QR codes to black‑white bitmaps by first reviewing a C‑based OTSU implementation, then introducing a parallel trio of OTSU, average‑gray, and double‑peak algorithms whose outputs are compared via hash fingerprints, cached for reuse, and optimized for speed on mobile devices, with plans for adaptive thresholds and block‑wise QR processing.

C languageImage ProcessingOTSU algorithm
0 likes · 19 min read
Image Binarization for Receipt Printing: Algorithms and Implementation
DataFunTalk
DataFunTalk
Mar 15, 2019 · Artificial Intelligence

A Comprehensive Overview of Deep Learning Applications in Computer Vision

This article provides an extensive review of deep learning techniques applied to computer vision, covering the evolution of CNN architectures, image and video processing tasks, 2.5‑D and 3‑D reconstruction, object detection, segmentation, tracking, SLAM, and various practical applications such as AR, content retrieval, and autonomous driving.

CNNComputer VisionImage Processing
0 likes · 22 min read
A Comprehensive Overview of Deep Learning Applications in Computer Vision
Xianyu Technology
Xianyu Technology
Feb 27, 2019 · Artificial Intelligence

UI2CODE: Layout Analysis and Background/Foreground Extraction for UI Images

The UI2CODE system tackles UI layout analysis by first extracting backgrounds with Sobel, Laplacian and Canny edge detection plus a flood‑fill algorithm, then isolating foreground components through connected‑component analysis and a Faster R‑CNN classifier, and finally fusing both pipelines to achieve superior precision, recall and IoU on Xianyu app screenshots.

Computer VisionDeep LearningFaster R-CNN
0 likes · 16 min read
UI2CODE: Layout Analysis and Background/Foreground Extraction for UI Images
System Architect Go
System Architect Go
Feb 26, 2019 · Fundamentals

Master the Basics of Image Processing with OpenCV and NumPy

This article introduces core image processing concepts—pixel fundamentals, binary, grayscale, and RGB images, matrix representation—and demonstrates practical implementations of cropping, canvas creation, watermarking, translation, rotation, and scaling using Python's OpenCV and NumPy libraries, including algorithm choices for resizing.

Computer VisionImage ProcessingNumPy
0 likes · 5 min read
Master the Basics of Image Processing with OpenCV and NumPy
21CTO
21CTO
Feb 22, 2019 · Fundamentals

Why the Iconic “Lenna” Photo Became the Face of Image‑Processing Research

The article recounts how a 1960 Playboy portrait of Lena Söderberg was adopted by image‑processing researchers as a standard test image, explains the technical and cultural reasons for its lasting popularity, and follows her unexpected rise to fame within the scientific community.

BenchmarkComputer VisionImage Processing
0 likes · 7 min read
Why the Iconic “Lenna” Photo Became the Face of Image‑Processing Research
DataFunTalk
DataFunTalk
Jan 14, 2019 · Artificial Intelligence

Computer Vision Fundamentals, Traditional Methods, Deep Learning Advances, and Cloud AI Deployment

This article provides a comprehensive overview of computer vision, covering its basic concepts, traditional image processing techniques, modern deep‑learning approaches, real‑world AI application cases, and the cloud infrastructure needed to support large‑scale deployment, while also offering skill‑advancement guidance.

AI applicationsCloud AIComputer Vision
0 likes · 20 min read
Computer Vision Fundamentals, Traditional Methods, Deep Learning Advances, and Cloud AI Deployment
Baidu Intelligent Testing
Baidu Intelligent Testing
Jan 9, 2019 · Operations

Automated Visual Assertion for Search Template Rendering Using Logical Block Modeling

The article describes a visual‑based automation framework that extracts logical blocks and relative positions from template screenshots, builds a mathematical model, and uses an expert‑system approach to automatically detect layout anomalies across diverse search result templates, reducing manual testing effort.

AutomationImage ProcessingUI testing
0 likes · 11 min read
Automated Visual Assertion for Search Template Rendering Using Logical Block Modeling
MaGe Linux Operations
MaGe Linux Operations
Dec 17, 2018 · Artificial Intelligence

How to Build a KNN-Based CAPTCHA Solver with OpenCV in Python

This tutorial walks through using OpenCV and a K‑Nearest Neighbors model to preprocess, segment, manually label, train, and finally recognize distorted, noisy CAPTCHA images, achieving about 82% accuracy on a test set of one hundred samples.

CaptchaImage ProcessingOpenCV
0 likes · 12 min read
How to Build a KNN-Based CAPTCHA Solver with OpenCV in Python
37 Interactive Technology Team
37 Interactive Technology Team
Nov 27, 2018 · Artificial Intelligence

37 Xiao Luban: A Machine‑Learning Linear Regression System for Automatic Banner Generation

The article describes a PHP engineer who built a machine‑learning linear regression system called 37 Xiao Luban to automatically generate game banner images, cutting production time from hours to minutes, using polynomial regression on collected scaling data, achieving 80‑90% usability.

AIAutomationBanner Generation
0 likes · 7 min read
37 Xiao Luban: A Machine‑Learning Linear Regression System for Automatic Banner Generation
21CTO
21CTO
Nov 21, 2018 · Artificial Intelligence

What’s Driving the Rapid Evolution of Face Recognition Technology?

This comprehensive overview examines the fundamentals, historical milestones, key algorithms, major datasets, policy support, industry applications, and future trends of face recognition technology, highlighting its rapid growth within computer vision and artificial intelligence.

AIBiometricsComputer Vision
0 likes · 45 min read
What’s Driving the Rapid Evolution of Face Recognition Technology?
MaGe Linux Operations
MaGe Linux Operations
Nov 18, 2018 · Artificial Intelligence

How to Crack Image Captchas with Python: Generation, Pre‑processing, and OCR

This tutorial walks through the four main captcha types, focuses on image captchas, explains generation with the Claptcha library, details preprocessing steps such as grayscale conversion, binarization, denoising, and character segmentation, and demonstrates recognition using tesserocr, while showing the impact of noise and interference lines.

CaptchaImage ProcessingOCR
0 likes · 6 min read
How to Crack Image Captchas with Python: Generation, Pre‑processing, and OCR
Sohu Tech Products
Sohu Tech Products
Oct 24, 2018 · Artificial Intelligence

Intelligent News Image Formatter: AI‑Based Cropping and Selection System for News List Images

This article introduces the Intelligent News Formatter, an AI‑driven system that tackles news‑app list‑image problems by using face detection, object detection, deep‑learning based cropping, image quality filtering, and similarity removal to automatically produce aesthetically pleasing and information‑rich thumbnails.

AIFace DetectionImage Processing
0 likes · 14 min read
Intelligent News Image Formatter: AI‑Based Cropping and Selection System for News List Images
Youku Technology
Youku Technology
Oct 24, 2018 · Artificial Intelligence

Intelligent Video Content Production and Automated Editing Platform – Technical Overview

Alibaba Youku built Milan ZhiYun platform that uses AI video understanding to automatically extract highlights, generate covers, and reassemble fragments from long‑form videos, leveraging scene detection, beautification, bullet‑comment analysis, and subjective quality evaluation to dramatically improve editing efficiency and user engagement.

Image ProcessingVideo AIcontent automation
0 likes · 12 min read
Intelligent Video Content Production and Automated Editing Platform – Technical Overview
Alibaba Cloud Developer
Alibaba Cloud Developer
Oct 18, 2018 · Artificial Intelligence

AI-Powered Smart Document Processing for International Trade

This article outlines how Alibaba engineers apply AI, image processing, natural language processing, and knowledge‑graph techniques to automate and secure the handling of complex, image‑heavy trade documents, dramatically improving efficiency, reducing risk, and enabling scalable, low‑cost solutions for SMEs in international commerce.

AIDocument AutomationImage Processing
0 likes · 14 min read
AI-Powered Smart Document Processing for International Trade
JD Tech
JD Tech
Aug 29, 2018 · Operations

ImageMagick Command‑Line Guide: Installation, Basic Commands, Formats and Practical Examples

This tutorial explains how to install ImageMagick, use its core command‑line tools such as magick, convert, identify, mogrify and composite, format image resize commands, add watermarks, generate captchas, clone and append images, convert between GIF, PDF and other formats, and integrate the tool with Node.js scripts.

GraphicsImage ProcessingImageMagick
0 likes · 19 min read
ImageMagick Command‑Line Guide: Installation, Basic Commands, Formats and Practical Examples
Tencent Cloud Developer
Tencent Cloud Developer
Aug 13, 2018 · Artificial Intelligence

Computer Vision Technology: From Viral Social Media Apps to Enterprise AI Applications

The article surveys computer‑vision fundamentals and evolution—from early filters and feature extractors to modern deep‑learning models—illustrating how techniques like face detection, image matching, and caption generation powered viral social‑media trends and now underpin enterprise AI services on Tencent Cloud, while offering practical implementation and skill‑development guidance.

AI applicationsCNNImage Processing
0 likes · 18 min read
Computer Vision Technology: From Viral Social Media Apps to Enterprise AI Applications
Tencent Music Tech Team
Tencent Music Tech Team
Jul 27, 2018 · Mobile Development

Understanding Android Nine‑Patch Images: Creation, Usage, and Internals

The article explains Android’s Nine‑Patch format—a bitmap with a 1‑pixel black border defining stretchable and content areas—covers creation methods (Photoshop, Draw9patch, Android Studio), runtime parsing, practical use in network‑loaded chat bubbles, required conversion workflow, and the underlying PNG “npTc” chunk structure with code examples for manual drawable construction.

AndroidImage ProcessingMobile Development
0 likes · 20 min read
Understanding Android Nine‑Patch Images: Creation, Usage, and Internals
AntTech
AntTech
Jun 1, 2018 · Mobile Development

Optimizing QR Code Scanning: Boosting Recognition Rate, Cutting Latency, and Enhancing Robustness

This article details how Alipay's scanning technology team improved QR code recognition by refining aspect‑ratio tolerance, introducing new pattern detection modes, applying diagonal filtering, leveraging logistic‑regression classification, adjusting jump‑line intervals, and moving binarization to GPU, resulting in a 6.95‑point increase in recognition rate and significantly reduced processing time.

Algorithm OptimizationComputer VisionImage Processing
0 likes · 12 min read
Optimizing QR Code Scanning: Boosting Recognition Rate, Cutting Latency, and Enhancing Robustness
Architecture Digest
Architecture Digest
May 19, 2018 · Artificial Intelligence

Optical Flow: Principles, Evolution, and Applications in Computer Vision

This article explains the fundamentals of optical flow, traces its development from early variational methods to modern deep‑learning models like FlowNet, and discusses practical applications such as video object detection, semantic segmentation, and novel view synthesis, highlighting both technical challenges and future research directions.

Computer VisionDeep LearningFlowNet
0 likes · 14 min read
Optical Flow: Principles, Evolution, and Applications in Computer Vision
21CTO
21CTO
May 8, 2018 · Artificial Intelligence

How Optical Flow Powers 360° Product Views and Advanced Vision Applications

This article explores the evolution and principles of optical flow—from early Horn‑Schunck models and Lucas‑Kanade to modern deep‑learning approaches like FlowNet—detailing its role in JD’s 360° product imaging, video detection, segmentation, view synthesis, and future research challenges in computer vision.

Deep LearningImage Processingoptical flow
0 likes · 15 min read
How Optical Flow Powers 360° Product Views and Advanced Vision Applications
JD Tech
JD Tech
May 4, 2018 · Artificial Intelligence

Optical Flow: Principles, Methods, and Applications in Computer Vision

This article introduces the fundamentals and evolution of optical flow, covering classic algorithms such as Horn‑Schunck and Lucas‑Kanade, modern deep‑learning approaches like FlowNet, and their practical applications in video detection, semantic segmentation, and novel view synthesis.

CNNDeep LearningImage Processing
0 likes · 15 min read
Optical Flow: Principles, Methods, and Applications in Computer Vision
Ctrip Technology
Ctrip Technology
May 2, 2018 · Artificial Intelligence

Document OCR: From Computer Vision Fundamentals to Ctrip's Full-Text OCR Implementation

This article explains the evolution of optical character recognition, outlines the complete OCR processing pipeline—including image input, preprocessing, binarization, noise removal, tilt correction, layout analysis, character segmentation, recognition, and post‑processing—while showcasing Ctrip's real‑world OCR project, its architecture, accuracy metrics, and key computer‑vision techniques such as CNN, HSV, HOG, LBP, and Haar features.

CNNComputer VisionImage Processing
0 likes · 13 min read
Document OCR: From Computer Vision Fundamentals to Ctrip's Full-Text OCR Implementation
Java Captain
Java Captain
Jan 26, 2018 · Backend Development

Generating and Parsing QR Code Images in Java with ZXing

This article demonstrates how to use Google's ZXing Core library in a Java Maven project to generate QR code images, create random verification codes, and decode QR codes from files, providing complete code examples and step-by-step instructions.

Code GenerationImage ProcessingJava
0 likes · 12 min read
Generating and Parsing QR Code Images in Java with ZXing
AI Cyberspace
AI Cyberspace
Dec 30, 2017 · Artificial Intelligence

Revive Your 18‑Year‑Old Look with a 30‑Line Python OpenCV Beauty Filter

On the cusp of the 2017 year‑end, the author humorously marks the last 90‑post turning 18 while offering a Python‑OpenCV tutorial that, in just 30 lines of code, applies a beauty filter to make anyone look younger, complete with installation steps and sample output.

Beauty FilterComputer VisionImage Processing
0 likes · 5 min read
Revive Your 18‑Year‑Old Look with a 30‑Line Python OpenCV Beauty Filter
MaGe Linux Operations
MaGe Linux Operations
Nov 19, 2017 · Information Security

How to Crack Sliding Puzzle Captchas with Simple Python Image Analysis

This article explores the security weaknesses of sliding puzzle captchas by collecting multiple samples, performing qualitative and quantitative image analysis, and presenting a Python script that automatically identifies the puzzle gap using pixel differences and statistical thresholds.

CaptchaImage ProcessingPython
0 likes · 7 min read
How to Crack Sliding Puzzle Captchas with Simple Python Image Analysis
Aotu Lab
Aotu Lab
Nov 3, 2017 · Game Development

Build a One‑Stroke Puzzle Game with Auto‑Graph Recognition

This article explains how to implement a one‑stroke puzzle game using graph theory, describes the two‑step rendering process, provides JavaScript pseudocode for touch interaction, and details an automatic image‑recognition tool that extracts level data from hand‑drawn diagrams while optimizing performance.

CanvasImage ProcessingJavaScript
0 likes · 15 min read
Build a One‑Stroke Puzzle Game with Auto‑Graph Recognition
UCloud Tech
UCloud Tech
Aug 11, 2017 · Cloud Computing

How Serverless Architecture Powers AI Image Style Transfer with UCloud

This article explains the Serverless concept, compares it with PaaS and BaaS, and provides a step‑by‑step guide to building an AI‑driven Van Gogh style‑transfer service using UCloud's Serverless (UGC) platform, covering both backend setup and frontend interaction.

AIFaaSImage Processing
0 likes · 9 min read
How Serverless Architecture Powers AI Image Style Transfer with UCloud
WeChat Client Technology Team
WeChat Client Technology Team
Jun 20, 2017 · Artificial Intelligence

Boosting Mobile Video Calls: Real-Time Low-Light Enhancement Using Spatial‑Temporal Constraints

This article presents a real‑time low‑lighting video enhancement technique that combines spatial and temporal constraints to brighten mobile video calls without flicker, offering low computational cost and robust performance validated through subjective and objective experiments.

Image Processinglow-light videomobile-communication
0 likes · 11 min read
Boosting Mobile Video Calls: Real-Time Low-Light Enhancement Using Spatial‑Temporal Constraints
WeChat Backend Team
WeChat Backend Team
Jun 20, 2017 · Artificial Intelligence

How Constrained Spatial‑Temporal Modeling Boosts Low‑Light Video Calls in Real Time

Researchers present a real‑time low‑lighting video enhancement technique for mobile communication that combines spatial brightness and contrast constraints with temporal consistency, delivering brighter, flicker‑free video without over‑enhancement, while maintaining low computational cost suitable for smartphones.

Image Processinglow-light videomobile-communication
0 likes · 12 min read
How Constrained Spatial‑Temporal Modeling Boosts Low‑Light Video Calls in Real Time
MaGe Linux Operations
MaGe Linux Operations
May 8, 2017 · Game Development

How to Build a Python Bot to Automate a Simple Web Game

This tutorial explains how to use Python libraries such as autopy, PIL, and pywin32 to capture screen images, recognize game elements, and automate mouse movements and clicks, enabling a bot to automatically play a sushi‑making browser game.

Game BotImage ProcessingPython
0 likes · 10 min read
How to Build a Python Bot to Automate a Simple Web Game
21CTO
21CTO
Dec 24, 2016 · Backend Development

Generate Thumbnails with GD2 and ImageMagick in CodeIgniter

This guide explains why thumbnails are essential, outlines common use cases, and provides step‑by‑step CodeIgniter examples for creating 50×50 thumbnails using PHP's GD2 library and ImageMagick, including configuration, resizing, and error handling.

CodeIgniterGD2Image Processing
0 likes · 8 min read
Generate Thumbnails with GD2 and ImageMagick in CodeIgniter
Architecture Digest
Architecture Digest
Nov 25, 2016 · Artificial Intelligence

Simple Python Facial Recognition with OpenCV in Ten Lines of Code

This article demonstrates how to build a lightweight facial‑recognition tool using Python and OpenCV, explains the underlying concepts such as cascade classifiers and image preprocessing, and provides a fully commented ten‑line script together with installation instructions and usage examples.

Image ProcessingOpenCVfacial recognition
0 likes · 10 min read
Simple Python Facial Recognition with OpenCV in Ten Lines of Code
Qunar Tech Salon
Qunar Tech Salon
Aug 8, 2016 · Artificial Intelligence

OCR Technology Overview and Implementation Steps for Card Number Recognition

This article provides a comprehensive overview of OCR technology, explains its definition and application scenarios, and details a five‑step workflow—including target extraction, preprocessing, character localization, digit matching, and format validation—specifically illustrated with bank card number recognition.

Bank Card RecognitionComputer VisionImage Processing
0 likes · 9 min read
OCR Technology Overview and Implementation Steps for Card Number Recognition
Java High-Performance Architecture
Java High-Performance Architecture
Jul 2, 2016 · Cloud Computing

How Serverless FaaS Redefines Image Processing with AWS Lambda

This article explains the Function-as-a-Service (FaaS) model, illustrates how AWS Lambda can replace traditional server‑centric image upload workflows by handling scaling, storage, and database updates through event‑driven functions, and discusses the broader impact of serverless on cloud architecture.

AWS LambdaEvent-drivenFaaS
0 likes · 5 min read
How Serverless FaaS Redefines Image Processing with AWS Lambda
21CTO
21CTO
Jun 7, 2016 · Backend Development

How Twitter Scales to 3,000 Images per Second: Architecture Secrets Revealed

Twitter’s media platform, launched in 2016, processes up to 3,000 images per second by decoupling media from tweets, using chunked resumable uploads, progressive JPEGs, on‑demand variant generation, and efficient storage policies that saved millions of dollars, illustrating modern backend scaling techniques.

Backend EngineeringImage ProcessingMedia Platform
0 likes · 14 min read
How Twitter Scales to 3,000 Images per Second: Architecture Secrets Revealed
High Availability Architecture
High Availability Architecture
Mar 18, 2016 · Backend Development

High‑Availability Architecture of Upyun Image Processing Service

The article details Upyun's high‑availability image processing architecture, covering workload‑aware system design, custom GmServer implementation, task scheduling, current strengths and limitations, and future directions such as a ServiceServer‑based queue and Docker‑driven dynamic scaling.

Backend ArchitectureCloud ServicesDocker
0 likes · 15 min read
High‑Availability Architecture of Upyun Image Processing Service
Architecture Digest
Architecture Digest
Mar 14, 2016 · Mobile Development

Optimizing Mobile Taobao Main Venue Performance and Personalization with a Reusable Framework and Image‑Merging System

The article details how Alibaba's Mobile Taobao team redesigned the main promotional venue using a reusable native framework, dynamic Weex rendering, cloud‑driven configuration, pre‑heat data push, multi‑strategy routing, and a large‑scale image‑merging system to improve browsing speed, reduce bounce rates, and support flexible, personalized large‑scale sales events.

Image ProcessingPerformance Optimizationframework design
0 likes · 15 min read
Optimizing Mobile Taobao Main Venue Performance and Personalization with a Reusable Framework and Image‑Merging System
ITPUB
ITPUB
Jan 28, 2016 · Artificial Intelligence

Detect Your Oven’s On/Off State with Python and OpenCV

This tutorial shows how to use Python, OpenCV, and basic image‑processing techniques to automatically detect whether a kitchen oven is on by analyzing the red indicator light captured by a home camera, providing a simple safety alert system.

Computer VisionHome AutomationImage Processing
0 likes · 7 min read
Detect Your Oven’s On/Off State with Python and OpenCV
21CTO
21CTO
Aug 31, 2015 · Backend Development

Boost Image Delivery Speed with Bolt: A Fast Real-Time Image Resizing Server

Bolt is a high‑performance real‑time image cropping and compression server that processes each image only once, uses an LRU cache to limit memory, and offers flexible command‑line options for installation, usage, and scaling on CPU‑rich servers.

Image ProcessingLRUcaching
0 likes · 4 min read
Boost Image Delivery Speed with Bolt: A Fast Real-Time Image Resizing Server
Ctrip Technology
Ctrip Technology
Jun 29, 2015 · Artificial Intelligence

Bank Card Scanning and Recognition: Extending Support for Chinese Debit Cards

This article describes a project that enhances an open‑source card‑number scanning solution to recognize 19‑digit Chinese debit cards, addressing challenges such as black‑printed fonts, light‑colored embossed fonts, background filtering, single‑character OCR, and Luhn‑based checksum verification.

Bank Card RecognitionComputer VisionImage Processing
0 likes · 6 min read
Bank Card Scanning and Recognition: Extending Support for Chinese Debit Cards