Tagged articles

Image processing

292 articles · Page 3 of 3

Mar 23, 2021 · Mobile Development

Building a Flutter Color Picker Tool for UI Debugging

This article explains how to create a Flutter-based color‑picker utility that captures the current screen, extracts pixel colors at any touch point, and displays a magnified view with the selected color, providing a practical solution for UI verification and design‑developer communication.

Color PickerFlutterImage processing

0 likes · 13 min read

Building a Flutter Color Picker Tool for UI Debugging

Huawei Cloud Developer Alliance

Mar 23, 2021 · Artificial Intelligence

How to Recognize Credit Card Numbers with OpenCV: A Step‑by‑Step Tutorial

This tutorial walks through a project‑based OpenCV workflow that reads a digit template, preprocesses both template and credit‑card images, extracts individual numbers, matches them against the template, and finally overlays the recognized digits onto the original image, illustrating core computer‑vision techniques.

Image processingOCRPython

0 likes · 10 min read

How to Recognize Credit Card Numbers with OpenCV: A Step‑by‑Step Tutorial

Python Programming Learning Circle

Mar 2, 2021 · Fundamentals

Creating a Panda‑Head Meme with Python, OpenCV and Pillow

This tutorial demonstrates how to import, resize, threshold, rotate, and blend foreground photos with a panda‑head background using OpenCV and Pillow in Python, then add English or Chinese text and save the resulting meme image.

Image processingMeme Generationopencv

0 likes · 10 min read

Creating a Panda‑Head Meme with Python, OpenCV and Pillow

Kuaishou Large Model

Feb 25, 2021 · Artificial Intelligence

How Kuaishou’s AI‑Powered Beauty Engine Transforms Real‑Time Video

This article details Kuaishou Y‑tech’s Gorgeous beauty platform, covering traditional smoothing, advanced skin‑tone effects, AI‑driven blemish removal, clarity enhancement, local facial tuning, and the UNet‑based GorgeousGAN that delivers one‑click high‑definition beauty for live‑stream and short‑video applications.

AI beautyImage processingcomputer vision

0 likes · 13 min read

How Kuaishou’s AI‑Powered Beauty Engine Transforms Real‑Time Video

php Courses

Feb 4, 2021 · Information Security

Analyzing and Decoding CAPTCHA Images Using PHP

This article explains how to extract RGB values from a CAPTCHA image with PHP, convert the pixel data into binary patterns, map those patterns to digits using a predefined dictionary, and achieve 100% recognition accuracy, illustrating a practical backend security technique.

Image processingOCRPHP

0 likes · 4 min read

Analyzing and Decoding CAPTCHA Images Using PHP

Kuaishou Large Model

Jan 28, 2021 · Artificial Intelligence

How Portrait Deformation Powers Modern Beauty Filters: Algorithms Explained

This article explores the core portrait deformation techniques behind today’s beauty and body‑shaping effects—covering affine transforms, Moving Least Squares, triangulation, liquify, offset, 3D mesh, and deep‑learning approaches—detailing their principles, implementations, and visual results in live‑streaming and short‑video apps.

AIImage processingbeauty filters

0 likes · 13 min read

How Portrait Deformation Powers Modern Beauty Filters: Algorithms Explained

Programmer DD

Dec 22, 2020 · Artificial Intelligence

How AI Can Unmask Pixelated Text: Inside the Depix Project

The article explains how the open‑source Depix tool uses AI to decode pixelated or censored text, outlines its algorithm based on De Bruijn sequences, provides usage instructions, and also highlights the related PULSE AI up‑sampling technique for restoring low‑resolution images.

AIDepixImage processing

0 likes · 7 min read

How AI Can Unmask Pixelated Text: Inside the Depix Project

MaGe Linux Operations

Dec 16, 2020 · Artificial Intelligence

How to Build an Automated Minesweeper Bot with Python and Win32 API

This guide walks through setting up a Python environment, installing required libraries, capturing the Minesweeper window, extracting tile colors, converting the board into a matrix, and implementing a basic algorithm that flags mines and clicks safe squares, all automated via Win32 API calls.

AutomationGame BotImage processing

0 likes · 10 min read

How to Build an Automated Minesweeper Bot with Python and Win32 API

php Courses

Nov 17, 2020 · Backend Development

Using Intervention Image Package in Laravel for Image Manipulation

This article introduces the Laravel-compatible Intervention Image package, explains how to install it via Composer, configure the service provider and alias, and demonstrates basic image operations such as resizing, saving, and resource destruction, including a known issue with overwriting files.

ComposerImage processingIntervention Image

0 likes · 2 min read

Using Intervention Image Package in Laravel for Image Manipulation

Tencent Cloud Developer

Sep 14, 2020 · Mobile Development

Memory Optimization Techniques for Image-Intensive iOS Applications

To prevent crashes, forced‑out‑of‑memory terminations, and battery drain in image‑heavy iOS apps, developers should avoid retaining unused UIImages, use efficient scaling with UIGraphicsImageRenderer, employ autorelease pools, stream thumbnails via ImageIO, and downsample to view size, cutting peak memory usage from ~600 MB to ~221 MB.

Image processingMemory optimizationSwift

0 likes · 10 min read

Memory Optimization Techniques for Image-Intensive iOS Applications

Baidu App Technology

Sep 7, 2020 · Artificial Intelligence

How Baidu App Achieves Real‑Time Mobile Super‑Resolution with Deep Learning

This article explains how Baidu App leverages a VDSR‑based deep‑learning model and a series of mobile‑side optimizations to deliver real‑time image and video super‑resolution on iOS and Android devices, detailing the technical challenges, performance gains, and integration steps.

AIAndroidImage processing

0 likes · 6 min read

How Baidu App Achieves Real‑Time Mobile Super‑Resolution with Deep Learning

Programmer DD

Sep 4, 2020 · Artificial Intelligence

How to Build a Java Spring Boot License Plate Recognition System with OpenCV

An open-source Java Spring Boot project demonstrates license plate detection and recognition using OpenCV, supporting multiple plate colors, with SVM and ANN training, detailed architecture, feature list, installation guide, and visual processing steps, offering a beginner-friendly tutorial for image recognition enthusiasts.

Image processingSpring Bootlicense-plate-recognition

0 likes · 7 min read

How to Build a Java Spring Boot License Plate Recognition System with OpenCV

MaGe Linux Operations

Aug 15, 2020 · Fundamentals

How to Overlay Custom Text on Images with Python Pillow and Fix Font Path Issues

This guide walks through common Pillow errors when adding text to images, shows how to correct font file paths, switch fonts and colors, and provides working Python code snippets for rendering both English and Chinese characters on pictures.

Image processingText Overlayfonts

0 likes · 4 min read

How to Overlay Custom Text on Images with Python Pillow and Fix Font Path Issues

Programmer DD

Aug 12, 2020 · Artificial Intelligence

Turn Photos and Videos into Cartoons with the Open‑Source Cartoonize AI

Cartoonize is an open‑source web application that leverages a white‑box GAN model to convert images and short videos into high‑quality cartoon style, offering easy Docker or virtualenv installation, detailed usage instructions, and insights into the underlying research paper.

AIDockerGaN

0 likes · 9 min read

Turn Photos and Videos into Cartoons with the Open‑Source Cartoonize AI

Java Captain

Aug 2, 2020 · Artificial Intelligence

Java Spring Boot License Plate Recognition and Training System (Open‑Source)

This article introduces an open‑source Java Spring Boot project that implements a license‑plate detection and recognition system with training capabilities, detailing its features, architecture, supported plate types, software requirements, processing steps, installation guide, and reference resources.

Image processingJavaSpring Boot

0 likes · 8 min read

Java Spring Boot License Plate Recognition and Training System (Open‑Source)

Top Architect

Aug 2, 2020 · Backend Development

Generating QR Codes in Java with ZXing and Maven

This tutorial demonstrates how to create a Maven Java project, add the ZXing library, implement utility classes for QR code generation and decoding, and run a sample application that produces a QR code image and reads its content back, complete with code examples and configuration details.

Image processingJavaQR code

0 likes · 9 min read

Generating QR Codes in Java with ZXing and Maven

Python Crawling & Data Mining

Jul 21, 2020 · Artificial Intelligence

Turn Your Photos into Anime with Baidu AI: A Step‑by‑Step Python Guide

This tutorial explains how to use Baidu AI's Portrait Anime API with Python, covering token acquisition, API key setup, request formation, and code examples for generating both plain anime portraits and masked versions, complete with sample images and usage limits.

APIAccess TokenAnime Transformation

0 likes · 8 min read

Turn Your Photos into Anime with Baidu AI: A Step‑by‑Step Python Guide

Python Crawling & Data Mining

Jul 13, 2020 · Backend Development

Turn Photos into Cartoons with Python, OpenCV, and Flask

This tutorial walks through building a Python program that uses OpenCV to reduce colors and enhance edges for cartoon‑style conversion, then wraps the process in a simple Flask web app for uploading images and displaying the resulting cartoons.

CartoonifyFlaskImage processing

0 likes · 6 min read

Turn Photos into Cartoons with Python, OpenCV, and Flask

Python Crawling & Data Mining

Jul 5, 2020 · Artificial Intelligence

Unlock Python OpenCV: Load, Resize, Threshold, and Transform Images

This tutorial demonstrates how to use Python's OpenCV library to read and display images, perform resizing, convert to HSV, apply binary thresholding, execute erosion and dilation, and replace specific pixel colors, providing step-by-step code examples and visual results for each operation.

Image processingPixel ManipulationPython

0 likes · 8 min read

Unlock Python OpenCV: Load, Resize, Threshold, and Transform Images

Python Programming Learning Circle

Jun 15, 2020 · Artificial Intelligence

One-Click Background Replacement for ID Photos Using Python OpenCV

This tutorial demonstrates how to use Python, OpenCV, and NumPy on Windows to import an image, resize it, convert it to HSV, apply color-based masking, perform erosion and dilation, replace the background color, and finally save the processed photo with a single script.

Background ReplacementImage processingNumPy

0 likes · 4 min read

One-Click Background Replacement for ID Photos Using Python OpenCV

Sohu Tech Products

Jun 10, 2020 · Mobile Development

Implementing Invisible LSB Watermark on iOS Images

This article explains how to embed and retrieve an invisible numeric watermark in iOS screenshots by modifying the least‑significant bit of the RGB pixel data, provides Swift code examples for encoding and decoding, discusses practical usage scenarios, and notes limitations and possible improvements.

Image processingLSBiOS

0 likes · 9 min read

Implementing Invisible LSB Watermark on iOS Images

Python Programming Learning Circle

Jun 6, 2020 · Fundamentals

How to Reverse a GIF Using Python's Pillow (PIL) Library

This tutorial demonstrates how to use Python's Pillow library to decompose a GIF into individual frames, reverse their order, and reassemble them into a new reversed‑play GIF, providing step‑by‑step code and explanations for each stage.

GIFImage processingpillow

0 likes · 3 min read

How to Reverse a GIF Using Python's Pillow (PIL) Library

Laravel Tech Community

Jun 4, 2020 · Backend Development

Using Intervention Image Package in Laravel: Installation, Configuration, and Basic Usage

This article introduces the Laravel-specific Intervention Image package, explains how to install it via Composer, configure the driver (GD or Imagick), demonstrates basic image manipulation such as resizing, watermarking, and saving, and highlights additional features like uploading, caching, filtering, and dynamic processing.

Image processingIntervention Imagetutorial

0 likes · 4 min read

Using Intervention Image Package in Laravel: Installation, Configuration, and Basic Usage

Huawei Cloud Developer Alliance

May 14, 2020 · Fundamentals

Create a Personalized ‘I Love You’ Photo Mosaic with Python

This tutorial shows how to turn a loved one's photo into a heart‑shaped mosaic made of the phrase “I love you” using Python, OpenCV, and PIL by extracting pixel colors and drawing colored text onto a new image.

Image processingPhoto MosaicPython

0 likes · 6 min read

Create a Personalized ‘I Love You’ Photo Mosaic with Python

iQIYI Technical Product Team

Apr 3, 2020 · Artificial Intelligence

iCartoonFace Challenge: Cartoon Face Detection and Recognition Competition

The iCartoonFace Challenge invites participants to develop efficient algorithms for detecting and recognizing cartoon faces using large, meticulously annotated datasets—50,000 images for detection and nearly 390,000 for recognition—while meeting strict model size and latency limits and submitting detailed methods and code.

AI competitionCartoon Face RecognitionData Set

0 likes · 6 min read

iCartoonFace Challenge: Cartoon Face Detection and Recognition Competition

21CTO

Mar 22, 2020 · Fundamentals

How a Wuhan Student Used Python to Paint a Blooming Cherry Blossom from Pixels

A Wuhan University student turned the phrase “武汉加油” into thousands of colored pixels using Python, creating a time‑lapse cherry blossom animation that celebrates spring and supports the city, with step‑by‑step code for image generation, video assembly, and pixel‑level color handling.

Image processingPILopencv

0 likes · 6 min read

How a Wuhan Student Used Python to Paint a Blooming Cherry Blossom from Pixels

Python Crawling & Data Mining

Jan 30, 2020 · Fundamentals

Create Unique Chinese New Year Phone Wallpapers with Python and PIL

This tutorial shows how to use Python's Pillow library to add custom text, rotate images, and combine transparent layers to generate personalized Chinese New Year phone wallpapers, complete with step‑by‑step code examples and sample results.

Image processingPythonWallpaper

0 likes · 6 min read

Create Unique Chinese New Year Phone Wallpapers with Python and PIL

Python Programming Learning Circle

Jan 10, 2020 · Artificial Intelligence

How to Correct Skewed Text in Images Using OpenCV: A Step‑by‑Step Guide

This tutorial explains how to detect, calculate, and correct the rotation angle of text in an image using OpenCV, covering image binarization, minimum‑area bounding box extraction, angle adjustment, and affine transformation with clear Python code examples.

Image processingPythoncomputer vision

0 likes · 3 min read

How to Correct Skewed Text in Images Using OpenCV: A Step‑by‑Step Guide

Python Programming Learning Circle

Jan 8, 2020 · Artificial Intelligence

How Dilated Convolutions Preserve Image Size While Expanding Receptive Field

This article explains the concept, mathematics, and practical PyTorch implementation of dilated (or atrous) convolutions, showing how to keep image dimensions unchanged while dramatically increasing the receptive field and discussing their advantages and typical applications.

Image processingPyTorchdeep learning

0 likes · 4 min read

How Dilated Convolutions Preserve Image Size While Expanding Receptive Field

Python Programming Learning Circle

Dec 10, 2019 · Artificial Intelligence

How to Swap Faces with Python: Step‑by‑Step Guide Using dlib and OpenCV

This tutorial demonstrates how to perform face swapping on static images using Python, dlib, and OpenCV, guiding readers through landmark detection, Procrustes alignment, color correction, and seamless blending to achieve realistic results.

Image processingProcrustes analysisPython

0 likes · 7 min read

How to Swap Faces with Python: Step‑by‑Step Guide Using dlib and OpenCV

360 Quality & Efficiency

Dec 6, 2019 · Artificial Intelligence

Technical Research on Anime4K: Real‑Time Super‑Resolution Algorithm for Anime Images

Anime4K is a GPU‑accelerated super‑resolution algorithm tailored for animated images that achieves single‑digit millisecond latency, and the article explains its underlying residual‑based principle, practical results, and performance limitations on both GPU and CPU platforms.

Anime4KGPUImage processing

0 likes · 3 min read

Technical Research on Anime4K: Real‑Time Super‑Resolution Algorithm for Anime Images

360 Quality & Efficiency

Dec 6, 2019 · Artificial Intelligence

Technical Research on Anime4K: Real-Time Super-Resolution Algorithm for Animated Images

Anime4K is a GPU-accelerated super‑resolution algorithm designed for animated images, achieving sub‑10 ms latency by enhancing low‑resolution frames with edge‑sharpened residuals, and the article details its principles, visual results, and real‑time performance limitations on typical CPUs.

Anime4KGPUImage processing

0 likes · 2 min read

Technical Research on Anime4K: Real-Time Super-Resolution Algorithm for Animated Images

Taobao Frontend Technology

Dec 5, 2019 · Frontend Development

From UI Sketch to Code: Frontend Intelligence Generates 79% of Double‑11 Modules

This article explains how Alibaba's Front‑End Intelligent project automatically converts UI design images into production‑ready code, covering layout analysis, background and foreground processing, a fusion of traditional image algorithms with deep‑learning detection, GAN‑based complex‑background extraction, experimental results and real‑world deployment.

AutomationGaNImage processing

0 likes · 21 min read

From UI Sketch to Code: Frontend Intelligence Generates 79% of Double‑11 Modules

Alibaba Terminal Technology

Dec 5, 2019 · Frontend Development

How Frontend Code Is Automatically Generated: Inside Alibaba’s AI‑Powered D2C Pipeline

This article explains Alibaba's front‑end intelligent project that automatically generated 79.34% of the Double‑11 UI code, detailing why images are used as input, the layered image‑processing pipeline, background and foreground analysis, traditional versus deep‑learning methods, fusion techniques, evaluation results, and real‑world deployments.

Image processingLayout Analysiscode generation

0 likes · 20 min read

How Frontend Code Is Automatically Generated: Inside Alibaba’s AI‑Powered D2C Pipeline

Architecture Digest

Nov 8, 2019 · Backend Development

Designing the Architecture for a Photo ID Application: Identifying Complexity and Choosing Practical Solutions

Backend developer Xiao Ming designs an architecture for a photo‑ID app by identifying complexity in image storage and processing, prioritizing solutions such as local‑disk abstraction and Baidu AI services, and applying the principles of suitability, simplicity, and evolutionary design to produce a practical system.

Image processingarchitecturestorage

0 likes · 6 min read

Designing the Architecture for a Photo ID Application: Identifying Complexity and Choosing Practical Solutions

Architects' Tech Alliance

Jul 20, 2019 · Artificial Intelligence

FaceApp: AI-Powered Age Transformation, Privacy Concerns, and Technical Insights

The article examines the rapid rise of FaceApp, an AI-driven photo‑editing tool that can age or rejuvenate faces, discusses its technical foundations such as autoencoders, and analyzes the privacy and security concerns raised by governments and users worldwide.

AIAutoencoderFaceApp

0 likes · 7 min read

FaceApp: AI-Powered Age Transformation, Privacy Concerns, and Technical Insights

Xianyu Technology

Jul 9, 2019 · Artificial Intelligence

Complex Background Content Extraction Using Detection and GAN Networks

The proposed UI2CODE pipeline first recalls UI elements with an object detector, then uses gradient cues to separate simple from complex regions and applies an SRGAN to restore foreground details in challenging backgrounds, achieving higher precision, recall, and localization than GrabCut and Deeplab, though it demands extensive multi‑scale training data.

AIGaNImage processing

0 likes · 4 min read

Complex Background Content Extraction Using Detection and GAN Networks

iQIYI Technical Product Team

Jun 14, 2019 · Artificial Intelligence

How iQIYI’s ZoomAI Revives Classic Films with AI‑Powered Video Enhancement

iQIYI’s ZoomAI combines super‑resolution, denoising, sharpening and color‑enhancement AI models to restore low‑quality historic Chinese films, achieving up to 500‑fold faster processing than manual methods while preserving original visual characteristics.

AI video enhancementImage processingZoomAI

0 likes · 9 min read

How iQIYI’s ZoomAI Revives Classic Films with AI‑Powered Video Enhancement

Tencent Cloud Developer

Jun 5, 2019 · Artificial Intelligence

Tencent Cloud OCR Technology: Principles, Challenges, and Industry Applications

Tencent Cloud OCR leverages deep‑learning‑based text detection and recognition, including Compact Inception and multi‑layer RNN refinements, to overcome challenges such as complex backgrounds, low resolution, and multilingual layouts, delivering over 90% accuracy for ID cards, bank cards, business licenses, handwritten text, and powering fast, cost‑saving applications in logistics, QQ, and WeChat Work.

Image processingOCROptical Character Recognition

0 likes · 7 min read

Tencent Cloud OCR Technology: Principles, Challenges, and Industry Applications

MaGe Linux Operations

May 20, 2019 · Fundamentals

Turn a Long Image into a GIF with Python’s PIL: Step‑by‑Step Guide

Learn how to split a lengthy image into equal‑height segments using Python’s PIL library and then recombine those pieces into an animated GIF, complete with code examples, command‑line usage, and tips for handling watermarks and unwanted overlays.

GIFImage processingPIL

0 likes · 7 min read

Turn a Long Image into a GIF with Python’s PIL: Step‑by‑Step Guide

HomeTech

Apr 18, 2019 · Artificial Intelligence

An Overview of Image Processing Techniques and Common Tools for Beginners

This article provides a concise introduction to image processing, covering its hierarchical structure, fundamental techniques such as classification, detection, segmentation, geometric transformation, and the most widely used libraries and deep‑learning frameworks for newcomers.

Image processingcomputer visionimage classification

0 likes · 9 min read

An Overview of Image Processing Techniques and Common Tools for Beginners

MaGe Linux Operations

Apr 9, 2019 · Artificial Intelligence

How to Build and Crack Image Captchas with Python and Tesserocr

This tutorial explains the types of captchas, demonstrates how to generate image captchas using the Claptcha library, outlines preprocessing steps such as grayscale conversion, binarization, and denoising, and shows how to recognize them with the Tesserocr OCR engine, including handling noise and interference lines.

Image processingOCRPython

0 likes · 7 min read

How to Build and Crack Image Captchas with Python and Tesserocr

Youzan Coder

Mar 29, 2019 · Artificial Intelligence

Image Binarization for Receipt Printing: Algorithms and Implementation

The article outlines a receipt‑printer image binarization pipeline that converts color logos and QR codes to black‑white bitmaps by first reviewing a C‑based OTSU implementation, then introducing a parallel trio of OTSU, average‑gray, and double‑peak algorithms whose outputs are compared via hash fingerprints, cached for reuse, and optimized for speed on mobile devices, with plans for adaptive thresholds and block‑wise QR processing.

C languageImage processingOTSU algorithm

0 likes · 19 min read

Image Binarization for Receipt Printing: Algorithms and Implementation

DataFunTalk

Mar 15, 2019 · Artificial Intelligence

A Comprehensive Overview of Deep Learning Applications in Computer Vision

This article provides an extensive review of deep learning techniques applied to computer vision, covering the evolution of CNN architectures, image and video processing tasks, 2.5‑D and 3‑D reconstruction, object detection, segmentation, tracking, SLAM, and various practical applications such as AR, content retrieval, and autonomous driving.

CNNImage processingSLAM

0 likes · 22 min read

A Comprehensive Overview of Deep Learning Applications in Computer Vision

Xianyu Technology

Feb 27, 2019 · Artificial Intelligence

UI2CODE: Layout Analysis and Background/Foreground Extraction for UI Images

The UI2CODE system tackles UI layout analysis by first extracting backgrounds with Sobel, Laplacian and Canny edge detection plus a flood‑fill algorithm, then isolating foreground components through connected‑component analysis and a Faster R‑CNN classifier, and finally fusing both pipelines to achieve superior precision, recall and IoU on Xianyu app screenshots.

Faster R-CNNImage processingLayout Analysis

0 likes · 16 min read

UI2CODE: Layout Analysis and Background/Foreground Extraction for UI Images

System Architect Go

Feb 26, 2019 · Fundamentals

Master the Basics of Image Processing with OpenCV and NumPy

This article introduces core image processing concepts—pixel fundamentals, binary, grayscale, and RGB images, matrix representation—and demonstrates practical implementations of cropping, canvas creation, watermarking, translation, rotation, and scaling using Python's OpenCV and NumPy libraries, including algorithm choices for resizing.

Image processingNumPyPython

0 likes · 5 min read

Master the Basics of Image Processing with OpenCV and NumPy

21CTO

Feb 22, 2019 · Fundamentals

Why the Iconic “Lenna” Photo Became the Face of Image‑Processing Research

The article recounts how a 1960 Playboy portrait of Lena Söderberg was adopted by image‑processing researchers as a standard test image, explains the technical and cultural reasons for its lasting popularity, and follows her unexpected rise to fame within the scientific community.

Image processingLennabenchmark

0 likes · 7 min read

Why the Iconic “Lenna” Photo Became the Face of Image‑Processing Research

DataFunTalk

Jan 14, 2019 · Artificial Intelligence

Computer Vision Fundamentals, Traditional Methods, Deep Learning Advances, and Cloud AI Deployment

This article provides a comprehensive overview of computer vision, covering its basic concepts, traditional image processing techniques, modern deep‑learning approaches, real‑world AI application cases, and the cloud infrastructure needed to support large‑scale deployment, while also offering skill‑advancement guidance.

AI ApplicationsImage processingcloud AI

0 likes · 20 min read

Computer Vision Fundamentals, Traditional Methods, Deep Learning Advances, and Cloud AI Deployment

Baidu Intelligent Testing

Jan 9, 2019 · Operations

Automated Visual Assertion for Search Template Rendering Using Logical Block Modeling

The article describes a visual‑based automation framework that extracts logical blocks and relative positions from template screenshots, builds a mathematical model, and uses an expert‑system approach to automatically detect layout anomalies across diverse search result templates, reducing manual testing effort.

AutomationImage processingUI testing

0 likes · 11 min read

Automated Visual Assertion for Search Template Rendering Using Logical Block Modeling

MaGe Linux Operations

Dec 17, 2018 · Artificial Intelligence

How to Build a KNN-Based CAPTCHA Solver with OpenCV in Python

This tutorial walks through using OpenCV and a K‑Nearest Neighbors model to preprocess, segment, manually label, train, and finally recognize distorted, noisy CAPTCHA images, achieving about 82% accuracy on a test set of one hundred samples.

Image processingPythoncaptcha

0 likes · 12 min read

How to Build a KNN-Based CAPTCHA Solver with OpenCV in Python

37 Interactive Technology Team

Nov 27, 2018 · Artificial Intelligence

37 Xiao Luban: A Machine‑Learning Linear Regression System for Automatic Banner Generation

The article describes a PHP engineer who built a machine‑learning linear regression system called 37 Xiao Luban to automatically generate game banner images, cutting production time from hours to minutes, using polynomial regression on collected scaling data, achieving 80‑90% usability.

AIAutomationBanner Generation

0 likes · 7 min read

37 Xiao Luban: A Machine‑Learning Linear Regression System for Automatic Banner Generation

21CTO

Nov 21, 2018 · Artificial Intelligence

What’s Driving the Rapid Evolution of Face Recognition Technology?

This comprehensive overview examines the fundamentals, historical milestones, key algorithms, major datasets, policy support, industry applications, and future trends of face recognition technology, highlighting its rapid growth within computer vision and artificial intelligence.

AIBiometricsImage processing

0 likes · 45 min read

What’s Driving the Rapid Evolution of Face Recognition Technology?

MaGe Linux Operations

Nov 18, 2018 · Artificial Intelligence

How to Crack Image Captchas with Python: Generation, Pre‑processing, and OCR

This tutorial walks through the four main captcha types, focuses on image captchas, explains generation with the Claptcha library, details preprocessing steps such as grayscale conversion, binarization, denoising, and character segmentation, and demonstrates recognition using tesserocr, while showing the impact of noise and interference lines.

Image processingOCRcaptcha

0 likes · 6 min read

How to Crack Image Captchas with Python: Generation, Pre‑processing, and OCR

Sohu Tech Products

Oct 24, 2018 · Artificial Intelligence

Intelligent News Image Formatter: AI‑Based Cropping and Selection System for News List Images

This article introduces the Intelligent News Formatter, an AI‑driven system that tackles news‑app list‑image problems by using face detection, object detection, deep‑learning based cropping, image quality filtering, and similarity removal to automatically produce aesthetically pleasing and information‑rich thumbnails.

AIImage processingface detection

0 likes · 14 min read

Intelligent News Image Formatter: AI‑Based Cropping and Selection System for News List Images

Youku Technology

Oct 24, 2018 · Artificial Intelligence

Intelligent Video Content Production and Automated Editing Platform – Technical Overview

Alibaba Youku built Milan ZhiYun platform that uses AI video understanding to automatically extract highlights, generate covers, and reassemble fragments from long‑form videos, leveraging scene detection, beautification, bullet‑comment analysis, and subjective quality evaluation to dramatically improve editing efficiency and user engagement.

Content AutomationImage processingVideo AI

0 likes · 12 min read

Intelligent Video Content Production and Automated Editing Platform – Technical Overview

Alibaba Cloud Developer

Oct 18, 2018 · Artificial Intelligence

AI-Powered Smart Document Processing for International Trade

This article outlines how Alibaba engineers apply AI, image processing, natural language processing, and knowledge‑graph techniques to automate and secure the handling of complex, image‑heavy trade documents, dramatically improving efficiency, reducing risk, and enabling scalable, low‑cost solutions for SMEs in international commerce.

AIDocument AutomationImage processing

0 likes · 14 min read

AI-Powered Smart Document Processing for International Trade

Python Programming Learning Circle

Aug 30, 2018 · Backend Development

How to Generate Circular Avatars in PHP Using Layered Image Design

This tutorial explains how to create circular user avatars in PHP by stacking three image layers—a base layer, the uploaded user photo, and a transparent circular mask—detailing the design concept and showing the essential avatar.class.php implementation.

Image processingLayered DesignPHP

0 likes · 2 min read

How to Generate Circular Avatars in PHP Using Layered Image Design

JD Tech

Aug 29, 2018 · Operations

ImageMagick Command‑Line Guide: Installation, Basic Commands, Formats and Practical Examples

This tutorial explains how to install ImageMagick, use its core command‑line tools such as magick, convert, identify, mogrify and composite, format image resize commands, add watermarks, generate captchas, clone and append images, convert between GIF, PDF and other formats, and integrate the tool with Node.js scripts.

GraphicsImage processingImageMagick

0 likes · 19 min read

ImageMagick Command‑Line Guide: Installation, Basic Commands, Formats and Practical Examples

MaGe Linux Operations

Aug 22, 2018 · Fundamentals

Automate Meme Creation with Python: A Step‑by‑Step Image Processing Guide

This tutorial shows how to use Python to automate the creation of meme‑style emoticon images by preparing assets, overlaying pictures and text, and handling font sizing, providing complete code examples and practical tips for batch production.

AutomationGraphicsImage processing

0 likes · 6 min read

Automate Meme Creation with Python: A Step‑by‑Step Image Processing Guide

Tencent Cloud Developer

Aug 13, 2018 · Artificial Intelligence

Computer Vision Technology: From Viral Social Media Apps to Enterprise AI Applications

The article surveys computer‑vision fundamentals and evolution—from early filters and feature extractors to modern deep‑learning models—illustrating how techniques like face detection, image matching, and caption generation powered viral social‑media trends and now underpin enterprise AI services on Tencent Cloud, while offering practical implementation and skill‑development guidance.

AI ApplicationsCNNImage processing

0 likes · 18 min read

Computer Vision Technology: From Viral Social Media Apps to Enterprise AI Applications

Tencent Music Tech Team

Jul 27, 2018 · Mobile Development

Understanding Android Nine‑Patch Images: Creation, Usage, and Internals

The article explains Android’s Nine‑Patch format—a bitmap with a 1‑pixel black border defining stretchable and content areas—covers creation methods (Photoshop, Draw9patch, Android Studio), runtime parsing, practical use in network‑loaded chat bubbles, required conversion workflow, and the underlying PNG “npTc” chunk structure with code examples for manual drawable construction.

AndroidImage processingMobile Development

0 likes · 20 min read

Understanding Android Nine‑Patch Images: Creation, Usage, and Internals

AntTech

Jun 1, 2018 · Mobile Development

Optimizing QR Code Scanning: Boosting Recognition Rate, Cutting Latency, and Enhancing Robustness

This article details how Alipay's scanning technology team improved QR code recognition by refining aspect‑ratio tolerance, introducing new pattern detection modes, applying diagonal filtering, leveraging logistic‑regression classification, adjusting jump‑line intervals, and moving binarization to GPU, resulting in a 6.95‑point increase in recognition rate and significantly reduced processing time.

Image processingQR codealgorithm optimization

0 likes · 12 min read

Optimizing QR Code Scanning: Boosting Recognition Rate, Cutting Latency, and Enhancing Robustness

Test Development Learning Exchange

May 31, 2018 · Artificial Intelligence

Decode QR Codes in Python with Zxing: A Complete Step-by-Step Tutorial

This guide walks through setting up Python’s Zxing library to decode QR code images, covering installation, handling 64‑bit compatibility issues, a complete script that creates temporary files, reads barcodes, logs results, and cleans up, enabling quick integration into your projects.

Image processingPythonQR code

0 likes · 4 min read

Decode QR Codes in Python with Zxing: A Complete Step-by-Step Tutorial

Architecture Digest

May 19, 2018 · Artificial Intelligence

Optical Flow: Principles, Evolution, and Applications in Computer Vision

This article explains the fundamentals of optical flow, traces its development from early variational methods to modern deep‑learning models like FlowNet, and discusses practical applications such as video object detection, semantic segmentation, and novel view synthesis, highlighting both technical challenges and future research directions.

FlowNetImage processingLucas-Kanade

0 likes · 14 min read

Optical Flow: Principles, Evolution, and Applications in Computer Vision

21CTO

May 8, 2018 · Artificial Intelligence

How Optical Flow Powers 360° Product Views and Advanced Vision Applications

This article explores the evolution and principles of optical flow—from early Horn‑Schunck models and Lucas‑Kanade to modern deep‑learning approaches like FlowNet—detailing its role in JD’s 360° product imaging, video detection, segmentation, view synthesis, and future research challenges in computer vision.

Image processingdeep learningoptical flow

0 likes · 15 min read

How Optical Flow Powers 360° Product Views and Advanced Vision Applications

JD Tech

May 4, 2018 · Artificial Intelligence

Optical Flow: Principles, Methods, and Applications in Computer Vision

This article introduces the fundamentals and evolution of optical flow, covering classic algorithms such as Horn‑Schunck and Lucas‑Kanade, modern deep‑learning approaches like FlowNet, and their practical applications in video detection, semantic segmentation, and novel view synthesis.

CNNImage processingdeep learning

0 likes · 15 min read

Optical Flow: Principles, Methods, and Applications in Computer Vision

Ctrip Technology

May 2, 2018 · Artificial Intelligence

Document OCR: From Computer Vision Fundamentals to Ctrip's Full-Text OCR Implementation

This article explains the evolution of optical character recognition, outlines the complete OCR processing pipeline—including image input, preprocessing, binarization, noise removal, tilt correction, layout analysis, character segmentation, recognition, and post‑processing—while showcasing Ctrip's real‑world OCR project, its architecture, accuracy metrics, and key computer‑vision techniques such as CNN, HSV, HOG, LBP, and Haar features.

CNNImage processingOCR

0 likes · 13 min read

Document OCR: From Computer Vision Fundamentals to Ctrip's Full-Text OCR Implementation

MaGe Linux Operations

Apr 13, 2018 · Fundamentals

Automate Meme Creation with Python: Image Overlay and Text Styling

This tutorial shows how to use Python and the Pillow library to programmatically combine a base image with a meme face, overlay custom text, and generate complete meme pictures, while also discussing challenges of fitting mixed‑language captions within limited space.

Image processingPILPython

0 likes · 6 min read

Automate Meme Creation with Python: Image Overlay and Text Styling

Java Captain

Jan 26, 2018 · Backend Development

Generating and Parsing QR Code Images in Java with ZXing

This article demonstrates how to use Google's ZXing Core library in a Java Maven project to generate QR code images, create random verification codes, and decode QR codes from files, providing complete code examples and step-by-step instructions.

Image processingJavaQR code

0 likes · 12 min read

Generating and Parsing QR Code Images in Java with ZXing

AI Cyberspace

Dec 30, 2017 · Artificial Intelligence

Revive Your 18‑Year‑Old Look with a 30‑Line Python OpenCV Beauty Filter

On the cusp of the 2017 year‑end, the author humorously marks the last 90‑post turning 18 while offering a Python‑OpenCV tutorial that, in just 30 lines of code, applies a beauty filter to make anyone look younger, complete with installation steps and sample output.

Beauty FilterImage processingPython

0 likes · 5 min read

Revive Your 18‑Year‑Old Look with a 30‑Line Python OpenCV Beauty Filter

MaGe Linux Operations

Nov 19, 2017 · Information Security

How to Crack Sliding Puzzle Captchas with Simple Python Image Analysis

This article explores the security weaknesses of sliding puzzle captchas by collecting multiple samples, performing qualitative and quantitative image analysis, and presenting a Python script that automatically identifies the puzzle gap using pixel differences and statistical thresholds.

Image processingPythoncaptcha

0 likes · 7 min read

How to Crack Sliding Puzzle Captchas with Simple Python Image Analysis

CoolHome R&D Department

Nov 10, 2017 · Fundamentals

How to Extract Dominant Colors for Product Search Using MeanShift & FloodFill

This article explains a practical pipeline for extracting dominant colors from product images—using image resizing, MeanShift and FloodFill segmentation, region merging, Gaussian weighting, selection, and CIELab-based tagging—enabling accurate color-based search in e‑commerce platforms.

FloodFillImage processingMeanShift

0 likes · 13 min read

How to Extract Dominant Colors for Product Search Using MeanShift & FloodFill

Aotu Lab

Nov 3, 2017 · Game Development

Build a One‑Stroke Puzzle Game with Auto‑Graph Recognition

This article explains how to implement a one‑stroke puzzle game using graph theory, describes the two‑step rendering process, provides JavaScript pseudocode for touch interaction, and details an automatic image‑recognition tool that extracts level data from hand‑drawn diagrams while optimizing performance.

CanvasImage processingJavaScript

0 likes · 15 min read

Build a One‑Stroke Puzzle Game with Auto‑Graph Recognition

21CTO

Nov 2, 2017 · Artificial Intelligence

Step-by-Step Guide to Building a Face Recognition System on Ubuntu with Python

This tutorial walks through setting up Ubuntu 17.10 with Python 2.7, installing required packages, compiling dlib, and using the face_recognition library to detect, identify, and beautify faces through multiple code examples.

AIImage processingcomputer vision

0 likes · 9 min read

Step-by-Step Guide to Building a Face Recognition System on Ubuntu with Python

MaGe Linux Operations

Oct 7, 2017 · Artificial Intelligence

How to Crack Image Captchas with Python: Grayscale, Binarization, and Tesserocr

This tutorial explains the four main captcha types, focuses on image‑based captchas, and walks through generating, preprocessing (grayscale, contrast, binarization, denoising, skew correction), and recognizing them with Python's Claptcha library and the Tesserocr OCR engine.

Image processingOCRPython

0 likes · 7 min read

How to Crack Image Captchas with Python: Grayscale, Binarization, and Tesserocr

UCloud Tech

Aug 11, 2017 · Cloud Computing

How Serverless Architecture Powers AI Image Style Transfer with UCloud

This article explains the Serverless concept, compares it with PaaS and BaaS, and provides a step‑by‑step guide to building an AI‑driven Van Gogh style‑transfer service using UCloud's Serverless (UGC) platform, covering both backend setup and frontend interaction.

AIFaaSImage processing

0 likes · 9 min read

How Serverless Architecture Powers AI Image Style Transfer with UCloud

WeChat Client Technology Team

Jun 20, 2017 · Artificial Intelligence

Boosting Mobile Video Calls: Real-Time Low-Light Enhancement Using Spatial‑Temporal Constraints

This article presents a real‑time low‑lighting video enhancement technique that combines spatial and temporal constraints to brighten mobile video calls without flicker, offering low computational cost and robust performance validated through subjective and objective experiments.

Image processinglow-light videomobile-communication

0 likes · 11 min read

Boosting Mobile Video Calls: Real-Time Low-Light Enhancement Using Spatial‑Temporal Constraints

WeChat Backend Team

Jun 20, 2017 · Artificial Intelligence

How Constrained Spatial‑Temporal Modeling Boosts Low‑Light Video Calls in Real Time

Researchers present a real‑time low‑lighting video enhancement technique for mobile communication that combines spatial brightness and contrast constraints with temporal consistency, delivering brighter, flicker‑free video without over‑enhancement, while maintaining low computational cost suitable for smartphones.

Image processinglow-light videomobile-communication

0 likes · 12 min read

How Constrained Spatial‑Temporal Modeling Boosts Low‑Light Video Calls in Real Time

MaGe Linux Operations

May 8, 2017 · Game Development

How to Build a Python Bot to Automate a Simple Web Game

This tutorial explains how to use Python libraries such as autopy, PIL, and pywin32 to capture screen images, recognize game elements, and automate mouse movements and clicks, enabling a bot to automatically play a sushi‑making browser game.

Game BotImage processingPython

0 likes · 10 min read

How to Build a Python Bot to Automate a Simple Web Game

Tongcheng Travel Technology Center

May 5, 2017 · Artificial Intelligence

Improving Passport OCR: Process, Preprocessing, and Prior Knowledge Corrections

This article outlines a comprehensive OCR workflow for passport recognition, covering image acquisition, preprocessing techniques, engine integration, and prior‑knowledge corrections to enhance accuracy and user experience, while sharing practical insights and performance results.

AIImage processingOCR

0 likes · 8 min read

Improving Passport OCR: Process, Preprocessing, and Prior Knowledge Corrections

21CTO

Dec 24, 2016 · Backend Development

Generate Thumbnails with GD2 and ImageMagick in CodeIgniter

This guide explains why thumbnails are essential, outlines common use cases, and provides step‑by‑step CodeIgniter examples for creating 50×50 thumbnails using PHP's GD2 library and ImageMagick, including configuration, resizing, and error handling.

CodeIgniterGD2Image processing

0 likes · 8 min read

Generate Thumbnails with GD2 and ImageMagick in CodeIgniter

Architecture Digest

Nov 25, 2016 · Artificial Intelligence

Simple Python Facial Recognition with OpenCV in Ten Lines of Code

This article demonstrates how to build a lightweight facial‑recognition tool using Python and OpenCV, explains the underlying concepts such as cascade classifiers and image preprocessing, and provides a fully commented ten‑line script together with installation instructions and usage examples.

Image processingfacial recognitionopencv

0 likes · 10 min read

Simple Python Facial Recognition with OpenCV in Ten Lines of Code

Qunar Tech Salon

Aug 8, 2016 · Artificial Intelligence

OCR Technology Overview and Implementation Steps for Card Number Recognition

This article provides a comprehensive overview of OCR technology, explains its definition and application scenarios, and details a five‑step workflow—including target extraction, preprocessing, character localization, digit matching, and format validation—specifically illustrated with bank card number recognition.

Bank Card RecognitionImage processingMorphological Operations

0 likes · 9 min read

OCR Technology Overview and Implementation Steps for Card Number Recognition

Java High-Performance Architecture

Jul 2, 2016 · Cloud Computing

How Serverless FaaS Redefines Image Processing with AWS Lambda

This article explains the Function-as-a-Service (FaaS) model, illustrates how AWS Lambda can replace traditional server‑centric image upload workflows by handling scaling, storage, and database updates through event‑driven functions, and discusses the broader impact of serverless on cloud architecture.

AWS LambdaFaaSImage processing

0 likes · 5 min read

How Serverless FaaS Redefines Image Processing with AWS Lambda

21CTO

Jun 7, 2016 · Backend Development

How Twitter Scales to 3,000 Images per Second: Architecture Secrets Revealed

Twitter’s media platform, launched in 2016, processes up to 3,000 images per second by decoupling media from tweets, using chunked resumable uploads, progressive JPEGs, on‑demand variant generation, and efficient storage policies that saved millions of dollars, illustrating modern backend scaling techniques.

Backend EngineeringImage processingMedia Platform

0 likes · 14 min read

How Twitter Scales to 3,000 Images per Second: Architecture Secrets Revealed

Java High-Performance Architecture

Jun 4, 2016 · Databases

Unlock Redis Power: How the New Module System Extends Functionality

Redis Labs introduces the Redis Module System, enabling developers to extend Redis with C‑based modules that provide advanced features such as in‑database image processing via GraphicsMagick and enhanced string operations like CHECKAND and PREPEND, fostering a growing ecosystem of shared modules.

DatabasesImage processingModules

0 likes · 4 min read

Unlock Redis Power: How the New Module System Extends Functionality

High Availability Architecture

Mar 18, 2016 · Backend Development

High‑Availability Architecture of Upyun Image Processing Service

The article details Upyun's high‑availability image processing architecture, covering workload‑aware system design, custom GmServer implementation, task scheduling, current strengths and limitations, and future directions such as a ServiceServer‑based queue and Docker‑driven dynamic scaling.

Cloud ServicesDockerHigh Availability

0 likes · 15 min read

High‑Availability Architecture of Upyun Image Processing Service

Architecture Digest

Mar 14, 2016 · Mobile Development

Optimizing Mobile Taobao Main Venue Performance and Personalization with a Reusable Framework and Image‑Merging System

The article details how Alibaba's Mobile Taobao team redesigned the main promotional venue using a reusable native framework, dynamic Weex rendering, cloud‑driven configuration, pre‑heat data push, multi‑strategy routing, and a large‑scale image‑merging system to improve browsing speed, reduce bounce rates, and support flexible, personalized large‑scale sales events.

Image processingPerformance Optimizationframework design

0 likes · 15 min read

Optimizing Mobile Taobao Main Venue Performance and Personalization with a Reusable Framework and Image‑Merging System

ITPUB

Jan 28, 2016 · Artificial Intelligence

Detect Your Oven’s On/Off State with Python and OpenCV

This tutorial shows how to use Python, OpenCV, and basic image‑processing techniques to automatically detect whether a kitchen oven is on by analyzing the red indicator light captured by a home camera, providing a simple safety alert system.

Home AutomationImage processingcomputer vision

0 likes · 7 min read

Detect Your Oven’s On/Off State with Python and OpenCV

21CTO

Aug 31, 2015 · Backend Development

Boost Image Delivery Speed with Bolt: A Fast Real-Time Image Resizing Server

Bolt is a high‑performance real‑time image cropping and compression server that processes each image only once, uses an LRU cache to limit memory, and offers flexible command‑line options for installation, usage, and scaling on CPU‑rich servers.

CachingImage processingLRU

0 likes · 4 min read

Boost Image Delivery Speed with Bolt: A Fast Real-Time Image Resizing Server

Ctrip Technology

Jun 29, 2015 · Artificial Intelligence

Bank Card Scanning and Recognition: Extending Support for Chinese Debit Cards

This article describes a project that enhances an open‑source card‑number scanning solution to recognize 19‑digit Chinese debit cards, addressing challenges such as black‑printed fonts, light‑colored embossed fonts, background filtering, single‑character OCR, and Luhn‑based checksum verification.

Bank Card RecognitionImage processingOCR

0 likes · 6 min read

Bank Card Scanning and Recognition: Extending Support for Chinese Debit Cards

Qunar Tech Salon

May 25, 2015 · Operations

Improving Image Conversion Performance with /dev/shm tmpfs in Qunar's Group Buying Backend

The article describes how Qunar's group‑buying backend faced IO bottlenecks during large‑scale image conversion and solved the issue by moving intermediate files to a memory‑based tmpfs filesystem (/dev/shm), achieving a dramatic load reduction without code changes or server restarts.

IO optimizationImage processingLinux

0 likes · 5 min read

Improving Image Conversion Performance with /dev/shm tmpfs in Qunar's Group Buying Backend