Tagged articles
650 articles
Page 7 of 7
Tencent Cloud Developer
Tencent Cloud Developer
Aug 1, 2018 · Artificial Intelligence

How AI Powers Real-World Apps: From Face Filters to Medical Imaging

The July 28 Tencent Cloud community salon in Beijing gathered five AI experts who demonstrated practical AI applications—including computer‑vision face filters, OCR services, smart construction attendance, game AI, and breast‑cancer detection—showing how cloud‑based models, data pipelines, and deployment strategies turn research into usable products.

AICloud AIComputer Vision
0 likes · 21 min read
How AI Powers Real-World Apps: From Face Filters to Medical Imaging
Qunar Tech Salon
Qunar Tech Salon
Jul 24, 2018 · Artificial Intelligence

Meituan's AI-Powered Image Intelligent Review System: Watermark Detection, Celebrity Face Recognition, Pornography Detection, and Scene Classification

This article describes Meituan's large‑scale AI‑driven image moderation platform, detailing deep‑learning based watermark detection, celebrity face recognition, pornographic image detection, and scene classification techniques, along with system architecture, data preparation, model evaluation, and deployment considerations.

Computer VisionDeep LearningImage Moderation
0 likes · 19 min read
Meituan's AI-Powered Image Intelligent Review System: Watermark Detection, Celebrity Face Recognition, Pornography Detection, and Scene Classification
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 6, 2018 · Artificial Intelligence

How Dynamic Scale Selection Boosts Real-Time Action Prediction

This article explains online action prediction, the challenges of early‑stage classification, and introduces a Scale Selection Network that dynamically chooses optimal temporal windows using dilated convolutions, regression and classification sub‑networks, achieving state‑of‑the‑art results on two benchmark datasets.

Computer VisionDeep Learningdilated convolution
0 likes · 7 min read
How Dynamic Scale Selection Boosts Real-Time Action Prediction
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 4, 2018 · Artificial Intelligence

Turning Fashion Into AI‑Ready Data: Building Practical Image Datasets

This article explains how Alibaba's Image & Beauty team designs and iterates a practical fashion image dataset by aligning data purpose, integrating professional knowledge, handling sample scarcity and structured noise, and defining fine‑grained evaluation metrics to enable AI models that truly understand clothing.

Computer VisionKnowledge Engineeringdata annotation
0 likes · 34 min read
Turning Fashion Into AI‑Ready Data: Building Practical Image Datasets
Xianyu Technology
Xianyu Technology
Jun 30, 2018 · Artificial Intelligence

No-Reference Image Sharpness Assessment Based on Strong Edge Validity Statistics

The paper proposes a no‑reference image sharpness metric that computes strong‑edge validity statistics—ratio of maximum directional gradient sum to squared strong‑edge count—across image blocks, classifies them into grades, and effectively handles defocus and motion blur for applications such as video thumbnail selection.

Computer VisionNo-Referenceedge statistics
0 likes · 8 min read
No-Reference Image Sharpness Assessment Based on Strong Edge Validity Statistics
MaGe Linux Operations
MaGe Linux Operations
Jun 29, 2018 · Artificial Intelligence

How to Build a 200-Line Python Script for Automatic Face Swapping

This article walks through creating a concise 200‑line Python script that automatically detects facial landmarks with dlib, aligns faces using Procrustes analysis, corrects color differences, and blends a second face onto a first image, complete with code snippets and step‑by‑step explanations.

Computer VisionOpenCVdlib
0 likes · 9 min read
How to Build a 200-Line Python Script for Automatic Face Swapping
Qunar Tech Salon
Qunar Tech Salon
Jun 29, 2018 · Artificial Intelligence

Face Recognition with OpenCV, Python, and Deep Learning

This tutorial explains how to implement high‑accuracy face recognition using OpenCV, Python, and deep learning by leveraging dlib's deep metric learning, creating a custom dataset, encoding facial embeddings, and performing real‑time identification on images and video streams.

Computer VisionDeep LearningOpenCV
0 likes · 30 min read
Face Recognition with OpenCV, Python, and Deep Learning
Meituan Technology Team
Meituan Technology Team
Jun 28, 2018 · Artificial Intelligence

Deep Learning-Based OCR Techniques at Meituan

Meituan’s OCR system replaces the classic preprocess‑segment‑recognize pipeline with deep‑learning components—CNN‑based text detection, synthetic‑data‑trained character models, and BLSTM‑CTC sequence recognition—delivering far higher accuracy on noisy, varied real‑world images such as menus, receipts, and IDs, though further integration with layout analysis remains needed.

Computer VisionOCRSequence Learning
0 likes · 22 min read
Deep Learning-Based OCR Techniques at Meituan
Alibaba Cloud Developer
Alibaba Cloud Developer
Jun 27, 2018 · Artificial Intelligence

How Context-Contrast Features and Gated Multi‑Scale Fusion Boost Scene Segmentation

The paper introduces a context‑contrast local feature and a gated multi‑scale fusion mechanism that together enhance pixel‑level scene segmentation, especially for inconspicuous objects, and validates the approach with state‑of‑the‑art results on Pascal Context, SUN‑RGBD, and COCO Stuff datasets.

Computer VisionDeep Learningcontext contrast
0 likes · 6 min read
How Context-Contrast Features and Gated Multi‑Scale Fusion Boost Scene Segmentation
Tencent Cloud Developer
Tencent Cloud Developer
Jun 8, 2018 · Industry Insights

How AI and Data Are Powering the Next Retail Revolution with Tencent Cloud

The article analyzes the challenges of declining online traffic and low in‑store conversion, then details how Tencent Cloud’s AI‑driven retail suite—covering scientific site selection, commercial‑area insights, computer‑vision store management, one‑code product tracking, and brand sentiment analysis—offers data‑powered solutions for modern retailers.

AIComputer VisionRetail analytics
0 likes · 12 min read
How AI and Data Are Powering the Next Retail Revolution with Tencent Cloud
AntTech
AntTech
Jun 1, 2018 · Mobile Development

Optimizing QR Code Scanning: Boosting Recognition Rate, Cutting Latency, and Enhancing Robustness

This article details how Alipay's scanning technology team improved QR code recognition by refining aspect‑ratio tolerance, introducing new pattern detection modes, applying diagonal filtering, leveraging logistic‑regression classification, adjusting jump‑line intervals, and moving binarization to GPU, resulting in a 6.95‑point increase in recognition rate and significantly reduced processing time.

Algorithm OptimizationComputer VisionImage Processing
0 likes · 12 min read
Optimizing QR Code Scanning: Boosting Recognition Rate, Cutting Latency, and Enhancing Robustness
Architecture Digest
Architecture Digest
May 19, 2018 · Artificial Intelligence

Optical Flow: Principles, Evolution, and Applications in Computer Vision

This article explains the fundamentals of optical flow, traces its development from early variational methods to modern deep‑learning models like FlowNet, and discusses practical applications such as video object detection, semantic segmentation, and novel view synthesis, highlighting both technical challenges and future research directions.

Computer VisionDeep LearningFlowNet
0 likes · 14 min read
Optical Flow: Principles, Evolution, and Applications in Computer Vision
360 Tech Engineering
360 Tech Engineering
May 17, 2018 · Artificial Intelligence

Applying Image Recognition in UI Automation Testing with Sikuli

This article introduces how image‑recognition techniques, particularly using the Sikuli tool, can be applied to UI automation testing for both web and mobile applications, covering practical scenarios, core principles, a suite of useful functions, example code, and the advantages and limitations of the approach.

Computer VisionSikuliUI automation
0 likes · 7 min read
Applying Image Recognition in UI Automation Testing with Sikuli
360 Quality & Efficiency
360 Quality & Efficiency
May 16, 2018 · Fundamentals

Applying Image Recognition in UI Automation Testing with Sikuli

This article introduces the use of image‑recognition techniques, particularly the Sikuli tool, for UI automation testing, covering typical scenarios, underlying principles, key functions such as Find, click, wait, and type, as well as example code, and discusses the advantages and limitations of this approach.

Computer VisionJythonSikuli
0 likes · 7 min read
Applying Image Recognition in UI Automation Testing with Sikuli
Ctrip Technology
Ctrip Technology
May 2, 2018 · Artificial Intelligence

Document OCR: From Computer Vision Fundamentals to Ctrip's Full-Text OCR Implementation

This article explains the evolution of optical character recognition, outlines the complete OCR processing pipeline—including image input, preprocessing, binarization, noise removal, tilt correction, layout analysis, character segmentation, recognition, and post‑processing—while showcasing Ctrip's real‑world OCR project, its architecture, accuracy metrics, and key computer‑vision techniques such as CNN, HSV, HOG, LBP, and Haar features.

CNNComputer VisionImage Processing
0 likes · 13 min read
Document OCR: From Computer Vision Fundamentals to Ctrip's Full-Text OCR Implementation
MaGe Linux Operations
MaGe Linux Operations
Mar 27, 2018 · Artificial Intelligence

How to Swap Faces in Images with Python: A Step‑by‑Step Guide

This article explains how to write a compact Python script (about 200 lines) that automatically detects facial landmarks with dlib, aligns two faces using Procrustes analysis, corrects color differences, and blends the second face onto the first using OpenCV, complete with full source code and visual examples.

Computer VisionOpenCVPython
0 likes · 13 min read
How to Swap Faces in Images with Python: A Step‑by‑Step Guide
Suning Technology
Suning Technology
Jan 26, 2018 · Artificial Intelligence

Suning’s ‘Beidou’ AI System Transforms Retail Customer Flow Analytics

The article details Suning’s Beidou system, an AI‑driven solution that combines video, WiFi and facial recognition to accurately count and analyze customer flow, improve store operations, and enable intelligent services such as personalized recommendations, automated payment, and safety monitoring for modern retail environments.

AIComputer VisionRetail analytics
0 likes · 13 min read
Suning’s ‘Beidou’ AI System Transforms Retail Customer Flow Analytics
JD Tech
JD Tech
Jan 26, 2018 · Artificial Intelligence

JD Big Data R&D Department Presents Three Accepted Papers at AAAI-2018

The JD Big Data R&D team announced that three of its research papers—covering cross‑domain human parsing, multi‑view outlier detection, and orthogonal weight normalization for deep neural networks—were accepted at the prestigious AAAI‑2018 conference, highlighting the department's contributions to computer vision, data mining, and deep learning.

Computer VisionCross‑domain Adaptationartificial intelligence
0 likes · 8 min read
JD Big Data R&D Department Presents Three Accepted Papers at AAAI-2018
MaGe Linux Operations
MaGe Linux Operations
Jan 21, 2018 · Artificial Intelligence

Can You Break a WordPress CAPTCHA in 15 Minutes with Machine Learning?

This tutorial shows how to generate a labeled dataset from the open‑source WordPress "Really Simple CAPTCHA" plugin, train a lightweight convolutional neural network using Python, OpenCV, Keras and TensorFlow, and decode real captchas within fifteen minutes, demonstrating the power of modern computer‑vision techniques.

Computer VisionTensorFlow
0 likes · 11 min read
Can You Break a WordPress CAPTCHA in 15 Minutes with Machine Learning?
21CTO
21CTO
Jan 6, 2018 · Artificial Intelligence

How Image Recognition Transforms Our World: Principles, Processes, and Future

This article explains the fundamentals of image recognition technology, its underlying principles, processing steps, neural‑network and nonlinear‑dimensionality‑reduction approaches, and highlights its wide‑range applications and future potential across many industries.

AIComputer VisionNeural Networks
0 likes · 11 min read
How Image Recognition Transforms Our World: Principles, Processes, and Future
AI Cyberspace
AI Cyberspace
Dec 30, 2017 · Artificial Intelligence

Revive Your 18‑Year‑Old Look with a 30‑Line Python OpenCV Beauty Filter

On the cusp of the 2017 year‑end, the author humorously marks the last 90‑post turning 18 while offering a Python‑OpenCV tutorial that, in just 30 lines of code, applies a beauty filter to make anyone look younger, complete with installation steps and sample output.

Beauty FilterComputer VisionImage Processing
0 likes · 5 min read
Revive Your 18‑Year‑Old Look with a 30‑Line Python OpenCV Beauty Filter
Node Underground
Node Underground
Nov 24, 2017 · Artificial Intelligence

Build Your First Node.js Face Recognition App with opencv4nodejs

This article introduces how to leverage the opencv4nodejs Node.js module—binding OpenCV’s full API—to develop a face detection and recognition application, highlighting the CPU‑intensive nature of computer‑vision tasks, the limitations of JavaScript, and the availability of synchronous and asynchronous examples.

Computer VisionNode.jsOpenCV
0 likes · 2 min read
Build Your First Node.js Face Recognition App with opencv4nodejs
MaGe Linux Operations
MaGe Linux Operations
Nov 5, 2017 · Artificial Intelligence

How Deep Learning Transforms Modern Face Recognition: From Basics to DeepFace

This article surveys the evolution of face recognition from traditional image‑based methods to real‑time video processing, highlights key researchers and open‑source projects, explains the four‑stage pipeline, details DeepFace's deep‑learning architecture, and provides practical installation and usage instructions for Python developers.

CNNComputer VisionDatasets
0 likes · 21 min read
How Deep Learning Transforms Modern Face Recognition: From Basics to DeepFace
MaGe Linux Operations
MaGe Linux Operations
Nov 1, 2017 · Artificial Intelligence

Master Face Recognition on Ubuntu with Python: Step‑by‑Step Guide

This guide walks through setting up Ubuntu 17.10 with Python 2.7, installing dlib and the face_recognition library, and demonstrates five practical examples ranging from a one‑line face‑recognition command to facial feature extraction and beautification, complete with code snippets and screenshots.

Computer VisionPythonUbuntu
0 likes · 10 min read
Master Face Recognition on Ubuntu with Python: Step‑by‑Step Guide
Alibaba Cloud Developer
Alibaba Cloud Developer
Oct 25, 2017 · Artificial Intelligence

How Hierarchical Multimodal LSTM Boosts Image Captioning Accuracy

This article reviews an ICCV paper introducing a hierarchical multimodal LSTM that jointly embeds images, phrases, and whole sentences, enabling detailed image descriptions and superior performance on Flickr30K, MS‑COCO, and region‑phrase datasets compared to previous methods.

Computer VisionImage CaptioningMultimodal Learning
0 likes · 8 min read
How Hierarchical Multimodal LSTM Boosts Image Captioning Accuracy
21CTO
21CTO
Oct 20, 2017 · Artificial Intelligence

How Pornhub’s New AI Identifies Adult Stars in Videos

Pornhub unveiled an AI model that uses computer‑vision techniques to automatically recognize and tag over ten thousand adult performers, allowing users to search more precisely while also involving human reviewers to verify and improve the system’s accuracy.

Adult IndustryComputer Visionartificial intelligence
0 likes · 5 min read
How Pornhub’s New AI Identifies Adult Stars in Videos
Architecture Digest
Architecture Digest
Sep 30, 2017 · Artificial Intelligence

Overview of Prominent Deep Learning Architectures for Computer Vision

This article surveys recent progress in deep learning by presenting key computer‑vision architectures such as AlexNet, VGG, GoogleNet, ResNet, ResNeXt, RCNN, YOLO, SqueezeNet, SegNet and GANs, providing brief descriptions, their advantages, and links to original papers and Keras implementations.

Computer VisionDeep LearningKeras
0 likes · 16 min read
Overview of Prominent Deep Learning Architectures for Computer Vision
Tongcheng Travel Technology Center
Tongcheng Travel Technology Center
Sep 8, 2017 · Artificial Intelligence

Challenges and Techniques in Image Search: Facenet Model and Triplet Loss

The article discusses the evolution of image search engines, outlines key challenges such as image quality, watermarks, speed, and feature extraction, and explains how the Facenet deep‑learning model with Triplet loss can be used to generate compact image embeddings for efficient similarity search.

Computer VisionDeep Learningfacenet
0 likes · 7 min read
Challenges and Techniques in Image Search: Facenet Model and Triplet Loss
Architecture Digest
Architecture Digest
Aug 1, 2017 · Artificial Intelligence

Comprehensive Overview of Autonomous Driving Technologies, Companies, and Industry Trends

This article provides a detailed overview of autonomous driving, covering its evolution from electric and shared vehicles, major industry players, technical definitions, SAE level classifications, core modules such as perception, localization, decision and control, key datasets like KITTI, and emerging business opportunities in the sector.

AIComputer VisionSensor Fusion
0 likes · 19 min read
Comprehensive Overview of Autonomous Driving Technologies, Companies, and Industry Trends
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 28, 2017 · Artificial Intelligence

Inside Alibaba AI Lab: Dr. Wang Gang on Multimodal AI and Edge Computing

In an exclusive interview, Alibaba AI Lab's distinguished scientist Dr. Wang Gang discusses the lab's research on multimodal AI, edge computing, AI hardware, bio‑inspired cognition, quantum‑deep‑learning integration, and the challenges of moving from recognition to true understanding, while also outlining Alibaba's AI talent recruitment plans.

AI researchAI talent recruitmentComputer Vision
0 likes · 25 min read
Inside Alibaba AI Lab: Dr. Wang Gang on Multimodal AI and Edge Computing
MaGe Linux Operations
MaGe Linux Operations
Jul 28, 2017 · Artificial Intelligence

How to Capture GTA V Game Frames in Python with OpenCV for AI Experiments

This tutorial walks through using Python and OpenCV to capture screen images from GTA V, covering screen grabbing, converting to NumPy arrays, improving frame rates, handling key and controller input, and setting up a basic autonomous driving test environment, all without relying on pre‑existing libraries.

Computer VisionGTA VOpenCV
0 likes · 7 min read
How to Capture GTA V Game Frames in Python with OpenCV for AI Experiments
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 24, 2017 · Artificial Intelligence

How Alibaba’s AI Beats the KITTI Benchmark and Revolutionizes Visual Shopping

Alibaba’s AI breakthroughs—from a foot‑scanning shopping demo that lets a Google engineer instantly find matching shoes, to a record‑setting vehicle detection model on KITTI and world‑leading OCR for real‑time image review—showcase the power and commercial potential of modern computer‑vision research.

AIComputer VisionDeep Learning
0 likes · 5 min read
How Alibaba’s AI Beats the KITTI Benchmark and Revolutionizes Visual Shopping
High Availability Architecture
High Availability Architecture
Jul 14, 2017 · Artificial Intelligence

Facial Emotion Recognition Using Convolutional Neural Networks: Dataset, Model Architecture, and Evaluation

This article presents a deep‑learning approach for recognizing seven basic human facial expressions using a balanced FER2013 dataset, describes the CNN architecture built with Keras and OpenCV preprocessing, reports training on AWS GPU, and analyzes validation results and visualizations.

AWS GPUCNNComputer Vision
0 likes · 11 min read
Facial Emotion Recognition Using Convolutional Neural Networks: Dataset, Model Architecture, and Evaluation
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 5, 2017 · Artificial Intelligence

Is This the New Golden Age of Visual AI? Insights from Alibaba Cloud

The article reviews the three historic AI booms, explains why today’s cloud‑based visual intelligence represents a distinct era, outlines five key factors for successful visual AI, and showcases real‑world Alibaba Cloud applications such as product search, city‑wide monitoring, medical diagnosis, and visual advertising.

AI applicationsAlibaba CloudBig Data
0 likes · 18 min read
Is This the New Golden Age of Visual AI? Insights from Alibaba Cloud
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 24, 2016 · Artificial Intelligence

How Deep Learning Revives Image Search: From Sunset to Tomorrow

Image search, once limited by early CBIR techniques, has surged back thanks to deep learning, offering improved relevance, coverage, scalability, and user experience across applications like e‑commerce, shopping, entertainment, and surveillance, while integrating data, users, models, and systems to bridge the semantic gap.

Computer VisionDeep Learninge‑commerce
0 likes · 5 min read
How Deep Learning Revives Image Search: From Sunset to Tomorrow
Ctrip Technology
Ctrip Technology
Aug 12, 2016 · Artificial Intelligence

Deep Learning Meetup Recap: Applications in Travel, Advertising, NLP, Computer Vision, and Knowledge Graphs

Last month Ctrip Technology Center hosted a deep‑learning meetup featuring academic and industry experts from UCL, Fudan, Southeast University, Nanjing University, Huawei, Sogou and others, who presented real‑world applications of deep learning in travel, advertising, natural language processing, computer vision, and knowledge graphs.

AI applicationsAdvertisingComputer Vision
0 likes · 6 min read
Deep Learning Meetup Recap: Applications in Travel, Advertising, NLP, Computer Vision, and Knowledge Graphs
Qunar Tech Salon
Qunar Tech Salon
Aug 8, 2016 · Artificial Intelligence

OCR Technology Overview and Implementation Steps for Card Number Recognition

This article provides a comprehensive overview of OCR technology, explains its definition and application scenarios, and details a five‑step workflow—including target extraction, preprocessing, character localization, digit matching, and format validation—specifically illustrated with bank card number recognition.

Bank Card RecognitionComputer VisionImage Processing
0 likes · 9 min read
OCR Technology Overview and Implementation Steps for Card Number Recognition
Meiyou UED
Meiyou UED
Jul 20, 2016 · Artificial Intelligence

What Is AR? Understanding Augmented Reality Behind Pokemon Go

This article explains augmented reality (AR), compares it with virtual reality (VR), describes the technical pipeline and design challenges of AR, illustrates its unique interaction methods, and discusses current limitations and future possibilities, using Pokemon Go as a concrete example.

ARComputer VisionInteraction Design
0 likes · 7 min read
What Is AR? Understanding Augmented Reality Behind Pokemon Go
Ctrip Technology
Ctrip Technology
Jul 18, 2016 · Artificial Intelligence

Deep Learning Applications in Ctrip Travel Guide Community

This article reviews how Ctrip’s travel guide community leverages deep learning models such as CNN, LSTM, and RCNN for multilingual text analysis, image classification, video moderation, and data matching, and outlines future directions like knowledge graphs and virtual reality.

AI applicationsComputer Visionnatural language processing
0 likes · 6 min read
Deep Learning Applications in Ctrip Travel Guide Community
21CTO
21CTO
Jan 29, 2016 · Artificial Intelligence

How Mobile Image Search Powers Real-Time Shopping: Inside Pailitao’s AI Algorithm

Mobile visual search, a long‑standing dream, has evolved from early research to a production‑grade system at Pailitao, where a five‑module AI pipeline—category prediction, object detection, feature extraction, indexing, and ranking—enables billions of images to be searched instantly on mobile devices.

Computer VisionDeep LearningMobile AI
0 likes · 8 min read
How Mobile Image Search Powers Real-Time Shopping: Inside Pailitao’s AI Algorithm
ITPUB
ITPUB
Jan 28, 2016 · Artificial Intelligence

Detect Your Oven’s On/Off State with Python and OpenCV

This tutorial shows how to use Python, OpenCV, and basic image‑processing techniques to automatically detect whether a kitchen oven is on by analyzing the red indicator light captured by a home camera, providing a simple safety alert system.

Computer VisionHome AutomationImage Processing
0 likes · 7 min read
Detect Your Oven’s On/Off State with Python and OpenCV
Architects Research Society
Architects Research Society
Oct 11, 2015 · Artificial Intelligence

Decision Forests for Pixel-Level Classification in Computer Vision

This article traces the evolution of computer vision from its 1960s origins, explains the challenges of image classification and semantic segmentation, and introduces pixel-level decision forest algorithms as an efficient solution for large‑scale pixel classification tasks.

Computer Visiondecision forestpixel classification
0 likes · 9 min read
Decision Forests for Pixel-Level Classification in Computer Vision
Ctrip Technology
Ctrip Technology
Jun 29, 2015 · Artificial Intelligence

Bank Card Scanning and Recognition: Extending Support for Chinese Debit Cards

This article describes a project that enhances an open‑source card‑number scanning solution to recognize 19‑digit Chinese debit cards, addressing challenges such as black‑printed fonts, light‑colored embossed fonts, background filtering, single‑character OCR, and Luhn‑based checksum verification.

Bank Card RecognitionComputer VisionImage Processing
0 likes · 6 min read
Bank Card Scanning and Recognition: Extending Support for Chinese Debit Cards