Tagged articles

computer vision

667 articles · Page 6 of 7

Jul 13, 2020 · Artificial Intelligence

Master Dynamic Road Condition Analysis with Car Video – AMAP-TECH Competition Overview

The AMAP-TECH algorithm competition invites participants to develop AI models that analyze in-vehicle video sequences to determine dynamic road conditions, offering detailed dataset specifications, evaluation metrics, expert judges, schedule, and prize information for researchers in computer vision and traffic analytics.

AITraffic analysiscompetition

0 likes · 9 min read

Master Dynamic Road Condition Analysis with Car Video – AMAP-TECH Competition Overview

Youku Technology

Jul 10, 2020 · Artificial Intelligence

Mastering Video Object Segmentation: Cutting-Edge Models and Design Tricks

This technical talk introduces video object segmentation tasks, reviews leading datasets and state-of-the-art deep learning models, and shares practical network design rules and performance‑boosting techniques, presented by Prof. Wang Xinggang as part of Alibaba's MEDIA AI challenge series.

AIcomputer visiondeep learning

0 likes · 4 min read

Mastering Video Object Segmentation: Cutting-Edge Models and Design Tricks

Amap Tech

Jul 9, 2020 · Artificial Intelligence

AMAP-TECH Algorithm Competition: Dynamic Road‑Condition Analysis from In‑Vehicle Video Images

Alibaba Amap’s AMAP‑TECH competition invites participants to develop AI computer‑vision models that classify real‑time road conditions—smooth, slow, or congested—from short sequences of dash‑cam images, using a labeled dataset of 1,500 training sequences and a weighted F1‑score evaluation, with cash prizes up to ¥60,000.

AIcompetitioncomputer vision

0 likes · 8 min read

AMAP-TECH Algorithm Competition: Dynamic Road‑Condition Analysis from In‑Vehicle Video Images

Alibaba Cloud Developer

Jul 3, 2020 · Artificial Intelligence

Unlocking Visual Object Tracking: Principles, Algorithms, and Evaluation

This comprehensive review explains visual object tracking in computer vision, covering its definition, core sub‑problems of candidate generation, feature extraction, and decision making, system architecture, motion, feature and observation models, algorithm classifications, evaluation metrics, datasets, and recent research trends.

Evaluation Metricscomputer visiondeep learning

0 likes · 30 min read

Unlocking Visual Object Tracking: Principles, Algorithms, and Evaluation

Youku Technology

Jun 19, 2020 · Artificial Intelligence

Video-based Temporal Event Detection Methods

In the fourth Alibaba Digital Media Technology Night Talk, algorithm engineer Liu Xiaolong presents an overview of video‑based temporal event detection, covering its problem background, representative prior works, and the latest research advances within the MEDIA AI Algorithm Challenge series.

AlibabaTemporal Event Detectionartificial-intelligence

0 likes · 1 min read

Video-based Temporal Event Detection Methods

TAL Education Technology

Jun 18, 2020 · Artificial Intelligence

An Overview of Virtual Reality, Augmented Reality, and Vision‑Based Techniques

This article explains the fundamentals of virtual reality and its distinction from augmented reality, describes VR hardware, outlines depth‑estimation and eye‑tracking methods such as projection, Hough transform, AdaBoost and sample matching, discusses Sobel edge detection, and explores the importance of audio, haptic feedback, and immersive VR applications in education.

ARDepth EstimationImmersive Education

0 likes · 11 min read

An Overview of Virtual Reality, Augmented Reality, and Vision‑Based Techniques

360 Quality & Efficiency

May 29, 2020 · Artificial Intelligence

Image Matching Techniques: Template Matching, Feature Matching, SIFT, FLANN, and Homography

This article introduces image matching fundamentals, covering template matching methods, feature-based approaches such as SIFT and FLANN, their implementation details, matching rules, homography transformation, and practical considerations, providing a comprehensive overview for computer vision applications.

FLANNFeature MatchingSIFT

0 likes · 14 min read

Image Matching Techniques: Template Matching, Feature Matching, SIFT, FLANN, and Homography

JD Retail Technology

May 27, 2020 · Artificial Intelligence

JD ARVR Tech Department Publishes Two Papers on Defocus Blur Detection and Few-Shot Learning in Top Venues

The JD ARVR technology department announced two peer‑reviewed papers—one on a novel defocus blur detection network published in Transaction on Multimedia and another on a transductive relation‑propagation network for few‑shot learning accepted at IJCAI 2020—highlighting their advanced AI research and future AR‑VR ecosystem plans.

ARVRcomputer visiondeep learning

0 likes · 7 min read

JD ARVR Tech Department Publishes Two Papers on Defocus Blur Detection and Few-Shot Learning in Top Venues

Amap Tech

May 25, 2020 · Artificial Intelligence

Automated Production Line for Base Map Data Using Image AI and Data Fusion

Gaode’s automated production line combines deep‑learning image recognition, GPS‑enhanced location services, image differencing with semantic filtering, and standardized data‑fusion to continuously refresh China’s national base map, cutting manual effort and costs while delivering real‑time, high‑quality map updates for road traffic infrastructure.

computer visiondata fusiondeep learning

0 likes · 11 min read

Automated Production Line for Base Map Data Using Image AI and Data Fusion

ITPUB

May 14, 2020 · Artificial Intelligence

Cut & Paste Real Objects into Photoshop with AR in Under 10 Seconds

This article explains the AR Cut & Paste prototype by Cyril Diagne, detailing its three‑module architecture, the underlying BASNet and U²‑Net vision models, and provides a step‑by‑step guide—including code snippets and GitHub links—to set up the mobile app, local server, and Photoshop integration.

ARBASNetGitHub

0 likes · 8 min read

Cut & Paste Real Objects into Photoshop with AR in Under 10 Seconds

Python Programming Learning Circle

May 12, 2020 · Artificial Intelligence

Batch Image Segmentation with Python and PaddlePaddle

This tutorial demonstrates how to use Python and the PaddlePaddle deep‑learning platform to automatically remove backgrounds from multiple photos in one step, covering installation, verification, and a concise five‑line code example for batch human segmentation.

Batch ProcessingPaddlePaddlecomputer vision

0 likes · 6 min read

Batch Image Segmentation with Python and PaddlePaddle

Programmer DD

May 9, 2020 · Artificial Intelligence

ChineseOCR Lite: Ultra‑Lightweight OCR Engine for Vertical Chinese Text

ChineseOCR Lite is an open‑source, ultra‑lightweight OCR solution that supports vertical Chinese text, runs on Linux/macOS via ncnn inference, and packs detection, recognition, and angle classification models into a total of just 17 MB, offering fast and accurate scene‑text processing.

Chinese OCROCRcomputer vision

0 likes · 4 min read

ChineseOCR Lite: Ultra‑Lightweight OCR Engine for Vertical Chinese Text

Didi Tech

Apr 30, 2020 · Artificial Intelligence

DGF-M: Face Recognition Algorithm for Masked Face Scenarios

Didi’s DGF‑M model, a mask‑aware face‑recognition AI, combines multi‑task training and synthetic data to detect masks with under 0.1 % miss rate and verify identities with up to 99.5 % pass rate at a 0.1 % false‑acceptance rate, and is deployed for driver verification, offered through the Didi Cloud API marketplace, and released as an open‑source solution to aid pandemic‑era security.

AI algorithmDGF-MDidi Cloud

0 likes · 5 min read

DGF-M: Face Recognition Algorithm for Masked Face Scenarios

Amap Tech

Apr 24, 2020 · Artificial Intelligence

Q&A on Computer Vision Technologies and Their Applications in Mapping, Navigation, and Autonomous Driving

In a live Q&A, Alibaba Amap’s chief scientist Ren Xiaofeng explained how computer‑vision algorithms underpin high‑precision map creation, AR navigation, visual localization and sensor fusion, discussed current hardware limits, deep‑learning bottlenecks, 5G’s role, edge‑cloud cooperation, and offered career advice for transitioning researchers.

AIAR navigationautonomous driving

0 likes · 14 min read

Q&A on Computer Vision Technologies and Their Applications in Mapping, Navigation, and Autonomous Driving

Programmer DD

Apr 17, 2020 · Artificial Intelligence

How to Make People Vanish in Real‑Time Using TensorFlow.js and MobileNet

Jason Mayes, a Google web engineer, open‑sourced a TensorFlow.js demo that removes people from live webcam video in real time using a lightweight MobileNet model, with only about 200 lines of code, and provides GitHub and CodePen links for experimentation.

MobileNetTensorFlow.jsWebcam

0 likes · 9 min read

How to Make People Vanish in Real‑Time Using TensorFlow.js and MobileNet

iQIYI Technical Product Team

Apr 3, 2020 · Artificial Intelligence

iCartoonFace Challenge: Cartoon Face Detection and Recognition Competition

The iCartoonFace Challenge invites participants to develop efficient algorithms for detecting and recognizing cartoon faces using large, meticulously annotated datasets—50,000 images for detection and nearly 390,000 for recognition—while meeting strict model size and latency limits and submitting detailed methods and code.

AI competitionCartoon Face RecognitionData Set

0 likes · 6 min read

iCartoonFace Challenge: Cartoon Face Detection and Recognition Competition

JD Retail Technology

Apr 2, 2020 · Artificial Intelligence

How Deep Learning Powers Text Detection in E‑commerce Posters

This article surveys state‑of‑the‑art deep‑learning techniques for scene text detection and recognition in e‑commerce poster images, detailing models such as CTPN, TextBoxes, SegLink, EAST, and end‑to‑end frameworks, and discusses their architectures, strengths, limitations, and future challenges.

computer visiondeep learninge-commerce

0 likes · 16 min read

How Deep Learning Powers Text Detection in E‑commerce Posters

Alibaba Cloud Developer

Mar 25, 2020 · Artificial Intelligence

How 3D Synthetic Data Supercharges AI Vision for Smart Vending Machines

This article explains how Alibaba's Alipay visual vending cabinet leverages 3D synthetic data generation—covering full‑material 3D reconstruction, parametric scene modeling, and photo‑realistic rendering—to rapidly produce high‑quality training images, dramatically cutting cost and accelerating AI model deployment.

3D synthesisAI training dataData Generation

0 likes · 10 min read

How 3D Synthetic Data Supercharges AI Vision for Smart Vending Machines

Amap Tech

Mar 23, 2020 · Artificial Intelligence

Satellite Imagery for Map Data Updating: Key Elements, Semantic Segmentation Techniques, and Future Challenges

Gaode leverages high‑resolution satellite imagery as an active discovery tool for map updates, extracting road, region and building elements through advanced semantic segmentation networks (U‑Net, ASPP, attention, non‑local) and instance‑segmentation pipelines, to accelerate accurate road‑network and building‑block data refreshes while addressing future scalability challenges.

Satellite ImagerySemantic SegmentationU-Net

0 likes · 11 min read

Satellite Imagery for Map Data Updating: Key Elements, Semantic Segmentation Techniques, and Future Challenges

Alibaba Cloud Developer

Mar 10, 2020 · Artificial Intelligence

Can Frequency‑Domain Learning Boost Image Inference Efficiency?

This article presents a system‑level approach that performs deep‑learning inference directly on JPEG frequency components, uses a gating mechanism to select important DCT coefficients, and demonstrates higher accuracy with far lower bandwidth for image classification and instance segmentation tasks.

Bandwidth Reductioncomputer visiondeep learning

0 likes · 22 min read

Can Frequency‑Domain Learning Boost Image Inference Efficiency?

HomeTech

Mar 4, 2020 · Artificial Intelligence

Video Multi-Label Classification Using Graph Convolutional Networks

This paper introduces a method for video multi-label classification that incorporates label correlation features using graph convolutional networks, significantly improving classification performance.

GCNInceptionV3NeXtVLAD

0 likes · 7 min read

Video Multi-Label Classification Using Graph Convolutional Networks

Alibaba Cloud Developer

Feb 25, 2020 · Artificial Intelligence

How Attribute‑Specific Embedding Networks Revolutionize Fashion Copyright Protection

A new AI algorithm jointly developed by Alibaba Security and Zhejiang University learns fine‑grained, attribute‑aware similarity embeddings for fashion images, enabling accurate detection of local design plagiarism and improving retrieval performance across multiple benchmark datasets.

attribute embeddingcomputer visioncopyright protection

0 likes · 14 min read

How Attribute‑Specific Embedding Networks Revolutionize Fashion Copyright Protection

UCloud Tech

Feb 20, 2020 · Artificial Intelligence

How UCloud’s AI Mask Detection Service Reaches 99% Accuracy in One Week

This article explains how UCloud’s AI team leveraged the UAI‑Train and UAI‑Inference platforms to develop, train, and deploy a high‑accuracy face‑mask detection service within a week, detailing the algorithmic approach, challenges, deployment pipeline, and real‑world applications.

AIUAI-InferenceUAI-Train

0 likes · 10 min read

How UCloud’s AI Mask Detection Service Reaches 99% Accuracy in One Week

DataFunTalk

Feb 13, 2020 · Artificial Intelligence

Deep Learning Techniques and Challenges in Autonomous Driving

This article reviews the rapid development of deep learning, its pivotal role in autonomous driving, outlines end‑to‑end perception‑to‑control pipelines, discusses the strengths and limitations of deep models, and proposes practical strategies such as task decomposition, multi‑method fusion, and sensor integration to improve safety and interpretability.

End-to-Endautonomous drivingcomputer vision

0 likes · 8 min read

Deep Learning Techniques and Challenges in Autonomous Driving

ITPUB

Jan 14, 2020 · Artificial Intelligence

Top 2019 AI Papers Loved by Reddit Users: Key Insights and Links

A curated collection of Reddit‑highlighted 2019 AI research papers, covering theoretical advances, computer‑vision breakthroughs, unsupervised learning methods, and time‑series forecasting, with summaries, key contributions, and direct links to each paper.

AIMeta LearningResearch Papers

0 likes · 6 min read

Top 2019 AI Papers Loved by Reddit Users: Key Insights and Links

Python Programming Learning Circle

Jan 10, 2020 · Artificial Intelligence

How to Correct Skewed Text in Images Using OpenCV: A Step‑by‑Step Guide

This tutorial explains how to detect, calculate, and correct the rotation angle of text in an image using OpenCV, covering image binarization, minimum‑area bounding box extraction, angle adjustment, and affine transformation with clear Python code examples.

Image processingPythoncomputer vision

0 likes · 3 min read

How to Correct Skewed Text in Images Using OpenCV: A Step‑by‑Step Guide

Alibaba Cloud Developer

Jan 10, 2020 · Artificial Intelligence

How AI Powers Ground Marker Recognition for High‑Precision Maps

This article details the evolution of ground‑marker recognition technology in high‑precision maps, covering challenges of diverse and worn markings, traditional segmentation methods, deep‑learning breakthroughs such as R‑FCN, cascade detectors, corner‑point detection, semantic segmentation, PAnet, and 3‑D point‑cloud approaches, and their impact on accuracy and production efficiency.

computer visiondeep learningground marker recognition

0 likes · 17 min read

How AI Powers Ground Marker Recognition for High‑Precision Maps

iQIYI Technical Product Team

Jan 9, 2020 · Artificial Intelligence

Results and Winning Solutions of the 2019 CCF Big Data & Computing Intelligence Contest – Video Copyright Detection Track

The 2019 CCF Big Data & Computing Intelligence Contest’s Video Copyright Detection track, judged by iQIYI, saw 705 teams from 25 countries compete, with Hengyang Data’s VGG‑16‑based solution winning, followed by Boyun Vision, Xiao Jia’s Lao Liang, Hulu Brothers and Beihang University, showcasing diverse deep‑learning and unsupervised approaches for robust video copyright detection.

CCF Contestartificial-intelligencecomputer vision

0 likes · 9 min read

Results and Winning Solutions of the 2019 CCF Big Data & Computing Intelligence Contest – Video Copyright Detection Track

Alibaba Cloud Developer

Jan 3, 2020 · Artificial Intelligence

How Alibaba’s DAMO Lab Revolutionizes Image Cutout with AI‑Powered Matting

Alibaba's DAMO Academy details its AI‑driven image cutout system, describing why automated matting is needed, the four‑module pipeline (filtering, classification, detection, segmentation), architectural innovations such as dual decoders and fusion networks, and how these advances enable product‑level batch background removal.

AIAlibabacomputer vision

0 likes · 9 min read

How Alibaba’s DAMO Lab Revolutionizes Image Cutout with AI‑Powered Matting

Alibaba Cloud Developer

Jan 2, 2020 · Artificial Intelligence

How Alibaba’s DAMO Lab Revolutionizes Image Cutout with AI‑Powered Matting

Alibaba's DAMO Academy presents an AI‑driven image cutout system that combines filtering, classification, detection, and advanced segmentation to automate high‑precision matting, improve design efficiency, and unlock new commercial opportunities across e‑commerce and media industries.

AI mattingAlibabacomputer vision

0 likes · 8 min read

Tencent Cloud Developer

Dec 26, 2019 · Artificial Intelligence

WeChat Scan-to-Identify (Scan Object) Feature: Overview, Technical Architecture, Data Construction, and Algorithmic Advances

WeChat’s iOS Scan‑to‑Identify feature lets users point a camera at any product or scene to instantly retrieve related e‑commerce, encyclopedia or news content, using a four‑pipeline architecture that builds massive annotated and deduplicated databases, advanced RetinaNet‑based detection, multi‑task metric learning, and scalable training, deployment and scheduling platforms, with plans to extend into domains like facial, vehicle and plant recognition.

AIWeChatcomputer vision

0 likes · 34 min read

WeChat Scan-to-Identify (Scan Object) Feature: Overview, Technical Architecture, Data Construction, and Algorithmic Advances

Tencent Cloud Developer

Dec 19, 2019 · Artificial Intelligence

AI-Powered Content Moderation: How Platforms Combat Harmful Content with AI

AI-powered moderation tools now scan text, images, live streams, and short videos, using techniques like TextCNN, Word2Vec, attention‑based classifiers, multi‑label sampling, and real‑time audio analysis to detect pornographic and harmful content, while emphasizing continual model updates and sample collection for both small and large platforms.

AI detectionTencent Securitycomputer vision

0 likes · 12 min read

AI-Powered Content Moderation: How Platforms Combat Harmful Content with AI

MaGe Linux Operations

Dec 19, 2019 · Artificial Intelligence

How to Build a Vehicle License Plate Recognition System with Python and OpenCV

This article introduces a complete vehicle license‑plate detection and recognition pipeline—covering image preprocessing, ROI extraction, character segmentation, SVM‑based classification, and a PyQt5 GUI—while also discussing code structure, demo results, and future improvements.

PyQt5Pythoncomputer vision

0 likes · 5 min read

How to Build a Vehicle License Plate Recognition System with Python and OpenCV

Amap Tech

Dec 13, 2019 · Artificial Intelligence

Image Segmentation for High-Definition Mapping: Evolution and Practices at Gaode Maps

Gaode Maps has progressed image segmentation from early heuristic region splitting to modern deep‑learning pipelines—leveraging FCNs, multi‑task networks, Mask R‑CNN, and specialized losses—to achieve centimeter‑level, instance‑aware mapping of roads, signs, and small objects while pursuing lighter, real‑time models.

AIGaode MapsSemantic Segmentation

0 likes · 14 min read

Image Segmentation for High-Definition Mapping: Evolution and Practices at Gaode Maps

Xianyu Technology

Dec 11, 2019 · Artificial Intelligence

Improving Small Object Detection for UI2CODE via Data Augmentation and Model Optimization

The study enhances UI2CODE’s ability to detect tiny UI components by augmenting training data with copied small objects, upgrading the detector from Faster RCNN to FPN and Cascade FPN, and refining box positions with smoothing and projection, achieving superior small‑object mAP/mAR and enabling broader UI parsing applications.

Data AugmentationFPNModel Optimization

0 likes · 9 min read

Improving Small Object Detection for UI2CODE via Data Augmentation and Model Optimization

Qunar Tech Salon

Dec 10, 2019 · Artificial Intelligence

Comprehensive Overview of Face Detection Methods and Techniques

This article provides an in‑depth review of face detection, covering traditional knowledge‑, model‑, feature‑ and appearance‑based approaches, modern deep‑learning methods such as cascade CNN, MTCNN and Facebox, strategies for handling multi‑scale faces, anchor‑box densification, and practical training considerations.

CNNCascade CNNMTCNN

0 likes · 10 min read

Comprehensive Overview of Face Detection Methods and Techniques

Alibaba Cloud Developer

Dec 5, 2019 · Artificial Intelligence

Mastering Object Detection: From R-CNN to YOLO and Real-World AI Applications

This article introduces the fundamentals of object detection, explains key models such as R-CNN, Fast R-CNN, Faster R-CNN, YOLO, and SSD, and showcases how Alibaba's AI technology is applied to photovoltaic quality inspection to boost efficiency and accuracy in industry.

AIFast R-CNNFaster R-CNN

0 likes · 8 min read

Mastering Object Detection: From R-CNN to YOLO and Real-World AI Applications

iQIYI Technical Product Team

Nov 22, 2019 · Artificial Intelligence

Analysis of ICCV 2019 Lightweight Face Recognition Challenge Champion Solutions

The ICCV 2019 Lightweight Face Recognition Challenge attracted 292 teams and defined four strict FLOP‑ and size‑limited protocols for image and video recognition, with champions employing near‑30 GFLOP EfficientNet‑style backbones, novel loss functions, frame‑fusion, and knowledge‑distilled VarGNet models to balance accuracy and computational budget.

ICCV ChallengeLightweight Face Recognitioncomputer vision

0 likes · 8 min read

Analysis of ICCV 2019 Lightweight Face Recognition Challenge Champion Solutions

Alibaba Cloud Developer

Nov 19, 2019 · Artificial Intelligence

How Visual AI Powers Real-World Mapping and AR Navigation at Amap

This article explains how Amap leverages computer vision to collect, process, and enhance map data and to deliver low‑cost, real‑time AR navigation, detailing the technical challenges, algorithmic solutions, and the broader mission of connecting the physical world.

AIAR navigationMapping

0 likes · 12 min read

How Visual AI Powers Real-World Mapping and AR Navigation at Amap

MaGe Linux Operations

Nov 15, 2019 · Artificial Intelligence

How AI Video Walls Are Transforming Indian Prisons: Inside the JARVIS Surveillance System

India’s prisons are adopting AI-powered video walls and facial‑recognition systems, such as Staqu’s JARVIS platform, to monitor inmate activity, improve security, and generate revenue, while confronting overcrowding, staffing shortages, and violent incidents, illustrating a global shift toward smart‑prison technology.

AI surveillanceIndiaJARVIS

0 likes · 9 min read

How AI Video Walls Are Transforming Indian Prisons: Inside the JARVIS Surveillance System

DataFunTalk

Nov 14, 2019 · Artificial Intelligence

Sample Imbalance and Importance in Object Detection: IoU‑Balanced Sampling and Prime Sample Attention

The talk analyzes sample imbalance and importance in object detection, proposes IoU‑balanced negative sampling and instance‑balanced positive sampling, introduces the Prime Sample concept with Hierarchical Local Rank, and presents Importance‑based Sample Reweighting and Classification‑Aware Regression Loss, achieving consistent mAP gains without extra overhead.

IoU-balanced samplingcomputer visionhard mining

0 likes · 22 min read

Sample Imbalance and Importance in Object Detection: IoU‑Balanced Sampling and Prime Sample Attention

Amap Tech

Nov 14, 2019 · Artificial Intelligence

Technical Evolution of Ground Marking Recognition for High‑Precision Maps

AMap’s ground‑marking recognition has progressed from simple threshold methods to advanced deep‑learning pipelines—including two‑stage R‑FCN, cascade detectors with local regression, corner‑point and segmentation hybrids, and LiDAR‑based 3‑D PointRCNN—achieving over 99 % recall and sub‑5 cm positional accuracy for high‑precision map production.

computer visiondeep learningground marking

0 likes · 15 min read

Technical Evolution of Ground Marking Recognition for High‑Precision Maps

Baidu App Technology

Oct 30, 2019 · Artificial Intelligence

Applying Deep Learning and AI on Mobile: Baidu App Cases and Technical Insights

The Baidu App team showcases how deep‑learning and AI can be deployed on mobile through on‑device and server‑side inference—illustrated by plant‑identification, stylized filters, video subject detection, and AR real‑time translation—while addressing model compression, cross‑platform optimization, and offering a practical guide for engineers.

AR Translationcomputer visiondeep learning

0 likes · 11 min read

Applying Deep Learning and AI on Mobile: Baidu App Cases and Technical Insights

Amap Tech

Oct 23, 2019 · Artificial Intelligence

AR Navigation Lane Detection: Methods, Challenges, and Practical Solutions

The article reviews AR navigation lane‑detection, comparing traditional handcrafted visual pipelines with modern deep‑learning segmentation approaches, proposes an efficient multitask network with weight‑allocation and vanishing‑point anchoring, and demonstrates quantized models achieving real‑time, stable performance on low‑power automotive chips while outlining remaining weather, lighting, and road‑condition challenges.

ADASAR navigationcomputer vision

0 likes · 16 min read

AR Navigation Lane Detection: Methods, Challenges, and Practical Solutions

Amap Tech

Oct 16, 2019 · Artificial Intelligence

Visual Intelligence Connecting the Real World – Amap’s Mapping and AR Navigation Technologies

Amap leverages large‑scale visual intelligence—camera‑captured imagery, AI‑driven road‑sign and POI recognition, and compressed AR navigation overlays—to automate map creation, enhance real‑time positioning, and deliver richer travel experiences for its billion‑scale user base.

AIAR navigationGeospatial

0 likes · 13 min read

Visual Intelligence Connecting the Real World – Amap’s Mapping and AR Navigation Technologies

Huawei Cloud Developer Alliance

Oct 15, 2019 · Artificial Intelligence

How ModelArts Powers AI Development and Seamless Edge‑Cloud Deployment

This article reviews Huawei's ModelArts platform, detailing its data processing, algorithm development, high‑performance training, edge‑cloud model deployment, auto‑learning capabilities, and real‑world use cases such as invisible payment and intelligent waste classification, while outlining future ecosystem prospects.

AI platformAutoMLModelArts

0 likes · 14 min read

How ModelArts Powers AI Development and Seamless Edge‑Cloud Deployment

DataFunTalk

Sep 29, 2019 · Artificial Intelligence

UC Information Flow Video Tag Recognition: System Architecture and Multi‑Modal Algorithms

This article presents a comprehensive overview of UC's information‑flow video tag recognition technology, detailing tag usage scenarios, the end‑to‑end system architecture, multi‑modal feature extraction, advanced deep‑learning models such as NextVlad, behavior and person tagging methods, and future research directions.

Multimodal LearningRecommendation Systemscomputer vision

0 likes · 14 min read

UC Information Flow Video Tag Recognition: System Architecture and Multi‑Modal Algorithms

Meituan Technology Team

Sep 26, 2019 · Artificial Intelligence

Efficient Scene Text Detection Framework with Feature Pyramid and Expanded High-Level Feature Maps

The paper presents an efficient scene‑text detector that expands high‑level SSD feature maps and integrates a feature‑pyramid network, using direction‑aware segment‑and‑link predictions to reconstruct arbitrarily long, rotated text, achieving higher recall and precision with real‑time speed and outperforming recent methods on ICDAR benchmarks and a menu‑recognition test.

ICDARSSDScene Text Detection

0 likes · 12 min read

Efficient Scene Text Detection Framework with Feature Pyramid and Expanded High-Level Feature Maps

Didi Tech

Sep 20, 2019 · Mobile Development

How Didi Maps Engineered Scalable AR Navigation for Airports and Malls

Didi Maps' chief engineer explains how the team tackled weak GPS signals in large indoor venues by building a 60,000‑square‑meter 3D map, achieving sub‑0.5 m monocular visual localization, and fusing inertial data with Google ARCore to deliver real‑time AR navigation on Android devices.

AR navigationDidi MapsGoogle ARCore

0 likes · 5 min read

How Didi Maps Engineered Scalable AR Navigation for Airports and Malls

Tencent Cloud Developer

Sep 19, 2019 · Artificial Intelligence

Inside Tencent Cloud OCR: Architecture, Performance, and Integration Guide

The article provides a comprehensive overview of Tencent Cloud’s OCR platform, detailing its service architecture, product capabilities, integration methods, performance metrics, engineering improvements, testing automation, and operational considerations, offering developers practical insights into building and deploying OCR solutions on the cloud.

OCRService ArchitectureTencent Cloud

0 likes · 10 min read

Inside Tencent Cloud OCR: Architecture, Performance, and Integration Guide

Alibaba Cloud Developer

Sep 18, 2019 · Artificial Intelligence

Mastering Video Object Segmentation: 3 Research Paths & Alibaba’s Latest Advances

This article explains video object segmentation, outlines the three main research directions—semi‑supervised, interactive, and unsupervised—describes Alibaba’s Moku Lab breakthroughs and competition results, and discusses future plans to improve segmentation in complex scenes.

Alibaba Researchcomputer visioninteractive segmentation

0 likes · 12 min read

Mastering Video Object Segmentation: 3 Research Paths & Alibaba’s Latest Advances

Xianyu Technology

Sep 12, 2019 · Artificial Intelligence

Deep Learning for Automated Module Detection in Taobao 99 Promotion Pages

This study presents a deep‑learning pipeline that employs a Cascade‑RCNN with Feature Pyramid Network to automatically detect and refine modules and their internal elements on Taobao’s 99‑promotion pages, achieving roughly 98 % precision and recall on a thousand‑image validation set and paving the way for broader e‑commerce event applications.

Cascade R-CNNTaobaocomputer vision

0 likes · 7 min read

Deep Learning for Automated Module Detection in Taobao 99 Promotion Pages

Youku Technology

Aug 19, 2019 · Artificial Intelligence

Alibaba Showcases AI Innovations in Entertainment and Security at IJCAI 2019

At IJCAI 2019, Alibaba’s MoKu Lab unveiled the Beidou Star platform and an intelligent conversational video search system for end‑to‑end content creation, while its Turing Lab demonstrated security AI such as Green Net, IP Brain, facial‑recognition and Tianyan, complemented by multiple research papers, academic collaborations and new hiring drives.

AIAlibabaEntertainment

0 likes · 11 min read

Alibaba Showcases AI Innovations in Entertainment and Security at IJCAI 2019

Youku Technology

Aug 14, 2019 · Artificial Intelligence

Technical Analysis of “Chang'an” – The Beidou Star System for Reducing Content Uncertainty and Boosting Hit Potential

The talk details how Youku’s Beidou Star AI platform deconstructs the drama “Chang’an Twelve Hours” with NLP, computer‑vision, knowledge graphs and multi‑task deep models to quantify script, character and emotion uncertainty, enabling predictive scoring that lifted the series’ daily index above one million and outlines future hybrid decision‑engine research.

AIContent AnalyticsMedia Prediction

0 likes · 12 min read

Technical Analysis of “Chang'an” – The Beidou Star System for Reducing Content Uncertainty and Boosting Hit Potential

Tencent Cloud Developer

Aug 12, 2019 · Artificial Intelligence

Build Your Own Fatigue‑Detection System with Tencent VisionSeed and STM32

Learn how to create a DIY driver fatigue detection device by integrating Tencent Youtu VisionSeed AI vision module with an STM32 microcontroller, covering hardware assembly, UART communication, algorithm selection, and real‑time alert generation using facial key‑point analysis.

AI visionDIY hardwareEmbedded AI

0 likes · 8 min read

Build Your Own Fatigue‑Detection System with Tencent VisionSeed and STM32

Alibaba Cloud Developer

Aug 8, 2019 · Artificial Intelligence

Alibaba VOS Innovations: Semi-supervised, Interactive & Unsupervised Segmentation

Video Object Segmentation (VOS) is essential for content creation, and Alibaba’s research outlines three main approaches—semi-supervised, interactive, and unsupervised—detailing their algorithms, challenges, evaluation metrics, recent breakthroughs, and future plans to improve accuracy in complex scenes.

AIInteractivecomputer vision

0 likes · 12 min read

Alibaba VOS Innovations: Semi-supervised, Interactive & Unsupervised Segmentation

Youku Technology

Jul 31, 2019 · Artificial Intelligence

Exploring the Three Key Research Directions in Video Object Segmentation

The article outlines video object segmentation (VOS), its importance for content creation, and details the three primary research avenues—semi‑supervised, interactive, and unsupervised—while reviewing benchmark metrics, algorithm categories, challenges, and recent advances from Alibaba’s MoKu Lab, including their competition results and future plans.

AIInteractivecomputer vision

0 likes · 14 min read

Exploring the Three Key Research Directions in Video Object Segmentation

iQIYI Technical Product Team

Jul 26, 2019 · Artificial Intelligence

Preface

In the 2019 iQIYI Celebrity Video Identification Challenge, our team secured fifth place by accurately recognizing video identities using mAP scoring, and this article shares the strategies, insights, and experiences of the top‑five teams, emphasizing a straightforward, pragmatic approach championed by iQIYI’s technology product team.

AITechnical Reportcomputer vision

0 likes · 5 min read

Amap Tech

Jul 23, 2019 · Artificial Intelligence

Traffic Sign Detection in Gaode Maps: Machine Learning Techniques and System Architecture

Gaode Maps uses a two-stage machine‑learning pipeline (Faster‑RCNN with shape‑based region proposal networks and fine‑grained classifiers) to detect hundreds of traffic‑sign types in billions of street‑view images, achieving high recall and precision, scalable updates, and near‑real‑time map data refresh.

AIFaster R-CNNGaode

0 likes · 11 min read

Traffic Sign Detection in Gaode Maps: Machine Learning Techniques and System Architecture

iQIYI Technical Product Team

Jul 5, 2019 · Artificial Intelligence

iQIYI Multimodal Person Recognition Competition: 91.14% Accuracy Achieved by BUPT Team

After a three‑month contest co‑hosted by iQIYI and ACM MM, 255 teams competed on the challenging iQIYI‑VID‑2019 multimodal dataset, and the BUPT Automation School team won with a 91.14% person‑recognition accuracy, advancing the field and enhancing iQIYI’s video recommendation and AI services.

AI competitionAccuracycomputer vision

0 likes · 6 min read

iQIYI Multimodal Person Recognition Competition: 91.14% Accuracy Achieved by BUPT Team

Alibaba Cloud Developer

Jun 28, 2019 · Artificial Intelligence

Alibaba AI Wins Visual Dialogue Challenge with New Recursive Model

In the second Visual Dialogue Challenge, Alibaba’s AI outperformed ten teams—including Microsoft and Seoul University—achieving a 74.57% accuracy, surpassing the previous record by 16.82% and exceeding human performance, thanks to its novel recursive exploration dialogue model that integrates image recognition, relational reasoning, and natural language understanding.

AIcomputer visionnatural language processing

0 likes · 4 min read

Alibaba AI Wins Visual Dialogue Challenge with New Recursive Model

Alibaba Cloud Developer

Jun 12, 2019 · Artificial Intelligence

How YOLOv3 Boosts Video Content Advertising on Youku: A Real‑World Case Study

By integrating YOLOv3 video object detection into Youku’s ad platform, the team replaced traditional subtitle‑based and scene‑based placements with precise object‑level targeting, achieving higher relevance, expanded inventory, and a 20% click‑through increase despite 3.5× higher exposure.

YOLOv3computer visioncontent recommendation

0 likes · 14 min read

How YOLOv3 Boosts Video Content Advertising on Youku: A Real‑World Case Study

iQIYI Technical Product Team

May 30, 2019 · Mobile Development

SmileAR: iQIYI’s Mobile AR Solution Powered by TensorFlow Lite

SmileAR, iQIYI’s self‑developed mobile AR platform powered by TensorFlow Lite, delivers real‑time face, body and gesture recognition across iQIYI’s apps through MobileNet‑based models, quantization‑aware training, multi‑task learning and encrypted SDKs, achieving fast, lightweight, cross‑platform AR experiences for millions of users.

ARCross-PlatformModel Optimization

0 likes · 10 min read

SmileAR: iQIYI’s Mobile AR Solution Powered by TensorFlow Lite

Youku Technology

May 29, 2019 · Artificial Intelligence

Youku Video Enhancement and Super-Resolution Competition Announcement

The Youku Video Enhancement and Super‑Resolution Challenge invites teams to develop models that restore low‑resolution, noisy video to high‑definition quality using a 10,000‑pair industry dataset, offering up to RMB 100,000 in prizes and a recruitment pathway, with registration open through June 16 and competition phases spanning May to September.

AI competitionYoukucomputer vision

0 likes · 10 min read

Youku Video Enhancement and Super-Resolution Competition Announcement

Youku Technology

May 20, 2019 · Artificial Intelligence

Youku Video Enhancement and Super‑Resolution Competition Overview

The Youku Video Enhancement and Super‑Resolution Competition challenges teams of up to five to develop 4× upscaling models that also remove noise and compression artifacts, using a 10,000‑pair dataset, with prizes up to ¥100,000 and recruitment opportunities, running from May to September 2019.

AI competitionYoukucomputer vision

0 likes · 9 min read

Youku Video Enhancement and Super‑Resolution Competition Overview

DataFunTalk

May 14, 2019 · Artificial Intelligence

A Comprehensive Overview of Image Search Technology: Frameworks, Evolution, and System Architecture

This article provides a thorough introduction to image‑search technology, covering its general framework, offline and online components, feature‑extraction evolution, retrieval engine structures, and architectural challenges such as dynamic indexing, feature synchronization, and high‑throughput low‑latency serving.

computer visionfeature extractionimage search

0 likes · 12 min read

A Comprehensive Overview of Image Search Technology: Frameworks, Evolution, and System Architecture

Youku Technology

May 13, 2019 · Artificial Intelligence

How Youku Tackles Multimodal Video Understanding and Quality Control

This article outlines Youku's multimodal video content understanding pipeline, covering business needs, problem decomposition, data construction, model selection, OCR subtitle extraction, scene and action recognition, sample augmentation, noise handling, and multimodal fusion strategies for robust content moderation.

AIOCRaction recognition

0 likes · 11 min read

How Youku Tackles Multimodal Video Understanding and Quality Control

DataFunTalk

May 8, 2019 · Artificial Intelligence

Perception System Overview: Sensors, Fusion, Onboard Architecture, and Technical Challenges in Autonomous Driving

This article presents a comprehensive overview of autonomous driving perception, covering system fundamentals, sensor setups and fusion techniques, onboard processing architecture, and the key technical challenges such as precision‑recall balance, adverse weather, and small‑object detection.

Perceptionautonomous drivingcomputer vision

0 likes · 12 min read

Perception System Overview: Sensors, Fusion, Onboard Architecture, and Technical Challenges in Autonomous Driving

Youku Technology

May 6, 2019 · Artificial Intelligence

Exploring Intelligent Production at Youku: AI‑Driven Video Analysis and Automation

The talk describes Youku’s intelligent production platform, which uses AI and cloud computing to automatically analyze video frames, extract fine‑grained metadata such as scenes, persons, actions and scores, and then generate highlights, vertical clips, annotations and feedback for editors and upstream producers, while addressing challenges like pose‑tracking, graph‑based action classification and future plans for deeper video understanding and open competitions.

AIcomputer visionimage search

0 likes · 14 min read

Exploring Intelligent Production at Youku: AI‑Driven Video Analysis and Automation

58 Tech

May 6, 2019 · Artificial Intelligence

Practice of Image Feature Extraction and Its Applications in Retrieval and Quality Assessment

This article summarizes a team's practical experience with various image feature extraction methods—including global, local, and CNN features—and demonstrates their use in image retrieval and no‑reference quality assessment through extensive experiments and analysis.

CNNSIFTcomputer vision

0 likes · 13 min read

Practice of Image Feature Extraction and Its Applications in Retrieval and Quality Assessment

Youku Technology

Apr 29, 2019 · Artificial Intelligence

Precise and Fast Object Segmentation Algorithms – Talk by Ren Haibing (Youku Cognitive Lab)

Ren Haibing’s Youku Cognitive Lab talk reviews object segmentation’s motivation, explains semantic and instance concepts, presents UNet‑based and category‑agnostic methods—including fast video segmentation with motion cues—and reports high IoU results while outlining future edge‑aware, label‑free, and non‑online video segmentation research directions.

AIcategory-agnosticcomputer vision

0 likes · 19 min read

Precise and Fast Object Segmentation Algorithms – Talk by Ren Haibing (Youku Cognitive Lab)

NetEase Media Technology Team

Apr 26, 2019 · Artificial Intelligence

Intelligent Cover Image Selection System for News Articles: Image Quality Assessment and Smart Cropping

The article describes an intelligent cover‑image selection system for NetEase News that automatically filters unsuitable illustrations, assesses image quality with a pairwise‑trained deep model across clarity, color and composition, and smartly crops images using aspect‑ratio‑aware object detection, dramatically cutting manual editing and enabling confidence‑based automatic publishing.

Image CroppingNeural Networkcomputer vision

0 likes · 11 min read

Intelligent Cover Image Selection System for News Articles: Image Quality Assessment and Smart Cropping

Tencent Cloud Developer

Apr 19, 2019 · Artificial Intelligence

Tencent Cloud Face Recognition Technology: Products, Architecture, and Industry Applications

The article outlines Tencent Cloud’s face‑recognition technology—from its deep‑learning‑based algorithm training and multi‑layer system architecture, through the YouTu Lab‑powered product suite for detection, analysis, comparison, liveness and search, to real‑world deployments in security, metro transportation and retail, highlighting integration challenges and performance optimizations.

AI productsSmart RetailSmart Transportation

0 likes · 18 min read

Tencent Cloud Face Recognition Technology: Products, Architecture, and Industry Applications

HomeTech

Apr 18, 2019 · Artificial Intelligence

An Overview of Image Processing Techniques and Common Tools for Beginners

This article provides a concise introduction to image processing, covering its hierarchical structure, fundamental techniques such as classification, detection, segmentation, geometric transformation, and the most widely used libraries and deep‑learning frameworks for newcomers.

Image processingcomputer visionimage classification

0 likes · 9 min read

An Overview of Image Processing Techniques and Common Tools for Beginners

Tencent Cloud Developer

Apr 16, 2019 · Artificial Intelligence

Building Image Recognition Systems: From Basics to Advanced AI Techniques

This article summarizes a computer‑vision salon where Dr. Ji Yongnan explains imaging pipelines, traditional feature‑based methods, deep‑learning breakthroughs, Tencent Cloud AI services, real‑world case studies, and answers audience questions about machine‑vision versus computer‑vision and data‑scarcity challenges.

AI ApplicationsSegmentationcomputer vision

0 likes · 18 min read

Building Image Recognition Systems: From Basics to Advanced AI Techniques

Youku Technology

Apr 11, 2019 · Artificial Intelligence

YOUKU-VSRE 2019 Video Enhancement and Super-Resolution Challenge Announcement

The YOUKU‑VSRE 2019 challenge invites researchers to develop state‑of‑the‑art video enhancement and super‑resolution models using the largest, most diverse simulated‑noise dataset, with three competition stages (preliminary, semi‑final, final), cash prizes up to ¥100,000, certificates, and fast‑track recruitment opportunities at Alibaba (Youku).

AI challengecompetitioncomputer vision

0 likes · 3 min read

YOUKU-VSRE 2019 Video Enhancement and Super-Resolution Challenge Announcement

Didi Tech

Mar 28, 2019 · Artificial Intelligence

Overview of the CVPR 2019 WAD Autonomous Driving Challenge and Participation Details

The CVPR 2019 WAD Autonomous Driving Challenge, hosted in Long Beach, introduces four new tasks—including object‑detection and tracking transfer‑learning tracks using Didi’s massive D²‑City and Berkeley’s BDD100K datasets, plus a large‑scale detection interpolation track—aimed at advancing vision algorithms under diverse, difficult driving conditions, with global teams invited to register by May 31 and winners announced at the workshop on June 17.

AIchallengecomputer vision

0 likes · 6 min read

Overview of the CVPR 2019 WAD Autonomous Driving Challenge and Participation Details

Beike Product & Technology

Mar 21, 2019 · Artificial Intelligence

Optimization Foundations and Applications in Machine Learning and Computer Vision

This article introduces how machine learning problems are formulated as optimization tasks, explains the construction of objective functions with examples such as linear regression, robust fitting, regularization, and demonstrates various applications ranging from K‑means clustering to image inpainting and 3D reconstruction.

OptimizationRegularizationcomputer vision

0 likes · 9 min read

Optimization Foundations and Applications in Machine Learning and Computer Vision

DataFunTalk

Mar 15, 2019 · Artificial Intelligence

A Comprehensive Overview of Deep Learning Applications in Computer Vision

This article provides an extensive review of deep learning techniques applied to computer vision, covering the evolution of CNN architectures, image and video processing tasks, 2.5‑D and 3‑D reconstruction, object detection, segmentation, tracking, SLAM, and various practical applications such as AR, content retrieval, and autonomous driving.

CNNImage processingSLAM

0 likes · 22 min read

A Comprehensive Overview of Deep Learning Applications in Computer Vision

System Architect Go

Mar 14, 2019 · Artificial Intelligence

Understanding Image Similarity: Image Hashing and Feature-Based Methods

This article explains why simple MD5 checks cannot assess image similarity and introduces two major approaches—image hashing and image feature extraction—detailing their algorithms, practical performance, and how to compare images efficiently using Hamming distance and indexing techniques.

Hamming distancecomputer visionfeature extraction

0 likes · 7 min read

Understanding Image Similarity: Image Hashing and Feature-Based Methods

Alibaba Cloud Developer

Mar 12, 2019 · Artificial Intelligence

How AI and RFID Combine to Track Customer‑Product Interactions in Retail

This article presents a comprehensive AI‑driven framework that fuses video‑based customer action detection, RFID‑based product flip detection, and bipartite graph matching to accurately determine when, where, and which customer interacts with which SKU in a retail environment, discussing algorithms, optimizations, and experimental results.

AICustomer BehaviorRFID

0 likes · 22 min read

How AI and RFID Combine to Track Customer‑Product Interactions in Retail

JD Tech

Mar 8, 2019 · Artificial Intelligence

Integrated Engineering & Algorithm Platform for AI Visual Applications

This article describes a comprehensive, end‑to‑end AI visual algorithm platform that unifies data collection, annotation, model training, deployment, testing, quality evaluation, and service gateways, illustrating how such integration improves transparency, efficiency, and quality across use cases like background removal, face swapping, and clothing recommendation.

AIAlgorithm PlatformClothing Recommendation

0 likes · 13 min read

Integrated Engineering & Algorithm Platform for AI Visual Applications

Hulu Beijing

Mar 7, 2019 · Artificial Intelligence

From AlexNet to ResNeXt: Key Milestones in CNN Evolution

This article traces the evolution of convolutional neural networks from the pioneering AlexNet through VGG, Inception, ResNet, Inception‑v4, Inception‑ResNet and ResNeXt, highlighting architectural innovations, performance gains, and the underlying biological inspirations that shaped modern deep learning models.

AlexNetCNNInception

0 likes · 13 min read

From AlexNet to ResNeXt: Key Milestones in CNN Evolution

21CTO

Mar 4, 2019 · Artificial Intelligence

How to Spot AI‑Generated Fake Faces: Tips, Tricks, and the Tech Behind StyleGAN

This article explains why AI‑generated faces from StyleGAN are hard to distinguish, introduces an online game for testing realism, and provides practical visual cues—such as water spots, background errors, asymmetric glasses, hair artifacts, and teeth anomalies—to reliably identify fake images.

AI-generated imagesGaNStyleGAN

0 likes · 8 min read

How to Spot AI‑Generated Fake Faces: Tips, Tricks, and the Tech Behind StyleGAN

Ctrip Technology

Feb 28, 2019 · Artificial Intelligence

OCR Techniques and Solutions for Ctrip Business: Deep Learning Based Text Detection and Recognition

This article presents an overview of computer‑vision based OCR in Ctrip's operations, detailing deep‑learning text detection methods for controlled and uncontrolled scenarios, sequence‑based recognition models, training strategies with synthetic data, and performance results, while discussing current challenges and future improvements.

AICtripOCR

0 likes · 11 min read

OCR Techniques and Solutions for Ctrip Business: Deep Learning Based Text Detection and Recognition

Xianyu Technology

Feb 27, 2019 · Artificial Intelligence

UI2CODE: Layout Analysis and Background/Foreground Extraction for UI Images

The UI2CODE system tackles UI layout analysis by first extracting backgrounds with Sobel, Laplacian and Canny edge detection plus a flood‑fill algorithm, then isolating foreground components through connected‑component analysis and a Faster R‑CNN classifier, and finally fusing both pipelines to achieve superior precision, recall and IoU on Xianyu app screenshots.

Faster R-CNNImage processingLayout Analysis

0 likes · 16 min read

UI2CODE: Layout Analysis and Background/Foreground Extraction for UI Images

System Architect Go

Feb 26, 2019 · Fundamentals

Master the Basics of Image Processing with OpenCV and NumPy

This article introduces core image processing concepts—pixel fundamentals, binary, grayscale, and RGB images, matrix representation—and demonstrates practical implementations of cropping, canvas creation, watermarking, translation, rotation, and scaling using Python's OpenCV and NumPy libraries, including algorithm choices for resizing.

Image processingNumPyPython

0 likes · 5 min read

Master the Basics of Image Processing with OpenCV and NumPy

ITPUB

Feb 23, 2019 · Artificial Intelligence

Explore a 1.59 Million Image NSFW Dataset with 159 Fine-Grained Categories

A data scientist from Besedo has open‑sourced a massive NSFW image dataset containing 1.589 million pictures, organized into 159 primary categories and further sub‑categories, with download scripts and GitHub links, requiring about 500 GB of storage and cautioning against viewing in the office.

AI researchGitHubNSFW dataset

0 likes · 3 min read

Explore a 1.59 Million Image NSFW Dataset with 159 Fine-Grained Categories

21CTO

Feb 22, 2019 · Fundamentals

Why the Iconic “Lenna” Photo Became the Face of Image‑Processing Research

The article recounts how a 1960 Playboy portrait of Lena Söderberg was adopted by image‑processing researchers as a standard test image, explains the technical and cultural reasons for its lasting popularity, and follows her unexpected rise to fame within the scientific community.

Image processingLennabenchmark

0 likes · 7 min read

Why the Iconic “Lenna” Photo Became the Face of Image‑Processing Research

ITPUB

Feb 16, 2019 · Artificial Intelligence

A 1.59 Million‑Image NSFW Dataset Released for Advanced Content Filtering

Data scientist Evgeny Bazarov has open‑sourced a 1.589 million‑image NSFW dataset organized into 159 fine‑grained categories, providing GitHub links, download scripts, and a 500 GB storage requirement, enabling researchers to build more precise adult‑content detection models.

GitHubNSFW datasetcomputer vision

0 likes · 3 min read

A 1.59 Million‑Image NSFW Dataset Released for Advanced Content Filtering

Alibaba Cloud Developer

Feb 12, 2019 · Artificial Intelligence

Essential AI Research Highlights to Jump‑Start Your Post‑Holiday Learning

After the Chinese New Year break, this curated collection of key AI articles—spanning computer vision, speech recognition, natural language processing, recommendation systems, and more—helps technical readers quickly regain momentum in work and study by revisiting core technologies with real‑world case studies.

AIcomputer visionspeech recognition

0 likes · 6 min read

Essential AI Research Highlights to Jump‑Start Your Post‑Holiday Learning

21CTO

Feb 7, 2019 · Artificial Intelligence

How to Build a Real‑Time Parking Spot Detector with Mask R‑CNN and Python

This tutorial walks through using a webcam, Mask R‑CNN, and Python to automatically detect available parking spaces, track stationary vehicles, compute Intersection‑over‑Union to confirm emptiness, and send SMS alerts via Twilio, providing full code snippets and practical tips.

IoUMask R-CNNPython

0 likes · 16 min read

How to Build a Real‑Time Parking Spot Detector with Mask R‑CNN and Python

JD Tech

Jan 30, 2019 · Artificial Intelligence

JD AI Presents Eight Papers at AAAI 2019 Showcasing Advances in Machine Learning, NLP, and Computer Vision

At AAAI 2019 in Hawaii, JD AI Research Institute had eight papers accepted covering machine learning, natural language processing, computer vision, and multimodal AI, highlighting innovations such as AutoZOOM black‑box attacks, SACN for knowledge base completion, and temporally aware video captioning models.

Multimodal Learningartificial-intelligencecomputer vision

0 likes · 11 min read

JD AI Presents Eight Papers at AAAI 2019 Showcasing Advances in Machine Learning, NLP, and Computer Vision

Alibaba Cloud Developer

Jan 29, 2019 · Artificial Intelligence

Alibaba's AI-Driven In-Store Foot Traffic Digitization

Alibaba’s search division showcases how AI transforms traditional retail by digitizing in‑store foot traffic, employing camera‑based person detection, re‑identification, RFID‑enhanced product interaction, and edge‑optimized models to generate real‑time customer insights, heatmaps, and personalized recommendations that bridge offline and online shopping experiences.

AIRFIDRetail Analytics

0 likes · 25 min read

Alibaba's AI-Driven In-Store Foot Traffic Digitization

ITPUB

Jan 27, 2019 · Artificial Intelligence

Achieve 99% Accurate Face Recognition with Python’s face_recognition Library

This guide introduces the open‑source Python library face_recognition, explains its high‑accuracy (up to 99.38%) facial detection and landmark capabilities, provides step‑by‑step code examples for locating faces, extracting landmarks, and comparing identities, and lists practical use‑case scenarios and the GitHub repository.

GitHubPythoncomputer vision

0 likes · 6 min read

Achieve 99% Accurate Face Recognition with Python’s face_recognition Library

DataFunTalk

Jan 14, 2019 · Artificial Intelligence

Computer Vision Fundamentals, Traditional Methods, Deep Learning Advances, and Cloud AI Deployment

This article provides a comprehensive overview of computer vision, covering its basic concepts, traditional image processing techniques, modern deep‑learning approaches, real‑world AI application cases, and the cloud infrastructure needed to support large‑scale deployment, while also offering skill‑advancement guidance.

AI ApplicationsImage processingcloud AI

0 likes · 20 min read

Computer Vision Fundamentals, Traditional Methods, Deep Learning Advances, and Cloud AI Deployment

Alibaba Cloud Developer

Jan 8, 2019 · Artificial Intelligence

How Alibaba Digitizes In‑Store Foot Traffic with AI and RFID Fusion

This article details Alibaba's end‑to‑end solution for digitizing offline retail foot traffic, combining existing surveillance cameras, RFID tags, and advanced AI techniques such as lightweight YOLO detection, knowledge distillation, and multi‑level pedestrian re‑identification to capture, analyze, and act on shopper behavior for both business operations and personalized in‑store experiences.

AIPedestrian Re-identificationRFID

0 likes · 27 min read

How Alibaba Digitizes In‑Store Foot Traffic with AI and RFID Fusion

Alibaba Cloud Developer

Jan 7, 2019 · Artificial Intelligence

What Are Alibaba DAMO Academy’s 2019 Top 10 Tech Trends and Their Real-World Impact?

This week’s Alibaba tech roundup highlights the DAMO Academy’s 2019 top‑10 technology trends—from smart cities and AI chips to blockchain and 5G—plus breakthrough AI liver‑tumor segmentation results, the open‑source Fusion design system, a Flink Forward China recap, a new computer‑vision paper collection, and an upcoming Apache Dubbo live session.

computer visionopen source

0 likes · 9 min read

What Are Alibaba DAMO Academy’s 2019 Top 10 Tech Trends and Their Real-World Impact?

DataFunTalk

Dec 20, 2018 · Artificial Intelligence

How to Build World-Class Visual AI Technology

This presentation outlines the fundamentals of computer vision, discusses key factors such as algorithm research, large‑scale training platforms, intelligent data processing, and hardware optimization, and shares practical experiences from DeepGlint on building a world‑class visual AI system and its real‑world applications.

computer visiondata pipelinehardware optimization

0 likes · 23 min read

How to Build World-Class Visual AI Technology

Tencent Cloud Developer

Dec 17, 2018 · Artificial Intelligence

An Overview of Computer Vision: Fundamentals, Traditional Techniques, and Deep Learning Applications

The talk provides a comprehensive overview of computer vision, defining its scope, detailing low‑, mid‑, and high‑level processing pipelines, reviewing classic filters and feature extractors, explaining deep‑learning breakthroughs such as CNNs and YOLO, and showcasing Tencent Cloud AI services, career paths, and learning resources.

AIcomputer visionmachine learning

0 likes · 43 min read

An Overview of Computer Vision: Fundamentals, Traditional Techniques, and Deep Learning Applications