Tagged articles

1235 articles

Page 5 of 13

Mar 22, 2023 · Artificial Intelligence

CUTLASS Extreme Performance Optimization and Its Application in Alibaba's Recommendation System

At the GTC conference, the talk presents Alibaba Cloud’s heterogeneous computing platform and introduces the Open Deep Learning API (ODLA), then details how CUTLASS‑based operator fusion dramatically accelerates attention and MLP layers in large‑scale recommendation models, achieving multi‑fold performance gains in production.

CUTLASSDeep LearningGPU computing

0 likes · 5 min read

CUTLASS Extreme Performance Optimization and Its Application in Alibaba's Recommendation System

Baidu Geek Talk

Mar 16, 2023 · Artificial Intelligence

PaddleDetection v2.6 Release: PP-YOLOE Family Expansion and Advanced Detection Algorithms

PaddleDetection v2.6 expands the PP‑YOLOE family with rotating, small‑object, dense‑object, and ultra‑lightweight edge‑GPU models, upgrades PP‑Human and PP‑Vehicle toolboxes, releases semi‑supervised, few‑shot and distillation learning methods, adds numerous state‑of‑the‑art algorithms, and improves infrastructure with Python 3.10, EMA filtering and AdamW support.

BaiduComputer VisionDeep Learning

0 likes · 14 min read

PaddleDetection v2.6 Release: PP-YOLOE Family Expansion and Advanced Detection Algorithms

政采云技术

Mar 9, 2023 · Artificial Intelligence

Comprehensive Overview of Object Detection: From Traditional Methods to Modern Deep Learning Models

This article provides a comprehensive overview of object detection, describing traditional sliding‑window approaches, deep‑learning based two‑stage and one‑stage models such as R‑CNN, Faster R‑CNN, YOLO series, and discusses current challenges, improvement directions, and future research trends in the field.

Computer VisionDeep LearningR-CNN

0 likes · 29 min read

Comprehensive Overview of Object Detection: From Traditional Methods to Modern Deep Learning Models

Tencent Cloud Developer

Mar 8, 2023 · Artificial Intelligence

Building a Scalable Recommendation System for WeChat Games: Architecture and Implementation

The article describes WeChat Games’ scalable recommendation system, detailing its four‑component architecture—offline ML platform, unified management, online DAG‑based engine, and peripheral services—along with a hybrid algorithm library, feature engineering, real‑time monitoring, and solutions that boost engagement across diverse game recommendation scenarios.

Data ManagementDeep LearningReal-time Processing

0 likes · 28 min read

Building a Scalable Recommendation System for WeChat Games: Architecture and Implementation

DataFunTalk

Feb 25, 2023 · Artificial Intelligence

The Evolution of Modern AI: From Deep Learning Foundations to ChatGPT and Future Directions

This article traces the development of artificial intelligence from its early conceptual roots and the 2012 deep‑learning breakthrough through the rise of self‑supervised large language models like BERT and GPT, explains ChatGPT’s architecture and RLHF training, and discusses its commercial impact and future prospects for fields such as life sciences.

AI applicationsChatGPTDeep Learning

0 likes · 19 min read

The Evolution of Modern AI: From Deep Learning Foundations to ChatGPT and Future Directions

DataFunTalk

Feb 22, 2023 · Artificial Intelligence

Fundamentals, Frontiers, and Applications of Graph Neural Networks

An in‑depth overview of graph neural networks (GNNs) covering their basic concepts, historical development, core models, recent research frontiers, and diverse applications such as recommendation systems, computer vision, NLP, program analysis, and smart cities, based on the book “Fundamentals, Frontiers and Applications of GNNs.”

AI applicationsDeep LearningGNN Models

0 likes · 12 min read

Fundamentals, Frontiers, and Applications of Graph Neural Networks

Architects Research Society

Feb 18, 2023 · Artificial Intelligence

Comparison of Deep Learning Software Frameworks

This article provides an overview of deep learning as a branch of machine learning and presents detailed comparative tables of popular deep‑learning software frameworks, covering creators, initial releases, licenses, platforms, programming languages, supported features such as CUDA, OpenMP, and model‑training capabilities.

Deep Learningartificial intelligencesoftware frameworks

0 likes · 9 min read

Comparison of Deep Learning Software Frameworks

Baidu Geek Talk

Feb 15, 2023 · Artificial Intelligence

PaddlePaddle 2.4 Release: New Sparse, Graph, and Audio APIs

PaddlePaddle 2.4 introduces 167 new APIs—including sparse computing (paddle.sparse), graph learning (paddle.geometric), and audio processing (paddle.audio) modules—enabling efficient sparse model training and inference, graph message‑passing, advanced audio feature extraction, plus fresh loss functions, tensor utilities, and expanded vision transforms.

API ReleaseAudio ProcessingDeep Learning

0 likes · 16 min read

PaddlePaddle 2.4 Release: New Sparse, Graph, and Audio APIs

DataFunSummit

Feb 15, 2023 · Artificial Intelligence

Accelerating Computer Vision Pipelines with CV‑CUDA: Reducing Complexity and Boosting Performance

This article examines how moving image pre‑ and post‑processing to GPU with NVIDIA's CV‑CUDA reduces software complexity, alleviates CPU bottlenecks, and delivers up to thirty‑fold throughput gains for computer‑vision workloads across training and inference pipelines.

CV-CUDAComputer VisionDeep Learning

0 likes · 17 min read

Accelerating Computer Vision Pipelines with CV‑CUDA: Reducing Complexity and Boosting Performance

DataFunSummit

Feb 14, 2023 · Artificial Intelligence

Deep Learning Hyperparameter Tuning and Training Tips: Insights from Zhihu Experts

This article compiles practical deep learning training and hyperparameter tuning advice from Zhihu contributors, covering model debugging, learning‑rate strategies, optimizer choices, data preprocessing, regularization techniques, initialization methods, common pitfalls, recommended research papers, and ensemble approaches.

Deep LearningRegularizationgradient clipping

0 likes · 13 min read

Deep Learning Hyperparameter Tuning and Training Tips: Insights from Zhihu Experts

Volcano Engine Developer Services

Feb 14, 2023 · Artificial Intelligence

How Make-An-Audio Turns Text Into Realistic Sound Effects

Make-An-Audio, a collaborative text‑to‑audio model from Zhejiang University, Peking University and Volcano Speech, uses a Distill‑then‑Reprogram strategy to generate high‑quality, controllable sound effects from any modality, showcasing impressive demos and promising future AIGC applications.

AIGCDeep LearningSpeech synthesis

0 likes · 7 min read

How Make-An-Audio Turns Text Into Realistic Sound Effects

DataFunTalk

Feb 11, 2023 · Artificial Intelligence

Accelerating Computer Vision Pipelines with CV-CUDA: Reducing Complexity and Performance Bottlenecks

This article explains how moving image preprocessing and post‑processing to GPU with the open‑source CV‑CUDA library dramatically reduces system complexity, eliminates CPU‑GPU bottlenecks, and delivers up to thirty‑fold performance gains for computer‑vision workloads across training and inference stages.

CV-CUDAComputer VisionDeep Learning

0 likes · 16 min read

Accelerating Computer Vision Pipelines with CV-CUDA: Reducing Complexity and Performance Bottlenecks

JD Cloud Developers

Feb 8, 2023 · Operations

Boosting Log Anomaly Detection with NLP and Deep Learning

This article presents a log anomaly detection approach that leverages NLP techniques such as Part‑of‑Speech tagging and Named Entity Recognition combined with deep neural networks, detailing a six‑step model, experimental validation on three datasets, and superior performance compared with existing DeepLog and LogClass methods.

DNNDeep LearningNER

0 likes · 13 min read

Boosting Log Anomaly Detection with NLP and Deep Learning

DataFunSummit

Feb 6, 2023 · Artificial Intelligence

A Minimalist White‑Box Unsupervised Learning Method Using Sparse Manifold Transform

A recent paper by Prof. Ma Yi and Turing‑Award winner Yann LeCun introduces a simple, interpretable unsupervised learning approach that combines sparse coding, manifold learning, and slow feature analysis, achieving near‑state‑of‑the‑art performance on MNIST, CIFAR‑10, and CIFAR‑100 without data augmentation or extensive hyper‑parameter tuning.

AIDeep LearningUnsupervised Learning

0 likes · 8 min read

A Minimalist White‑Box Unsupervised Learning Method Using Sparse Manifold Transform

DataFunSummit

Feb 5, 2023 · Artificial Intelligence

Key Takeaways from the Causal Inference Summit: Motivation, Applications, Challenges, and Links to A/B Testing, Machine Learning, and Deep Learning

After attending the DataFun causal inference summit, this article outlines why causal analysis matters, its typical use cases, practical challenges, its relationship with A/B testing, and how it integrates with machine learning and deep learning to improve decision‑making and model robustness.

A/B testingDeep LearningUplift Modeling

0 likes · 10 min read

Key Takeaways from the Causal Inference Summit: Motivation, Applications, Challenges, and Links to A/B Testing, Machine Learning, and Deep Learning

DataFunSummit

Feb 4, 2023 · Artificial Intelligence

Overview of Deep Learning Algorithms: Supervised, Unsupervised, and Semi‑Supervised Methods

This article introduces deep learning as a powerful AI technique, explains its core algorithms—including supervised, unsupervised, and semi‑supervised approaches—and provides concrete examples such as CNN, RNN, autoencoders, GAN, self‑supervised and transfer learning, illustrated with visual demos.

AIDeep LearningGAN

0 likes · 6 min read

Overview of Deep Learning Algorithms: Supervised, Unsupervised, and Semi‑Supervised Methods

Alimama Tech

Feb 1, 2023 · Artificial Intelligence

CapOnImage: Context-driven Dense Captioning on Images

The paper presents CapOnImage, a novel image‑on‑image captioning task that generates location‑specific decorative text for product images, introduces the 2.1‑million‑image CapOnImage2M dataset, and proposes a mixed‑modality transformer with position‑aware pre‑training and progressive training, achieving superior accuracy and diversity and already deployed in Alibaba’s advertising platforms for measurable business impact.

Context-AwareDatasetDeep Learning

0 likes · 9 min read

CapOnImage: Context-driven Dense Captioning on Images

Tencent Tech

Jan 20, 2023 · Artificial Intelligence

Why Tencent’s Yu Dong Became a 2022 ACM Fellow: Pioneering Deep Learning in Speech

Tencent AI Lab deputy Yu Dong was named a 2022 ACM Fellow for his groundbreaking work in speech processing and deep‑learning applications, boasting over 100 patents, multiple best‑paper awards, and technologies now embedded in many of Tencent’s products.

ACM FellowDeep LearningTencent

0 likes · 5 min read

Why Tencent’s Yu Dong Became a 2022 ACM Fellow: Pioneering Deep Learning in Speech

DataFunTalk

Jan 12, 2023 · Artificial Intelligence

The Evolution of AIGC in 2022 and Baidu’s Role in AI Talent and Large‑Model Innovation

The article reviews the rapid rise of AI‑generated content (AIGC) in 2022, outlines its historical development stages, highlights Baidu’s breakthrough video‑generation and search technologies, and discusses the severe AI talent shortage in China along with Baidu’s extensive training initiatives.

AI talentAIGCBaidu

0 likes · 14 min read

The Evolution of AIGC in 2022 and Baidu’s Role in AI Talent and Large‑Model Innovation

Xiaohongshu Tech REDtech

Jan 10, 2023 · Artificial Intelligence

AI‑Driven Video Coding: Expert Q&A on Intelligent Compression, Standards, and Future Directions

Experts Wang Shenshe and Chen Jing discuss how deep‑learning‑based video coding is reshaping traditional compression by offering modest quality gains but facing theoretical, hardware, and standardization hurdles, while debating hybrid versus end‑to‑end designs, rate control, 3‑D support, and the balance between human‑centric perception and machine‑oriented efficiency.

AIDeep LearningRate Control

0 likes · 16 min read

AI‑Driven Video Coding: Expert Q&A on Intelligent Compression, Standards, and Future Directions

Tencent Cloud Developer

Jan 9, 2023 · Artificial Intelligence

Search Relevance Architecture and Practices in QQ Browser

The QQ Browser search relevance team describes a unified, billion‑scale architecture that combines a main and vertical subsystem, a pyramid‑shaped ranking pipeline (recall, coarse, fine), a dedicated GPU‑accelerated relevance service, and hybrid semantic‑matching models (dual‑tower, BERT, matrix fusion) evaluated with offline and online metrics to deliver accurate, fresh, and authoritative results for diverse content and long‑tail queries.

Deep LearningEvaluation MetricsSystem Architecture

0 likes · 28 min read

Search Relevance Architecture and Practices in QQ Browser

DataFunTalk

Jan 9, 2023 · Artificial Intelligence

Key Techniques for Digital Human Modeling: Facial Portrait Editing, Eyelash Segmentation, and Real‑Time Loose Clothing Animation

This article reviews recent research on digital human creation, covering graph‑based facial portrait editing (fat‑slim adjustment, double‑chin removal, hair removal), a high‑quality eyelash segmentation dataset with the EyelashNet pipeline, and a deep‑learning framework for real‑time animation of loose clothing using virtual skeletons and RBF networks.

Deep Learningdigital humanseyelash segmentation

0 likes · 15 min read

Key Techniques for Digital Human Modeling: Facial Portrait Editing, Eyelash Segmentation, and Real‑Time Loose Clothing Animation

DataFunTalk

Jan 8, 2023 · Artificial Intelligence

Adaptive Blend Pyramid Network for Real-Time Local Retouching of Ultra High-Resolution Images

The paper introduces ABPN, an Adaptive Blend Pyramid Network that achieves precise, high‑quality skin retouching and garment wrinkle removal on 4K‑8K photos in real time by combining a context‑aware local retouching layer with a novel adaptive blend pyramid layer, addressing challenges of artifact‑free detail preservation and efficient high‑resolution processing.

Computer VisionDeep Learningadaptive blend pyramid

0 likes · 16 min read

Adaptive Blend Pyramid Network for Real-Time Local Retouching of Ultra High-Resolution Images

Bilibili Tech

Jan 3, 2023 · Artificial Intelligence

Two‑Pass Deep Learning Bitrate Factor Prediction for Constant‑Quality Segment Encoding in Bilibili Narrow‑Band HD Transcoding

Bilibili’s IEEE‑VCIP‑2022‑accepted two‑pass deep‑learning bitrate‑factor predictor achieves 98.8% accuracy with only 1.55 encoding passes, enabling constant‑quality segment encoding that reduces bitrate consumption by over 15% while maintaining visual quality in its narrow‑band HD transcoding pipeline.

AIBitrate OptimizationDeep Learning

0 likes · 6 min read

Two‑Pass Deep Learning Bitrate Factor Prediction for Constant‑Quality Segment Encoding in Bilibili Narrow‑Band HD Transcoding

DataFunTalk

Dec 28, 2022 · Artificial Intelligence

A Comprehensive Survey of Graph Neural Networks: Development, Complex Graph Models, Applications, Scalability, and Future Directions

This article provides an extensive overview of graph neural networks, tracing their evolution from early RNN‑based models to modern message‑passing frameworks, discussing complex graph types, diverse real‑world applications, scalability challenges, design spaces, training platforms, and promising research directions.

Deep LearningGNNScalability

0 likes · 49 min read

A Comprehensive Survey of Graph Neural Networks: Development, Complex Graph Models, Applications, Scalability, and Future Directions

DataFunTalk

Dec 27, 2022 · Artificial Intelligence

Efficient Training for Very Large‑Scale Face Recognition and the FFC Framework

This article reviews the challenges of ultra‑large‑scale face recognition, presents existing solutions such as metric learning, PFC and VFC, and details the proposed FFC framework with dual loaders, ID groups, probe and gallery networks, plus experimental results showing its cost‑effective performance.

AIComputer VisionDeep Learning

0 likes · 7 min read

Efficient Training for Very Large‑Scale Face Recognition and the FFC Framework

58 Tech

Dec 22, 2022 · Artificial Intelligence

Implementing a Cloud-Native Istio Gateway for 58.com Deep Learning Inference Platform

This article details the evolution of 58.com’s deep learning inference platform, describing the transition from the original SCF‑based architecture to a cloud‑native Istio gateway (architecture 2.0), and explains design choices, traffic‑management, adaptive rate‑limiting, observability, model pre‑warming, and performance improvements.

AICloud NativeDeep Learning

0 likes · 22 min read

Implementing a Cloud-Native Istio Gateway for 58.com Deep Learning Inference Platform

Alimama Tech

Dec 21, 2022 · Artificial Intelligence

Adaptive Parameter Generation Network for Click-Through Rate Prediction

Adaptive Parameter Generation Network (APG) dynamically creates sample‑specific model parameters for click‑through‑rate prediction using low‑rank factorization, parameter sharing, and over‑parameterization, achieving up to 0.2% AUC improvement, 3% CTR lift, and up to 96.6% storage reduction with faster inference.

CTR predictionDeep Learningadaptive parameter generation

0 likes · 14 min read

Adaptive Parameter Generation Network for Click-Through Rate Prediction

DataFunTalk

Dec 21, 2022 · Artificial Intelligence

A Comprehensive Overview of Computational Advertising: Architecture, Deep‑Learning Evolution, and Future Directions

This article provides a thorough examination of computational advertising, covering the oCPM pricing model as a superset, classic system architecture, the evolution of core modules such as ad ranking, pacing, bidding, federated learning, calibration, and conversion‑delay handling, and concludes with career advice for algorithm engineers.

CalibrationDeep LearningFederated Learning

0 likes · 29 min read

A Comprehensive Overview of Computational Advertising: Architecture, Deep‑Learning Evolution, and Future Directions

DataFunTalk

Dec 17, 2022 · Artificial Intelligence

Multimodal Pre‑training Techniques and Applications – Overview, OPPOVL Dataset, Architecture, and Performance

This article presents a comprehensive overview of multimodal pre‑training, describing its motivation, architecture choices, large‑scale Chinese image‑text dataset construction, training optimizations, performance benchmarks, downstream applications, and a Q&A session that highlights practical deployment considerations.

Computer VisionDeep LearningModel architecture

0 likes · 16 min read

Multimodal Pre‑training Techniques and Applications – Overview, OPPOVL Dataset, Architecture, and Performance

Alimama Tech

Dec 14, 2022 · Artificial Intelligence

Contrastive Image Representation Learning with Debiasing for CTR Prediction

The article proposes a three-stage contrastive learning framework—pre‑training, fine‑tuning, and debiasing—to generate unbiased, fine‑grained image embeddings for mobile Taobao CTR prediction, achieving higher accuracy, fairness, and a 4‑5% CTR lift in large‑scale offline and online evaluations.

CTR predictionDeep Learningbias mitigation

0 likes · 14 min read

Contrastive Image Representation Learning with Debiasing for CTR Prediction

Alipay Experience Technology

Dec 8, 2022 · Artificial Intelligence

How xNN Revolutionizes Edge AI with Scalable Modeling and Optimization

This article explains the evolution of Ant Group's xNN edge‑AI framework, detailing its four‑layer model‑optimization space, the lightweight modeling of version 1.0, and the transition to scalable modeling in version 2.0 to better exploit fragmented device compute resources.

Deep LearningMobile AIModel Optimization

0 likes · 21 min read

How xNN Revolutionizes Edge AI with Scalable Modeling and Optimization

vivo Internet Technology

Dec 7, 2022 · Artificial Intelligence

Mixing Heterogeneous Queues in Vivo's Information Flow and App Store: Challenges, Practices, and RL/Deep Learning Solutions

Vivo tackles the complex problem of mixing heterogeneous content queues—ads, games, and organic items—in its information‑flow and app‑store by evolving from rule‑based weighting to Q‑learning and deep‑learning position models that respect product constraints, preserve ordering, and balance short‑term revenue with long‑term user experience, while planning deeper personalization and on‑device solutions.

AdvertisingApp StoreDeep Learning

0 likes · 14 min read

Mixing Heterogeneous Queues in Vivo's Information Flow and App Store: Challenges, Practices, and RL/Deep Learning Solutions

Alimama Tech

Dec 7, 2022 · Artificial Intelligence

Adaptive Domain Interest Network for Multi-domain Recommendation

The Adaptive Domain Interest Network (ADIN) introduces a shared backbone with scenario‑specific subnetworks, domain‑specific batch normalization and SE‑Block attention to capture both commonalities and divergences across recommendation scenarios, and, combined with self‑supervised training, consistently outperforms baselines, delivering a 1.8% revenue lift in Alibaba’s display‑ad platform and now runs in production.

Deep Learningdomain adaptationrecommendation

0 likes · 12 min read

Adaptive Domain Interest Network for Multi-domain Recommendation

DataFunTalk

Dec 4, 2022 · Artificial Intelligence

Key Insights on Causal Inference: Motivation, Applications, Challenges, and Links to A/B Testing, ML, and Deep Learning

This article summarizes the motivations behind causal inference, its typical business applications such as intelligent decision‑making and prediction, the practical challenges of validation and data, and its relationship with A/B testing, machine learning, and deep learning, providing a concise overview for newcomers.

AB testingBusiness AnalyticsDeep Learning

0 likes · 10 min read

Key Insights on Causal Inference: Motivation, Applications, Challenges, and Links to A/B Testing, ML, and Deep Learning

Hulu Beijing

Dec 2, 2022 · Artificial Intelligence

How Disney+ Designs a Multi‑Task Video Search Ranking Model

This article explains the architecture of a video search ranking system that combines a deep encoding network, multi‑task expert networks, and a bias‑correction module to jointly optimize relevance, click‑through rate, and watch time for streaming platforms.

Bias CorrectionDeep Learningfeature engineering

0 likes · 15 min read

How Disney+ Designs a Multi‑Task Video Search Ranking Model

DataFunSummit

Nov 30, 2022 · Artificial Intelligence

Combining Knowledge Graphs with Personalized News Recommendation Systems

This article presents a comprehensive overview of a personalized news recommendation system that leverages knowledge graphs to improve accuracy, explainability, and user satisfaction, detailing background motivations, graph construction methods, model architecture, experimental results, and practical insights from a Meituan research perspective.

Deep Learningexplainabilitygraph neural networks

0 likes · 23 min read

Combining Knowledge Graphs with Personalized News Recommendation Systems

Architects Research Society

Nov 30, 2022 · Artificial Intelligence

A Comprehensive Overview of Machine Learning Tools and Libraries

An extensive survey ranks and compares a wide range of machine learning libraries and frameworks—both deep and shallow learning—detailing their languages, types, GPU acceleration, distributed computing capabilities, and typical academic and industrial applications, based on Google search popularity as of May.

Deep LearningGPU Accelerationdistributed computing

0 likes · 20 min read

A Comprehensive Overview of Machine Learning Tools and Libraries

Rare Earth Juejin Tech Community

Nov 24, 2022 · Artificial Intelligence

Spectral Normalization in GANs: Theory, Implementation, and Code Walkthrough

This article explains the motivation behind Spectral Normalization for stabilizing GAN training, derives its mathematical foundation, shows how it enforces the Lipschitz constraint, and provides a complete PyTorch implementation with detailed code explanations.

Deep LearningGANLipschitz Continuity

0 likes · 11 min read

Spectral Normalization in GANs: Theory, Implementation, and Code Walkthrough

Dada Group Technology

Nov 18, 2022 · Artificial Intelligence

JD Daojia Machine Learning Platform: Architecture and Implementation

This article introduces JD Daojia's machine learning platform, detailing its architecture, implementation principles, and practical applications in various business scenarios, achieving significant improvements in recommendation and search systems.

Deep LearningDistributed TrainingKubernetes

0 likes · 28 min read

JD Daojia Machine Learning Platform: Architecture and Implementation

SQB Blog

Nov 18, 2022 · Artificial Intelligence

Boosting AI Model Development with Alibaba's EasyModeling Framework

This article introduces the EasyModeling framework built on Alibaba Cloud's PAI platform, detailing its modular design, high reusability, integration with deep‑learning libraries, automated hyper‑parameter tuning, deployment scenarios, and a real‑world case study using RoBERTa for dish‑name standardization, demonstrating significant performance gains.

AI modelingAlibaba CloudAutoML

0 likes · 13 min read

Boosting AI Model Development with Alibaba's EasyModeling Framework

Zuoyebang Tech Team

Nov 17, 2022 · Artificial Intelligence

Scaling Deep Learning Model Serving: High‑Concurrency, Low‑Latency Solutions

This article examines the challenges of deploying dozens of deep‑learning models at Zuoyebang and compares three serving architectures—Gunicorn + Flask + Transformers, Tornado + PyTorch, and Tornado + Triton—highlighting performance trade‑offs and presenting a final high‑concurrency, low‑latency solution in production.

Deep LearningLow latencyModel Deployment

0 likes · 11 min read

Scaling Deep Learning Model Serving: High‑Concurrency, Low‑Latency Solutions

Tencent Advertising Technology

Nov 17, 2022 · Artificial Intelligence

Scaling Huge Embedding Model Training with Cache-Enabled Distributed Framework (HET): VLDB 2022 Best Paper and Its Industrial Deployment

The award‑winning VLDB 2022 paper introduces HET, a cache‑enabled distributed framework that dramatically reduces communication overhead for sparse trillion‑parameter embedding models, and Tencent Ads has industrialized this technology to train 10 TB‑scale models with up to 7×24‑hour online deep learning.

CacheDeep LearningEmbedding

0 likes · 9 min read

Scaling Huge Embedding Model Training with Cache-Enabled Distributed Framework (HET): VLDB 2022 Best Paper and Its Industrial Deployment

Alimama Tech

Nov 16, 2022 · Artificial Intelligence

STARDOM: Semantic-Aware Deep Hierarchical Forecasting Model for Search Traffic Prediction

STARDOM is an end‑to‑end deep hierarchical forecasting model that jointly learns hierarchical constraints, query semantics via pretrained BERT, and a calibration matrix within an encoder‑decoder architecture, using a distilled reconciliation loss and hierarchical sampling to accurately predict large‑scale search traffic and outperform state‑of‑the‑art baselines.

Deep Learninghierarchical modelingsearch advertising

0 likes · 22 min read

STARDOM: Semantic-Aware Deep Hierarchical Forecasting Model for Search Traffic Prediction

Python Crawling & Data Mining

Nov 11, 2022 · Artificial Intelligence

Top Python Books to Master Statistics, Data Science, and AI – Free Giveaway Inside

Discover a curated collection of beginner-friendly Python books covering statistics, data analysis, machine learning, deep learning, and computer vision, each with concise descriptions, key features, and practical examples, plus a chance to win a free copy by participating in the event.

BooksDeep LearningFree Giveaway

0 likes · 12 min read

DataFunSummit

Nov 10, 2022 · Artificial Intelligence

Voice‑Driven Facial Animation for Digital Humans: Techniques and OPPO XiaoBu Assistant Practice

This article introduces digital‑human voice‑driven facial animation technologies, compares motion‑capture, audio‑driven and key‑point methods, details OPPO XiaoBu Assistant's end‑side and cloud‑side Audio2Lip pipelines, explores BlendShape versus Mesh approaches, and discusses current challenges and future research directions.

Deep LearningOPPOaudio-to-facial

0 likes · 15 min read

Voice‑Driven Facial Animation for Digital Humans: Techniques and OPPO XiaoBu Assistant Practice

Shopee Tech Team

Nov 10, 2022 · Artificial Intelligence

ShopeeVideo OCR: Multi-language Text Recognition System for E-commerce Video

ShopeeVideo OCR is a multi‑language text‑recognition system for Southeast Asian e‑commerce videos that unifies detection, Transformer‑based recognition, layout analysis, and large‑scale synthetic data generation to handle Indonesian, Filipino, English, Vietnamese, Thai and Chinese scripts, delivering industry‑leading accuracy and winning thirteen ICDAR first‑place awards.

Computer VisionDeep LearningMulti-language OCR

0 likes · 15 min read

ShopeeVideo OCR: Multi-language Text Recognition System for E-commerce Video

Ctrip Technology

Nov 10, 2022 · Artificial Intelligence

Improving Search Intent Recognition and Term Weighting with Deep Learning and Model Distillation at Ctrip

This article describes how Ctrip's R&D team applied deep‑learning models, BERT‑based embeddings, knowledge distillation, and term‑weighting techniques to enhance e‑commerce search intent recognition and term importance estimation, achieving high accuracy while meeting sub‑10 ms latency requirements.

BERTDeep LearningSearch

0 likes · 12 min read

Improving Search Intent Recognition and Term Weighting with Deep Learning and Model Distillation at Ctrip

Bilibili Tech

Nov 8, 2022 · Artificial Intelligence

Real-Time Super-Resolution Algorithm for League of Legends S12 Live Streaming

A lightweight real‑time super‑resolution network was created for the 2022 League of Legends S12 World Championship, using pixel‑unshuffle/shuffle, structural re‑parameterization, and a multi‑loss (L1, perceptual, Sobel‑based texture, GAN) training pipeline that upscales 1080p streams to 4K at 75 fps on a V100 GPU, delivering clearer textures and reduced noise while remaining computationally efficient.

Deep LearningLoss Functionsgame streaming

0 likes · 10 min read

Real-Time Super-Resolution Algorithm for League of Legends S12 Live Streaming

JD Cloud Developers

Nov 7, 2022 · Artificial Intelligence

Detecting Time‑Series Anomalies Without Thresholds Using LSTM and Unsupervised Fusion

This article presents a threshold‑free anomaly detection framework for streaming time series that combines an LSTM‑based baseline module with an unsupervised detection module, detailing the architecture, training process, data preprocessing, and experimental results that demonstrate superior accuracy and F1 scores.

Deep LearningLSTMTime Series

0 likes · 15 min read

Detecting Time‑Series Anomalies Without Thresholds Using LSTM and Unsupervised Fusion

Model Perspective

Nov 5, 2022 · Artificial Intelligence

Explore the Most Popular Machine Learning Algorithms and How They Work

This comprehensive guide walks you through the most popular machine learning algorithms, explaining how they are classified by learning style and problem type, and highlighting key examples from supervised, unsupervised, deep learning, ensemble, and many other algorithm families.

Deep LearningUnsupervised Learningsupervised learning

0 likes · 11 min read

Explore the Most Popular Machine Learning Algorithms and How They Work

Model Perspective

Nov 5, 2022 · Artificial Intelligence

Explore 70+ Model Articles: From Differential Equations to Deep Learning

This curated list groups recent model articles for students, covering variation models, time‑series prediction, machine‑learning techniques, deep‑learning architectures, and gray‑system forecasting, each with direct links to the original Chinese resources.

Deep LearningModelingPrediction

0 likes · 6 min read

Explore 70+ Model Articles: From Differential Equations to Deep Learning

Baidu Geek Talk

Oct 31, 2022 · Artificial Intelligence

PaddleBox: A GPU‑Based Ultra‑Large‑Scale Sparse DNN Training Framework

PaddleBox is Baidu’s GPU‑based ultra‑large‑scale sparse DNN training framework that combines a three‑tier hierarchical parameter server (SSD, DRAM, HBM) with pipelined scheduling and multi‑machine multi‑GPU communication, delivering 5–40× cost‑performance gains over traditional CPU solutions and powering Baidu’s advertising services.

Deep LearningGPUPaddleBox

0 likes · 15 min read

PaddleBox: A GPU‑Based Ultra‑Large‑Scale Sparse DNN Training Framework

Alimama Tech

Oct 26, 2022 · Artificial Intelligence

GPU Utilization Analysis and Optimization for Alibaba's Intelligent Creative Video Service

The paper analyzes why Alibaba Mama’s intelligent creative video service suffers low GPU utilization—due to Python GIL blocking, lack of kernel fusion, and serialized CUDA streams—and details service‑level changes (separate CPU/GPU processes, shared‑memory queues, priority scheduling) and operator‑level kernel‑fusion techniques (channels‑last layouts, custom pooling, TensorRT conversion) that raise utilization from ~30 % to near 100 % and boost throughput by 75 %.

Deep LearningGPU OptimizationPython

0 likes · 20 min read

GPU Utilization Analysis and Optimization for Alibaba's Intelligent Creative Video Service

Rare Earth Juejin Tech Community

Oct 18, 2022 · Artificial Intelligence

Practical Implementation of Vision Transformer (ViT) for Image Classification in PyTorch

This article walks readers through building, training, and evaluating a Vision Transformer (ViT) model for a five‑class flower classification task, providing detailed code snippets, model architecture explanations, training script adjustments, and experimental results that highlight the importance of pre‑trained weights.

Deep LearningImage ClassificationPyTorch

0 likes · 13 min read

Practical Implementation of Vision Transformer (ViT) for Image Classification in PyTorch

Bilibili Tech

Oct 18, 2022 · Artificial Intelligence

Real-Time Super-Resolution Algorithm for League of Legends S12 Live Streaming

A real‑time super‑resolution network specially designed for the League of Legends S12 live broadcast upscales 1080p streams to 4K at 75 fps by compressing parameters, employing pixel‑unshuffle/shuffle, structural re‑parameterization, and a multi‑loss (L1, perceptual, Sobel, GAN) training pipeline, delivering markedly sharper textures and lower latency for live game streaming.

Deep LearningLoss Functionsgame streaming

0 likes · 12 min read

Ctrip Technology

Oct 13, 2022 · Artificial Intelligence

Chinese New Word Discovery: From Traditional Unsupervised Methods to CNN‑Based Deep Learning

The article examines the challenge of out‑of‑vocabulary terms in Chinese e‑commerce NLP, reviews classic unsupervised metrics such as frequency, cohesion and neighbor entropy, and proposes a lightweight fully‑convolutional network inspired by image‑segmentation techniques to automatically detect new words.

CNNDeep LearningNLP

0 likes · 10 min read

Chinese New Word Discovery: From Traditional Unsupervised Methods to CNN‑Based Deep Learning

DataFunTalk

Oct 13, 2022 · Artificial Intelligence

Multimodal Attribute-Level Sentiment Analysis for Social Media: Background, Tasks, and Recent Advances

This article reviews the rapid development of multimodal attribute-level sentiment analysis on social media, outlining its background, defining four core sub‑tasks, summarizing representative recent models—including unified multimodal transformers, coarse‑to‑fine image‑target matching, and vision‑language pre‑training—and discussing experimental results and future research directions.

Deep LearningNLPaspect based sentiment

0 likes · 21 min read

Multimodal Attribute-Level Sentiment Analysis for Social Media: Background, Tasks, and Recent Advances

Alibaba Cloud Big Data AI Platform

Oct 12, 2022 · Artificial Intelligence

Unlock Vision AI: How EasyCV Streamlines Datasets and Model Training

This article introduces EasyCV, an open‑source all‑in‑one visual algorithm platform that abstracts diverse data sources, provides SOTA self‑supervised models, and offers ready‑to‑download datasets for image classification, object detection, segmentation, and pose estimation, complete with configuration examples.

Computer VisionDatasetsDeep Learning

0 likes · 9 min read

Unlock Vision AI: How EasyCV Streamlines Datasets and Model Training

Rare Earth Juejin Tech Community

Oct 11, 2022 · Artificial Intelligence

GANomaly: Theory and Source Code Analysis

This article explains the GANomaly model for semi‑supervised anomaly detection, detailing its generator‑encoder‑discriminator architecture, loss functions, testing phase scoring, and provides annotated PyTorch source code to help readers implement and understand the approach.

Deep LearningEncoder-DecoderGAN

0 likes · 15 min read

GANomaly: Theory and Source Code Analysis

Rare Earth Juejin Tech Community

Oct 8, 2022 · Artificial Intelligence

Wasserstein GAN (WGAN): Theory and Hands‑On Implementation

This article explains why traditional GANs suffer from training instability, introduces the Wasserstein (Earth‑Mover) distance as a smoother alternative, derives the WGAN objective, discusses Lipschitz constraints, and provides practical PyTorch code modifications to convert a vanilla GAN into a stable WGAN.

Deep LearningGANPyTorch

0 likes · 21 min read

Wasserstein GAN (WGAN): Theory and Hands‑On Implementation

AntTech

Sep 27, 2022 · Artificial Intelligence

Ant Group’s Research Institute Publishes Four NeurIPS 2022 Papers on Advanced Computer Vision and AI

Ant Group’s Ant Technology Research Institute had four papers from its Visual Intelligence Lab accepted at NeurIPS 2022, covering rank diminishing in deep networks, geometry‑aware 3D image synthesis, dynamic discriminators for GANs, and uncertainty‑aware hierarchical refinement for incremental classification, highlighting the institute’s cutting‑edge AI research.

AI researchComputer VisionDeep Learning

0 likes · 8 min read

Ant Group’s Research Institute Publishes Four NeurIPS 2022 Papers on Advanced Computer Vision and AI

DataFunTalk

Sep 25, 2022 · Artificial Intelligence

Personalized News Recommendation System Based on Knowledge Graphs

This talk presents a personalized news recommendation system that leverages knowledge graphs to enhance recommendation accuracy, explainability, and user interest modeling, detailing background, graph construction methods, multi‑task deep learning architecture, experimental results, and future research directions.

Deep LearningGraph ConstructionKnowledge Graph

0 likes · 22 min read

Personalized News Recommendation System Based on Knowledge Graphs

Zhengtong Technical Team

Sep 22, 2022 · Artificial Intelligence

How YOLOv5 Powers Real‑Time City Management Video Analysis

This article explains the background, workflow, and technical details of using the YOLOv5 one‑stage object detection algorithm to enable fast, accurate video analytics for urban management, covering data augmentation, backbone design, FPN‑PAN neck, and prediction output processing.

AIComputer VisionDeep Learning

0 likes · 8 min read

How YOLOv5 Powers Real‑Time City Management Video Analysis

HomeTech

Sep 20, 2022 · Artificial Intelligence

Deep Learning for Image Classification: Classic Networks, Attention Mechanisms, and Their Application to Fine‑Grained Classification and Automotive Series Recognition

This article reviews the evolution of deep‑learning image‑classification networks, surveys attention mechanisms for fine‑grained tasks, describes the CVPR 2022 FGVC9 competition solution using RegNetY and random attention cropping, and discusses its deployment in automotive series recognition along with future challenges.

CVPRComputer VisionDeep Learning

0 likes · 19 min read

Deep Learning for Image Classification: Classic Networks, Attention Mechanisms, and Their Application to Fine‑Grained Classification and Automotive Series Recognition

Java Architect Essentials

Sep 18, 2022 · Industry Insights

Why AI Porn Detection Still Struggles: Key Challenges and the Need for Human Moderators

AI-powered porn detection leverages deep neural networks to classify images, but faces serious hurdles such as visual similarity with benign content, subjective standards of obscenity, and vulnerabilities stemming from training data, making human moderators indispensable for reliable content safety.

AI moderationContent SafetyDeep Learning

0 likes · 3 min read

Why AI Porn Detection Still Struggles: Key Challenges and the Need for Human Moderators

Programmer DD

Sep 13, 2022 · Artificial Intelligence

Why AI Porn Detection Still Struggles: Key Challenges Explained

AI-based porn detection uses deep neural networks to classify images, but faces tough hurdles such as visual similarity with benign content, subjective standards for nudity, and vulnerabilities from training‑data dependence, meaning human moderators remain essential for reliable safety.

AI moderationComputer VisionContent Safety

0 likes · 3 min read

Why AI Porn Detection Still Struggles: Key Challenges Explained

DataFunTalk

Sep 13, 2022 · Artificial Intelligence

Intelligent Question Answering in QQ Browser Search: Background, Key Technologies, and Frontier Research

This article presents an in‑depth overview of intelligent question answering in QQ Browser search, covering its background, the core KBQA and DeepQA technologies, system architecture, challenges, recent advances such as end‑to‑end, knowledge‑guided and multimodal QA, and practical Q&A for deployment.

AIDeep LearningKnowledge Graph

0 likes · 22 min read

Intelligent Question Answering in QQ Browser Search: Background, Key Technologies, and Frontier Research

Huolala Tech

Sep 10, 2022 · Artificial Intelligence

How AI Transforms Freight Safety: Real‑Time Risk Detection and Intervention

This article explains how AI technologies enable end‑to‑end freight safety monitoring, from pre‑trip and in‑trip risk identification to targeted interventions and governance, addressing challenges such as long‑tail data, small‑sample learning, fine‑grained classification, and multi‑level filtering.

AIDeep LearningLogistics

0 likes · 12 min read

How AI Transforms Freight Safety: Real‑Time Risk Detection and Intervention

DataFunSummit

Sep 9, 2022 · Artificial Intelligence

Wuliang: Tencent's Deep Learning Framework for Real‑Time Large‑Scale Recommendation

The presentation by Tencent expert Yuan Yi details the Wuliang deep learning system for recommendation, covering its background, technical challenges such as massive data and real‑time requirements, the parameter‑server based solutions for training and inference, model compression techniques, and continuous online deployment strategies.

Deep LearningLarge-Scale TrainingParameter Server

0 likes · 14 min read

Wuliang: Tencent's Deep Learning Framework for Real‑Time Large‑Scale Recommendation

Bilibili Tech

Sep 9, 2022 · Artificial Intelligence

Visual Lossless Deep Learning Pre‑processing for Video Transcoding Using DCT‑Based Low‑Rank Loss and a Lightweight Model

A visual‑lossless deep‑learning pre‑processor that employs a DCT‑based low‑rank loss and an ultra‑lightweight CPU‑friendly model achieves up to 20% bitrate reduction for 1080p videos while preserving high‑frequency details, enabling real‑time transcoding and bandwidth savings for popular content on Bilibili.

AIDCTDeep Learning

0 likes · 11 min read

Visual Lossless Deep Learning Pre‑processing for Video Transcoding Using DCT‑Based Low‑Rank Loss and a Lightweight Model

DataFunSummit

Sep 4, 2022 · Artificial Intelligence

Sparse Features in Machine Learning: Challenges, NVIDIA Ampere Structured Sparsity, Knowledge Distillation, and GAN Model Compression

This talk explores the challenges and opportunities of leveraging sparsity in machine learning models, covering fine‑grained and coarse‑grained sparsity, NVIDIA Ampere’s 2:4 structured sparsity, knowledge‑distillation techniques for converting unstructured to structured sparsity, and model compression strategies for generative adversarial networks.

Deep LearningGANGPU Acceleration

0 likes · 14 min read

Sparse Features in Machine Learning: Challenges, NVIDIA Ampere Structured Sparsity, Knowledge Distillation, and GAN Model Compression

Alibaba Cloud Big Data AI Platform

Sep 1, 2022 · Artificial Intelligence

How Uni‑Fold + Alibaba PAI Boost Protein Structure Prediction to 6.6k Amino Acids

DeepMind’s AlphaFold inspired Uni‑Fold, now accelerated with Alibaba Cloud’s PAI platform, can predict protein structures up to 6.6k amino acids—covering 99.992% of known sequences—delivering ten‑minute inference for SARS‑CoV‑2 spike trimers and setting new performance benchmarks for AI‑driven structural biology.

AI accelerationAlibaba PAIDeep Learning

0 likes · 7 min read

How Uni‑Fold + Alibaba PAI Boost Protein Structure Prediction to 6.6k Amino Acids

ByteDance Terminal Technology

Sep 1, 2022 · Artificial Intelligence

Hybrid Computer Vision and Deep Learning for Automated UI Background Color Extraction and Assertion

This article presents a hybrid pipeline combining traditional computer vision techniques and deep learning models to automatically extract and verify text background colors in UI automation screenshots, effectively addressing challenges like limited training data and complex borders to significantly reduce manual inspection costs while achieving high accuracy and robustness in production environments.

Automated TestingComputer VisionDeep Learning

0 likes · 10 min read

Hybrid Computer Vision and Deep Learning for Automated UI Background Color Extraction and Assertion

iQIYI Technical Product Team

Aug 26, 2022 · Industry Insights

How iQ Dubbing Cuts Film Dubbing Time by Over 3× Using Deep‑Learning AI

iQ Dubbing, iQIYI’s AI‑driven dubbing platform, leverages deep‑learning models to automate voice‑over, international voice extraction, and sound‑effect creation, reducing dubbing turnaround from months to under a week and boosting efficiency more than threefold, as recognized by the 2022 ChinaMM Innovation Product award.

AI dubbingDeep Learningindustry insight

0 likes · 4 min read

How iQ Dubbing Cuts Film Dubbing Time by Over 3× Using Deep‑Learning AI

DataFunSummit

Aug 23, 2022 · Artificial Intelligence

Graph Deep Learning for Content Risk Control and APT Detection

This article presents a comprehensive overview of Tencent AI Lab's graph‑based approaches for detecting misinformation and advanced persistent threats, detailing the challenges of modeling news content and social context, the design of the Post‑User Interaction Network (PSIN), experimental results on large multi‑topic datasets, and a novel graph‑pretraining pipeline for APT detection.

APT detectionDeep LearningSocial Network Analysis

0 likes · 12 min read

Graph Deep Learning for Content Risk Control and APT Detection

Tencent Cloud Developer

Aug 23, 2022 · Artificial Intelligence

Brain-Computer Interface Competition Showcases AI-Powered Mind-Controlled Technology

The Tencent Cloud‑backed “Tencent Cloud Cup” BCI competition, part of the World Robot Contest, drew over 250 teams from 26 provinces and three countries to tackle brain‑computer tasks like spelling and emotion detection, demonstrating typing, wheelchair and robotic arm applications, with the winning Chinese university team typing 81 characters in 285 seconds and results set for 5G deployment and publication in Brain Science Advances.

AI competitionDeep LearningHuman-Computer Interaction

0 likes · 8 min read

Brain-Computer Interface Competition Showcases AI-Powered Mind-Controlled Technology

DaTaobao Tech

Aug 19, 2022 · Artificial Intelligence

SepLUT: Separable Lookup Tables for Real-time Image Enhancement

SepLUT, a new separable lookup‑table framework, splits color enhancement into a 1‑D LUT for independent adjustments and a 3‑D LUT for correlated changes, predicted by a lightweight CNN, enabling quantizable, real‑time ISP performance with state‑of‑the‑art results on the FiveK benchmark.

Computer VisionDeep LearningReal-Time

0 likes · 12 min read

SepLUT: Separable Lookup Tables for Real-time Image Enhancement

Hulu Beijing

Aug 19, 2022 · Artificial Intelligence

Disney’s M5 Model: Multi‑Modal, Multi‑Interest, Multi‑Scenario Boost for Streaming Recommendations

Disney’s Content Discovery team introduces M5, a multi‑modal, multi‑interest, multi‑scenario recall model that enhances VOD and live streaming recommendations by leveraging rich metadata, user behavior, and contextual features, outperforming baseline methods with significant hit‑ratio gains across Hulu and Disney+.

Deep LearningM5 modelRecommendation Systems

0 likes · 22 min read

Disney’s M5 Model: Multi‑Modal, Multi‑Interest, Multi‑Scenario Boost for Streaming Recommendations

FunTester

Aug 18, 2022 · Artificial Intelligence

How AI Can Automate UI Testing: Building Image‑Based Anomaly Detection

This article examines the evolution of mobile UI testing toward AI‑driven approaches, outlines the challenges of large‑scale apps, and details a practical workflow for constructing image‑based anomaly datasets, training a ResNet‑18 model, and iterating on detection performance.

AI testingComputer VisionDeep Learning

0 likes · 13 min read

How AI Can Automate UI Testing: Building Image‑Based Anomaly Detection

Model Perspective

Aug 15, 2022 · Artificial Intelligence

Understanding Recurrent Neural Networks: From Vanilla RNN to LSTM with Keras

This article introduces recurrent neural networks (RNNs) and their ability to handle sequential data, explains the limitations of vanilla RNNs, presents the LSTM architecture with its gates, and provides complete Keras code for data loading, model building, and training both vanilla RNN and LSTM models.

Deep LearningKerasLSTM

0 likes · 5 min read

Understanding Recurrent Neural Networks: From Vanilla RNN to LSTM with Keras

JD Cloud Developers

Aug 15, 2022 · Artificial Intelligence

How FCA Doubles BERT’s Inference Speed with Less Than 1% Accuracy Loss

This article explains how the Fine‑ and Coarse‑Granularity Hybrid Self‑Attention (FCA) mechanism reduces BERT’s computational cost by over 50% while keeping accuracy loss under 1%, detailing the method, experimental results, and its significance for efficient large‑scale language models.

BERTDeep LearningFCA

0 likes · 8 min read

How FCA Doubles BERT’s Inference Speed with Less Than 1% Accuracy Loss

Model Perspective

Aug 6, 2022 · Artificial Intelligence

Understanding Activation Functions in Artificial Neural Networks

This article introduces artificial neural networks, explains the role of artificial neurons and their weighted connections, and provides an overview of common activation functions—including linear, nonlinear ramp, threshold/step, and sigmoid forms—highlighting their characteristics and typical saturation values.

Deep Learningactivation functionartificial neural network

0 likes · 2 min read

Understanding Activation Functions in Artificial Neural Networks

Model Perspective

Aug 3, 2022 · Artificial Intelligence

Explore the Most Popular Machine Learning Algorithms: A Comprehensive Guide

This article provides a thorough overview of the most widely used machine learning algorithms, classifying them by learning style and problem type, and highlighting popular methods such as supervised, unsupervised, semi‑supervised, regression, instance‑based, regularization, decision‑tree, Bayesian, clustering, association rule, neural network, deep learning, dimensionality‑reduction, and ensemble techniques.

AlgorithmsDeep LearningUnsupervised Learning

0 likes · 10 min read

Explore the Most Popular Machine Learning Algorithms: A Comprehensive Guide

Huawei Cloud Developer Alliance

Aug 3, 2022 · Artificial Intelligence

Unlock AI-Powered Makeup Transfer: Copy Any Look to Your Photo in Three Steps

This article introduces an AI-powered makeup transfer technique that copies makeup from any reference image onto a user's photo, explains its advantages over traditional sticker-based apps, outlines a three‑step workflow, and provides sample Python code runnable on Huawei Cloud ModelArts.

AI makeup transferDeep LearningHuawei Cloud

0 likes · 4 min read

Unlock AI-Powered Makeup Transfer: Copy Any Look to Your Photo in Three Steps

Zuoyebang Tech Team

Jul 29, 2022 · Artificial Intelligence

Boosting Chinese‑English Code‑Switching Speech Recognition with Language ID and LM Enhancements

This report details a series of experiments on Chinese‑English mixed‑language speech recognition, introducing language‑identification loss and language‑model integration to improve acoustic modeling, reduce mixed error rates, and achieve significant gains over a baseline end‑to‑end ASR system.

Code-SwitchingDeep Learninglanguage identification

0 likes · 16 min read

Boosting Chinese‑English Code‑Switching Speech Recognition with Language ID and LM Enhancements

GuanYuan Data Tech Team

Jul 28, 2022 · Artificial Intelligence

Unlocking Reinforcement Learning: Core Concepts, Algorithms, and Real‑World Applications

This article introduces reinforcement learning by defining agents, environments, rewards, and policies, explains key concepts such as Markov Decision Processes and Bellman equations, and surveys major algorithms—including dynamic programming, Monte‑Carlo, TD learning, policy gradients, Q‑learning, DQN, and evolution strategies—while highlighting practical challenges and notable case studies like AlphaGo Zero.

Deep LearningEvolution StrategiesMDP

0 likes · 27 min read

Unlocking Reinforcement Learning: Core Concepts, Algorithms, and Real‑World Applications

ITPUB

Jul 21, 2022 · Artificial Intelligence

From Blur to Brilliance: How AI‑Powered Image Quality Assessment Transformed 58.com’s Recruitment Images

This article reviews image quality assessment fundamentals, modern CNN‑based IQA models, and their deployment at 58.com to automatically score, filter, and rank millions of recruitment photos, achieving a drop in low‑quality images from 9% to zero while boosting overall accuracy to 94.7%.

Business ApplicationCNNComputer Vision

0 likes · 19 min read

From Blur to Brilliance: How AI‑Powered Image Quality Assessment Transformed 58.com’s Recruitment Images

JD Tech

Jul 21, 2022 · Artificial Intelligence

Improving JD Retail Recommendation Advertising Ranking with Variational Feature Learning, User Interest Network Optimization, and Global Collaborative Modeling

This article presents JD's comprehensive technical solution for boosting recommendation ad ranking by addressing cold‑start, shallow user interest extraction, and insufficient global data through a variational feature learning framework, enhanced user‑interest networks, and full‑domain collaborative modeling, achieving over 1% AUC gain and notable revenue growth.

CTR predictionDeep Learninge‑commerce

0 likes · 21 min read

Improving JD Retail Recommendation Advertising Ranking with Variational Feature Learning, User Interest Network Optimization, and Global Collaborative Modeling

Huawei Cloud Developer Alliance

Jul 20, 2022 · Artificial Intelligence

AI Boosts Ship‑Sea Target Detection: Lessons from the First Innovation Competition

The inaugural Ship‑Sea Data Intelligent Application Innovation Competition, co‑hosted by Taihu Laboratory, Huawei and Wuxi authorities, showcased cutting‑edge AI techniques—such as multi‑scale training, TTA, knowledge distillation, and model pruning—to improve surface and underwater target detection for vessels, nets, buoys, and marine life, while offering transparent rankings, research funding, and a platform for advancing maritime AI.

AIDeep LearningMaritime

0 likes · 7 min read

AI Boosts Ship‑Sea Target Detection: Lessons from the First Innovation Competition

NetEase Smart Enterprise Tech+

Jul 19, 2022 · Artificial Intelligence

How NER Dominated NLPCC 2022: Techniques Behind the Winning Model

This article reviews the recent NLPCC 2022 NER competition, explains the evolution of named entity recognition, details the five major modeling paradigms, and describes the winning team’s relation‑classification approach, data‑augmentation strategy, experimental results, and its practical deployment in NetEase Cloud Commerce services.

Deep LearningNLPRelation Classification

0 likes · 13 min read

How NER Dominated NLPCC 2022: Techniques Behind the Winning Model

JD Tech

Jul 18, 2022 · Artificial Intelligence

AI-Powered Visual Defect Detection for Mobile App UI Testing: Methodology, Data Construction, Model Training, and Evaluation

This article presents an end‑to‑end AI‑driven visual testing solution for mobile applications, detailing the business pain points, data set construction, CNN‑based model design, training procedures, performance evaluation with ROC and confusion matrices, and future directions for improving defect detection accuracy.

Computer VisionDeep LearningImage Classification

0 likes · 14 min read

AI-Powered Visual Defect Detection for Mobile App UI Testing: Methodology, Data Construction, Model Training, and Evaluation

DaTaobao Tech

Jul 13, 2022 · Artificial Intelligence

MNN 2.0: A Unified Edge‑Cloud Deep Learning Framework Overview

MNN 2.0 transforms Alibaba’s lightweight deep‑learning engine into a unified edge‑cloud framework, delivering ultra‑small binaries, broad model‑format support, and aggressive CPU/GPU/DSP/NPU optimizations—including SIMD, Winograd, quantization, and sparse computation—while providing Python‑style APIs for preprocessing, inference, and on‑device training.

Deep LearningEdge ComputingMNN

0 likes · 18 min read

MNN 2.0: A Unified Edge‑Cloud Deep Learning Framework Overview

Alibaba Cloud Big Data AI Platform

Jul 12, 2022 · Artificial Intelligence

How Whale Enables Efficient Giant Model Training on Heterogeneous GPUs

The article introduces Whale, an open‑source distributed training framework that unifies multiple parallelism strategies, uses hardware‑aware load balancing to accelerate giant models like BERT‑Large and the trillion‑parameter M6 on heterogeneous GPU clusters, and details its architecture, planning, and real‑world performance gains.

Deep LearningParallelismhardware-aware scheduling

0 likes · 11 min read

How Whale Enables Efficient Giant Model Training on Heterogeneous GPUs

DataFunTalk

Jul 9, 2022 · Artificial Intelligence

User Behavior Sequence Based Transaction Anti‑Fraud Detection

This presentation explains how leveraging user behavior sequences with supervised and unsupervised deep learning models, including end‑to‑end and two‑stage architectures, improves transaction fraud detection by identifying distinct patterns of account takeover and stolen‑card activities and outlines the engineering deployment pipeline.

Deep LearningEmbeddingfraud detection

0 likes · 12 min read

User Behavior Sequence Based Transaction Anti‑Fraud Detection

Meituan Technology Team

Jul 6, 2022 · Artificial Intelligence

Engineering Practices for Large-Scale Deep Learning Models in Meituan Takeaway Advertising

The article details Meituan's engineering journey from small DNNs to hundred‑gigabyte deep learning models for food‑delivery ads, analyzing online latency and offline efficiency challenges and presenting distributed storage, CPU/GPU acceleration, OpenVINO, TensorRT, CodeGen, and data‑pipeline optimizations that dramatically improve throughput, memory usage, and sample‑building speed.

CPU accelerationDeep LearningGPU Acceleration

0 likes · 45 min read

Engineering Practices for Large-Scale Deep Learning Models in Meituan Takeaway Advertising

Alimama Tech

Jul 6, 2022 · Artificial Intelligence

How Mixed‑Curvature Graph Embeddings Boost E‑commerce Ad Retrieval

This article presents AMCAD, an adaptive mixed‑curvature graph embedding system that models heterogeneous e‑commerce search ad graphs in non‑Euclidean spaces, detailing its sample construction, three‑stage model architecture, offline and online experiments, and demonstrating significant performance gains over Euclidean baselines.

Deep Learningadvertisement retrievale‑commerce

0 likes · 13 min read

How Mixed‑Curvature Graph Embeddings Boost E‑commerce Ad Retrieval

Bilibili Tech

Jul 1, 2022 · Artificial Intelligence

Quality‑Controlled Scene‑Adaptive Video Transcoding System at Bilibili

Bilibili’s quality‑controlled scene‑adaptive transcoding system automatically splits videos into shot‑level segments, predicts optimal encoding parameters with a deep‑learning model, applies two‑pass VMAF‑targeted encoding and ROI‑aware bitrate allocation, achieving stable visual quality, 99% accuracy, and roughly 15% bitrate reduction.

Deep LearningROI encodingVMAF

0 likes · 25 min read

Quality‑Controlled Scene‑Adaptive Video Transcoding System at Bilibili

360 Quality & Efficiency

Jul 1, 2022 · Artificial Intelligence

Building an End-to-End Image Search System with Milvus and VGG

This article presents a complete image‑search solution that extracts visual features with the VGG16 model, stores them in the Milvus vector database, and provides a set of web APIs for training, querying, counting, searching, and deleting image vectors, all deployed via Docker containers.

AIDeep LearningMilvus

0 likes · 7 min read

Building an End-to-End Image Search System with Milvus and VGG

AntTech

Jun 24, 2022 · Artificial Intelligence

Hierarchical Residual Network for Multi‑Granularity Classification (HRN) – CVPR 2022 Paper Overview

This article presents a CVPR 2022 paper by Zhejiang University and Ant Group that introduces a label‑relation‑tree‑based Hierarchical Residual Network (HRN) for improving multi‑granularity image classification, detailing its motivation, architecture, composite loss design, extensive experiments on fine‑grained datasets, and practical impact on content‑security applications.

CVPR2022Computer VisionDeep Learning

0 likes · 12 min read

Hierarchical Residual Network for Multi‑Granularity Classification (HRN) – CVPR 2022 Paper Overview