Tagged articles
74 articles
Page 1 of 1
Machine Heart
Machine Heart
May 17, 2026 · Artificial Intelligence

How CASCADE Enables LLM Agents to Learn from Experience During Live Deployment

The paper introduces CASCADE, a deployment‑time learning framework that lets LLM agents continuously select and reuse past cases via a contextual‑bandit approach, achieving higher long‑term success rates across diverse online tasks without updating the base model.

CASCADECase-Based ReasoningContextual Bandit
0 likes · 10 min read
How CASCADE Enables LLM Agents to Learn from Experience During Live Deployment
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
May 2, 2026 · Artificial Intelligence

Real-World Large-Scale Test Shows Robots Learning While Deploying Outperform Baselines on Eight Tasks

The article presents the LWD (Learning While Deploying) framework, detailing its reinforcement‑learning‑driven data flywheel, the DIVL value‑evaluation and QAM policy‑optimization modules, and experimental results where a dual‑arm robot improves success rates by up to 17% and reduces cycle time by 23.75 seconds across eight real‑world tasks, surpassing strong baselines.

DIVLData FlywheelLWD
0 likes · 12 min read
Real-World Large-Scale Test Shows Robots Learning While Deploying Outperform Baselines on Eight Tasks
Machine Heart
Machine Heart
Apr 22, 2026 · Artificial Intelligence

Can LLMs Boost Reasoning Alone? Introducing SePT’s Simple Online Self‑Training

SePT (Self‑evolving Post‑Training) shows that a large language model can improve its mathematical reasoning ability by about ten percentage points using a reward‑free online self‑training loop that decouples generation temperature from standard SFT, matching or surpassing RL‑based methods without harming general performance.

LLMMathematical ReasoningOnline Learning
0 likes · 9 min read
Can LLMs Boost Reasoning Alone? Introducing SePT’s Simple Online Self‑Training
PaperAgent
PaperAgent
Apr 6, 2026 · Artificial Intelligence

Can LLMs Self‑Improve After Deployment? Inside Microsoft’s Online Experiential Learning

Microsoft’s Online Experiential Learning framework lets large language models continuously self‑evolve after deployment by extracting experience from user interactions and consolidating it into model parameters, eliminating the need for human labels, reward models, or server‑side environment access, and demonstrating scalable gains across tasks and model sizes.

AI researchLLMOnline Learning
0 likes · 9 min read
Can LLMs Self‑Improve After Deployment? Inside Microsoft’s Online Experiential Learning
Machine Heart
Machine Heart
Apr 2, 2026 · Artificial Intelligence

From Tokens to Revenue: Kuaishou’s GR4AD Pioneers Full‑Stack Generative Recommendation for Ads

GR4AD, Kuaishou’s generative recommendation system, redesigns the entire ad pipeline—from tokenizing multimodal ad material to value‑aware learning, lazy decoding, and dynamic beam search—delivering over 4 % revenue lift, higher eCPM, and sub‑100 ms latency for more than 400 million users.

AdvertisingGenerative RecommendationOnline Learning
0 likes · 17 min read
From Tokens to Revenue: Kuaishou’s GR4AD Pioneers Full‑Stack Generative Recommendation for Ads
DataFunSummit
DataFunSummit
Mar 23, 2026 · Artificial Intelligence

How to Build Long‑Term Memory for AI Agents: Foundations and Practical Techniques

This article explores the challenges and state of long‑term memory for AI agents, reviews mainstream industry solutions such as RAG, HRM, Titans and Engram, and proposes a four‑layer memory architecture with data acquisition, organization, utilization, and feedback loops to enable agents that remember and forget like humans.

AI memoryAgent ArchitectureLong‑Term Memory
0 likes · 12 min read
How to Build Long‑Term Memory for AI Agents: Foundations and Practical Techniques
Tencent Advertising Technology
Tencent Advertising Technology
Jan 22, 2026 · Artificial Intelligence

How Tencent’s Bidding Algorithms Evolved from GMPC to GRB: A Deep Dive into Generative RL for Ads

The article reviews the 2025 evolution of Tencent advertising’s bidding system—from the second‑generation GMPC control algorithm through the third‑generation MRB reinforcement‑learning model to the fourth‑generation generative RL GRB—detailing architectural upgrades, multi‑channel modeling, training pipelines, and experimental gains, and outlines the 2026 AI‑agent roadmap.

AdvertisingGenerative ModelsOnline Learning
0 likes · 15 min read
How Tencent’s Bidding Algorithms Evolved from GMPC to GRB: A Deep Dive into Generative RL for Ads
PaperAgent
PaperAgent
Jan 8, 2026 · Artificial Intelligence

How SOP Enables Scalable Online Post-Training for Real‑World Robots

The SOP (Scalable Online Post‑training) framework redesigns VLA post‑training from offline, single‑machine, sequential processing to a distributed, parallel online system, allowing robot fleets to continuously learn, share experiences, and scale intelligence while maintaining stability and generalization in complex real‑world environments.

Distributed TrainingOnline LearningRobotics
0 likes · 11 min read
How SOP Enables Scalable Online Post-Training for Real‑World Robots
DataFunSummit
DataFunSummit
Oct 9, 2025 · Artificial Intelligence

Why AI Coding Agents Still Struggle: Context Limits, Knowledge Gaps, and the Road to Human‑Like Assistants

This talk examines the core challenges facing AI coding agents—limited context windows, knowledge accumulation, and software‑engineering complexity—while outlining practical solutions such as context providing, RAG, fine‑tuning, online learning, feedback loops, and multi‑agent collaboration to move toward truly human‑like, continuously learning coding assistants.

AI CodingCoding AgentFeedback Loop
0 likes · 24 min read
Why AI Coding Agents Still Struggle: Context Limits, Knowledge Gaps, and the Road to Human‑Like Assistants
DataFunSummit
DataFunSummit
Oct 8, 2025 · Artificial Intelligence

How EasyRec Boosts Recommendation Training and Inference Performance

This article explains the EasyRec recommendation system’s training and inference architecture, detailing optimization techniques such as embedding parallelism, CPU/GPU placement, XLA and TRT fusion, online learning pipelines, network compression, and real‑world deployment results that dramatically improve throughput and latency.

AI InfrastructureEasyRecInference Optimization
0 likes · 15 min read
How EasyRec Boosts Recommendation Training and Inference Performance
Bighead's Algorithm Notes
Bighead's Algorithm Notes
Sep 6, 2025 · Artificial Intelligence

Time Series Paper Digest (Aug 23–Sep 5 2025)

It presents concise summaries of six recent arXiv papers on unsupervised domain adaptation, efficient forecasting, SHAP explanations, text‑reinforced multimodal forecasting, online prediction with feature adjustment, zero‑shot forecasting zoo, and a new anomaly‑detection metric, highlighting methods, datasets, and results.

Multimodal LearningOnline LearningSHAP
0 likes · 16 min read
Time Series Paper Digest (Aug 23–Sep 5 2025)
Data Party THU
Data Party THU
Aug 13, 2025 · Artificial Intelligence

How Dual Adaptivity Powers Universal Algorithms to Minimize Adaptive Regret

This article reviews the recent work by Zhou Zhihua’s team at Nanjing University on dual‑adaptivity universal algorithms for online convex optimization, introducing a meta‑expert framework, the UMA2 and UMA3 methods, and extending them to online composite optimization with strong adaptive‑regret guarantees.

Online Learningadaptive regretconvex optimization
0 likes · 10 min read
How Dual Adaptivity Powers Universal Algorithms to Minimize Adaptive Regret
DataFunSummit
DataFunSummit
Nov 22, 2024 · Artificial Intelligence

EasyRec Recommendation Algorithm Training and Inference Optimization

This article presents a comprehensive overview of EasyRec’s recommendation system architecture, detailing training and inference optimizations, embedding parallelism, CPU/GPU placement strategies, online learning pipelines, and network compression techniques that together improve scalability, latency, and cost efficiency.

Distributed SystemsEasyRecInference Optimization
0 likes · 15 min read
EasyRec Recommendation Algorithm Training and Inference Optimization
DataFunTalk
DataFunTalk
Oct 11, 2024 · Artificial Intelligence

E‑commerce Innovation and Data Governance: Summaries of Recent Research Topics

This article compiles concise overviews of recent e‑commerce research, covering real‑time online learning re‑ranking models, causal inference for user growth, full‑link data lineage, TikTok's data governance and attribution solutions, Volcano Engine's metric management, AI Agent applications on 1688, and XinXuan Group's live‑stream data architecture.

AIData GovernanceData Lineage
0 likes · 5 min read
E‑commerce Innovation and Data Governance: Summaries of Recent Research Topics
iQIYI Technical Product Team
iQIYI Technical Product Team
Oct 10, 2024 · Artificial Intelligence

Online Deep Learning (ODL) for Real‑Time Advertising Effectiveness: Challenges and Solutions

iQIYI’s minute‑level online deep‑learning framework overcomes stability, timeliness, compatibility, delayed feedback, catastrophic forgetting, and i.i.d. constraints through high‑availability pipelines, TensorFlow Example serialization, rapid P2P model distribution, flexible scheduling, disaster‑recovery rollbacks, PU‑loss adjustment, and knowledge‑distillation, delivering a 6.2% revenue boost.

AdvertisingCTR predictionDeep Learning
0 likes · 9 min read
Online Deep Learning (ODL) for Real‑Time Advertising Effectiveness: Challenges and Solutions
NewBeeNLP
NewBeeNLP
Sep 9, 2024 · Artificial Intelligence

Can Real‑Time Learning at Serving Time Transform Recommendation Re‑ranking?

This article introduces LAST, a novel online learning approach that updates recommendation models instantly at serving time, addressing real‑time learning challenges, re‑ranking complexities, and demonstrating superior offline and online performance in industrial e‑commerce scenarios.

AILASTOnline Learning
0 likes · 12 min read
Can Real‑Time Learning at Serving Time Transform Recommendation Re‑ranking?
Sohu Tech Products
Sohu Tech Products
Aug 28, 2024 · Artificial Intelligence

EasyRec Recommendation Algorithm Training and Inference Optimization

EasyRec, Alibaba Cloud’s modular recommendation framework, unifies configurable data, embedding, dense, and output layers on MaxCompute, EMR, and DLC, and speeds training with deduplication, EmbeddingParallel sharding, lock‑free hash tables, GPU embeddings, and AMX BF16, while inference benefits from operator fusion, low‑precision AVX/AMX kernels, compact caches, batch merging, and network compression, enabling real‑time online learning and delivering higher recommendation quality at lower cost in e‑commerce.

Alibaba CloudEasyRecInference Optimization
0 likes · 14 min read
EasyRec Recommendation Algorithm Training and Inference Optimization
DataFunTalk
DataFunTalk
Aug 25, 2024 · Artificial Intelligence

Learning at Serving Time (LAST): An Online Learning Approach for Real‑Time Re‑ranking in Recommendation Systems

This article introduces LAST, a novel online learning method that updates ranking models instantly at serving time without waiting for user feedback, addressing the latency and stability challenges of real‑time re‑ranking in industrial recommendation pipelines and demonstrating its superiority through offline and online experiments.

Online LearningReal-Timemachine learning
0 likes · 11 min read
Learning at Serving Time (LAST): An Online Learning Approach for Real‑Time Re‑ranking in Recommendation Systems
JD Tech
JD Tech
Jun 23, 2024 · Artificial Intelligence

Applying Large Models to Recommendation Systems: Strategies, Challenges, and E‑commerce Case Study

This article examines how large pre‑trained models such as GPT‑4 and BERT are integrated into modern recommendation systems, detailing their advantages, implementation strategies, real‑world e‑commerce case studies, and the technical and privacy challenges that must be addressed for effective deployment.

Artificial IntelligenceOnline Learninglarge models
0 likes · 14 min read
Applying Large Models to Recommendation Systems: Strategies, Challenges, and E‑commerce Case Study
Alipay Experience Technology
Alipay Experience Technology
May 9, 2024 · Artificial Intelligence

How Alipay Boosted Ad CTR and CPM with Cold‑Start Fixes, Knowledge Transfer, and Real‑Time Learning

This article details Alipay's advertising algorithm upgrades—including sample‑enhanced cold‑start mitigation, cross‑scene and user‑segmented knowledge transfer, and real‑time feature and online‑learning optimizations—that collectively lifted CTR, CPM, and overall business revenue.

AdvertisingCTR optimizationKnowledge Transfer
0 likes · 18 min read
How Alipay Boosted Ad CTR and CPM with Cold‑Start Fixes, Knowledge Transfer, and Real‑Time Learning
DataFunSummit
DataFunSummit
Nov 27, 2023 · Artificial Intelligence

Online Learning with Alink Model Flow: From Fundamentals to Model Flow 1.0 and 2.0

This article introduces Alibaba's Alink platform and its online learning capabilities, discusses common challenges in machine‑learning pipelines, explains Alink’s algorithm‑to‑application connection, various computation modes, usage methods, and details the evolution from Model Flow 1.0 to the more versatile Model Flow 2.0, including pipeline integration, incremental training, and embedding prediction services.

AlinkFlinkOnline Learning
0 likes · 9 min read
Online Learning with Alink Model Flow: From Fundamentals to Model Flow 1.0 and 2.0
DataFunSummit
DataFunSummit
Nov 21, 2023 · Artificial Intelligence

Automatic Hyperparameter Tuning in Tencent Recommendation System (TRS): Techniques, Evolution, and Practice

This article presents an in‑depth overview of Tencent's TRS automatic hyperparameter tuning, covering background, challenges, the evolution from Bayesian optimization to evolution strategies and reinforcement learning, a systematic platform solution, real‑world deployment results, and a Q&A session.

Bayesian OptimizationEvolution StrategiesOnline Learning
0 likes · 20 min read
Automatic Hyperparameter Tuning in Tencent Recommendation System (TRS): Techniques, Evolution, and Practice
Alimama Tech
Alimama Tech
Sep 20, 2023 · Artificial Intelligence

Exploring Model Dynamics for Accumulative Poisoning Detection

The paper, a joint effort by Alibaba Mama and HKBU TMLR, shows that monitoring model dynamics—specifically a newly defined memorization‑discrepancy metric—can reveal hidden accumulative poisoning attacks in online advertising streams, and introduces a discrepancy‑aware correction algorithm that consistently outperforms existing defenses across benchmark datasets.

Machine Learning SecurityOnline Learningdefense algorithms
0 likes · 13 min read
Exploring Model Dynamics for Accumulative Poisoning Detection
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Dec 22, 2022 · Artificial Intelligence

How DeepRec Supercharges Weibo’s Hot Recommendation Engine

This article explains the architecture of Weibo's popular recommendation system, the role of the weidl online learning platform, and how DeepRec’s performance optimizations—such as oneDNN operator acceleration, cost‑aware scheduling, and adaptive memory allocation—significantly improve training speed, inference latency, and overall service throughput.

AIDeepRecOnline Learning
0 likes · 15 min read
How DeepRec Supercharges Weibo’s Hot Recommendation Engine
DataFunTalk
DataFunTalk
Nov 15, 2022 · Artificial Intelligence

Flink ML: Iterative Execution Engine, Design, API, and Efficient Algorithm Library

This article introduces Flink ML, a DataStream‑based iterative engine and machine‑learning algorithm library, covering its overview, iterative execution engine design and API, performance comparisons with Spark ML, online logistic regression and K‑Means demos, and future development roadmap.

FlinkIterative EngineKMeans
0 likes · 22 min read
Flink ML: Iterative Execution Engine, Design, API, and Efficient Algorithm Library
Alimama Tech
Alimama Tech
Aug 24, 2022 · Artificial Intelligence

Adversarial Gradient Driven Exploration for Deep Click-Through Rate Prediction

The authors introduce AGE, an adversarial‑gradient‑driven exploration framework that injects uncertainty‑scaled perturbations into ad embeddings to approximate the downstream learning effect, combines Monte‑Carlo dropout uncertainty, a dynamic gating unit, and achieves up to 15 % offline gains and 6 % online CTR improvement over strong baselines.

Exploration-ExploitationOnline Learningadversarial gradient
0 likes · 14 min read
Adversarial Gradient Driven Exploration for Deep Click-Through Rate Prediction
Alimama Tech
Alimama Tech
Apr 27, 2022 · Artificial Intelligence

DEFUSE and Bi-DEFUSE: Unbiased Delayed‑Feedback Modeling for CVR Prediction

The paper introduces DEFUSE and its multi‑task extension Bi‑DEFUSE, unbiased delayed‑feedback CVR models that correct label bias via rigorous importance‑sampling and a latent fake‑negative variable, achieving superior offline performance and a 2 % CVR lift in online deployment compared with existing industry baselines.

Bi-DEFUSECVRDEFUSE
0 likes · 25 min read
DEFUSE and Bi-DEFUSE: Unbiased Delayed‑Feedback Modeling for CVR Prediction
Tencent Cloud Developer
Tencent Cloud Developer
Apr 7, 2022 · Artificial Intelligence

Re‑ranking in Recommendation Systems: Architecture, Techniques, and Efficiency

The article surveys the re‑ranking stage of modern recommendation pipelines, detailing its architecture after recall and precise ranking, and examining how shuffling and diversity improve user experience, while multi‑task fusion, context‑aware learning‑to‑rank, real‑time online learning, and traffic‑control strategies balance accuracy, efficiency, and business responsiveness.

DiversityOnline LearningReal-Time
0 likes · 15 min read
Re‑ranking in Recommendation Systems: Architecture, Techniques, and Efficiency
DaTaobao Tech
DaTaobao Tech
Mar 22, 2022 · Artificial Intelligence

Online Learning for Real‑Time Ranking in Alibaba's Home‑Decor Channel

The article details Alibaba’s end‑to‑end online‑learning pipeline for real‑time ranking in the Taobao home‑decor channel, covering UT log parsing, full‑feature extraction, ODL sample creation, xDeepCTR model training, and deployment, which yielded up to 7.8% CTR improvement and demonstrates the value of rapid model adaptation.

AlibabaModel TrainingOnline Learning
0 likes · 15 min read
Online Learning for Real‑Time Ranking in Alibaba's Home‑Decor Channel
IEG Growth Platform Technology Team
IEG Growth Platform Technology Team
Dec 20, 2021 · Artificial Intelligence

Comprehensive Guide to pCTR Modeling, Optimization, and Online Learning in Real‑Time Advertising Systems

This article presents a three‑part technical guide covering the fundamentals of computational advertising and real‑time bidding, detailed offline pCTR model training pipelines with feature engineering, calibration and model structure improvements, and advanced online learning techniques such as parameter freezing, sample replay and knowledge distillation, all aimed at boosting CTR performance and reducing bias in large‑scale ad platforms.

AdvertisingCTR predictionOnline Learning
0 likes · 37 min read
Comprehensive Guide to pCTR Modeling, Optimization, and Online Learning in Real‑Time Advertising Systems
DataFunTalk
DataFunTalk
Dec 17, 2021 · Artificial Intelligence

Applying Reinforcement Learning to Solve Cold‑Start Problems in 58.com Job Recruitment

This talk explains how 58.com’s massive blue‑collar recruitment platform uses reinforcement‑learning techniques—including multi‑armed bandits, contextual MAB, and linear UCB—to address cold‑start and interest‑divergence challenges, describes the system architecture, offline evaluation, online deployment, and reports an 8% uplift in new‑user conversion.

Online LearningReinforcement Learningcold start
0 likes · 26 min read
Applying Reinforcement Learning to Solve Cold‑Start Problems in 58.com Job Recruitment
DataFunTalk
DataFunTalk
Mar 1, 2021 · Artificial Intelligence

Online Learning and Real‑Time Model Updating in JD Retail Search Using Flink

The article describes JD's end‑to‑end online learning pipeline for retail search, covering the background, system architecture, real‑time feature collection, sample stitching, Flink‑based incremental training, parameter updates, and full‑link monitoring to achieve low‑latency, high‑accuracy model serving.

FlinkModel ServingOnline Learning
0 likes · 9 min read
Online Learning and Real‑Time Model Updating in JD Retail Search Using Flink
DataFunTalk
DataFunTalk
Feb 21, 2021 · Artificial Intelligence

Advances in Pre‑Ranking for Large‑Scale Advertising: The COLD Framework and Its Technical Evolution

This article reviews the development history, technical routes, and recent breakthroughs of pre‑ranking (coarse ranking) in large‑scale advertising systems, focusing on Alibaba's COLD (Computing‑power‑cost‑aware Online and Lightweight Deep) framework, its model design, engineering optimizations, experimental results, and future research directions.

AdvertisingCOLDOnline Learning
0 likes · 20 min read
Advances in Pre‑Ranking for Large‑Scale Advertising: The COLD Framework and Its Technical Evolution
DataFunTalk
DataFunTalk
Nov 19, 2020 · Artificial Intelligence

58 Tongzhen Home Feed Recommendation System: Architecture, Features, and Evolution

This talk details the design, data pipeline, feature engineering, model evolution, and operational insights of the 58 Tongzhen home feed recommendation system, covering its architecture, localization strategies, recall and ranking models, online learning, and future directions for AI-driven content delivery in the down‑market.

AIOnline Learningdown‑market
0 likes · 34 min read
58 Tongzhen Home Feed Recommendation System: Architecture, Features, and Evolution
DataFunSummit
DataFunSummit
Nov 8, 2020 · Artificial Intelligence

Architecture and Evolution of 58 Tongzhen Local Feed Recommendation System

This article details the design, data pipeline, feature engineering, model development, and iterative optimization of the 58 Tongzhen local feed recommendation system, covering business background, user profiling, recall strategies, ranking models such as XGBoost, XDeepFM, and online learning, and future directions.

AIOnline Learningfeature engineering
0 likes · 33 min read
Architecture and Evolution of 58 Tongzhen Local Feed Recommendation System
Tencent Cloud Developer
Tencent Cloud Developer
Sep 30, 2020 · Artificial Intelligence

Tencent Kankan Information Feed: Architecture, Challenges, and Optimizations

Peng Mo’s talk details Tencent Kankan’s billion‑user feed architecture—layered data, recall, ranking, and exposure control—while addressing real‑time feature generation, massive concurrency, memory‑intensive caching, and fast indexing, and explains solutions such as multi‑level caches, online minute‑level model updates, Redis bloom‑filter exposure filtering, a lock‑free hash‑plus‑linked‑list index, and distributed optimizations that halve latency to under 500 ms and support auto‑scaling and cold‑start handling.

Online Learninglarge-scale architecturereal-time features
0 likes · 15 min read
Tencent Kankan Information Feed: Architecture, Challenges, and Optimizations
DataFunTalk
DataFunTalk
Aug 20, 2020 · Artificial Intelligence

Weibo Recommendation Algorithm Practice and Machine Learning Platform Evolution

This article shares Weibo’s experience in building and evolving its recommendation algorithms, covering the recommendation scenario, machine learning workflow, feature engineering, model upgrades, large‑scale challenges, deployment via the Weiflow platform, and the capabilities of its machine‑learning infrastructure.

Online LearningWeibofeature engineering
0 likes · 14 min read
Weibo Recommendation Algorithm Practice and Machine Learning Platform Evolution
HomeTech
HomeTech
Mar 18, 2020 · Artificial Intelligence

Automobile Home Recommendation System Architecture and Ranking Models

This article presents a comprehensive overview of the Automobile Home recommendation system, detailing its objectives, architecture, various ranking models from LR to DeepFM, online learning mechanisms, service APIs, feature engineering pipelines, model training platforms, debugging tools, and future optimization directions.

AB testingAutoMLOnline Learning
0 likes · 18 min read
Automobile Home Recommendation System Architecture and Ranking Models
Qunar Tech Salon
Qunar Tech Salon
Mar 13, 2020 · Artificial Intelligence

The Evolution of AutoHome Recommendation System Ranking Algorithms

This article details the architecture, model evolution, feature processing, online learning, and future optimization plans of AutoHome's recommendation system, covering stages from resource collection to ranking, various models such as LR, XGBoost, FM, DeepFM, and operational practices like AB testing and debugging.

Online Learningfeature engineeringranking algorithm
0 likes · 18 min read
The Evolution of AutoHome Recommendation System Ranking Algorithms
DataFunTalk
DataFunTalk
Feb 28, 2020 · Artificial Intelligence

Evolution of Autohome's Recommendation System Ranking Algorithms

The article details the five‑year evolution of Autohome's recommendation system, covering its overall architecture, the progression of ranking models from LR to DeepFM and online learning, feature engineering pipelines, ranking service APIs, AB testing practices, and future optimization directions.

AB testingAIOnline Learning
0 likes · 20 min read
Evolution of Autohome's Recommendation System Ranking Algorithms
Qunar Tech Salon
Qunar Tech Salon
Feb 27, 2020 · Artificial Intelligence

iQIYI Dual‑DNN Ranking Model with Online Knowledge Distillation

This article describes iQIYI’s dual‑DNN ranking architecture that combines a high‑capacity teacher network with a lightweight student network via online knowledge distillation, addressing the trade‑off between model effectiveness and inference efficiency in large‑scale recommendation systems.

CTR predictionOnline LearningRanking Models
0 likes · 12 min read
iQIYI Dual‑DNN Ranking Model with Online Knowledge Distillation
Tencent Cloud Developer
Tencent Cloud Developer
Feb 25, 2020 · Big Data

Building Data Middle Platforms: Lean Data Innovation Systems

During the pandemic, Tencent Cloud University and its Most Valuable Experts launched a free online course, led by ThoughtWorks' Shi Kai, to teach enterprises how to build data middle platforms with a lean data innovation system, offering live sessions, a weekly schedule, and incentives such as USB drives and Bluetooth speakers.

Data Middle PlatformOnline Learningenterprise intelligence
0 likes · 4 min read
Building Data Middle Platforms: Lean Data Innovation Systems
DataFunTalk
DataFunTalk
Feb 22, 2020 · Artificial Intelligence

Double DNN Ranking Model with Online Knowledge Distillation for Real‑Time Recommendation at iQIYI

The article introduces iQIYI's double‑DNN ranking architecture that combines a high‑performance teacher network with a lightweight student network through online knowledge distillation, detailing the evolution of deep learning‑based ranking models, the motivation for model upgrades, training pipelines, and experimental results that demonstrate significant latency reduction and ROI improvement.

Deep LearningOnline LearningRanking Models
0 likes · 13 min read
Double DNN Ranking Model with Online Knowledge Distillation for Real‑Time Recommendation at iQIYI
iQIYI Technical Product Team
iQIYI Technical Product Team
Feb 21, 2020 · Artificial Intelligence

Dual DNN Ranking Model with Online Knowledge Distillation for Recommender Systems

iQIYI’s dual‑DNN ranking model uses an online teacher‑student knowledge‑distillation framework where a complex teacher DNN shares representations with a lightweight student DNN, enabling end‑to‑end training, large‑scale feature crossing, and substantially higher recommendation accuracy while cutting inference latency and model size.

CTR predictionOnline Learningdual DNN
0 likes · 15 min read
Dual DNN Ranking Model with Online Knowledge Distillation for Recommender Systems
DataFunTalk
DataFunTalk
Jan 21, 2020 · Artificial Intelligence

How to Enhance Real-Time Updating of Recommendation System Models

The article examines various techniques—including full, incremental, online, and local updates—as well as client‑side embedding refreshes to improve the real‑time performance of recommendation system models, balancing freshness with global optimality.

AIIncremental LearningOnline Learning
0 likes · 9 min read
How to Enhance Real-Time Updating of Recommendation System Models
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 5, 2019 · Artificial Intelligence

How Alibaba’s Alink Empowers Real‑Time Machine Learning on Flink

Alink, Alibaba’s open‑source machine‑learning platform built on Apache Flink, offers a rich library of batch and streaming algorithms, a Python API, iterative computation optimizations, and real‑world case studies, positioning it as a powerful AI solution for large‑scale, low‑latency data processing.

AIAlinkFlink
0 likes · 13 min read
How Alibaba’s Alink Empowers Real‑Time Machine Learning on Flink
iQIYI Technical Product Team
iQIYI Technical Product Team
Oct 31, 2019 · Artificial Intelligence

Online Learning for Large‑Scale DNN Ranking Models in iQIYI Feed Recommendation

iQIYI’s feed recommendation system adopts an online‑learning framework that continuously trains a massive Wide‑and‑Deep DNN on billions of streaming samples, handling dynamic user interests, OOV embeddings, delayed labels, and non‑convex optimization, enabling hourly model refreshes and delivering up to 3.8 % higher consumption versus offline baselines.

DNNOnline LearningReal-time Training
0 likes · 17 min read
Online Learning for Large‑Scale DNN Ranking Models in iQIYI Feed Recommendation
iQIYI Technical Product Team
iQIYI Technical Product Team
Oct 18, 2019 · Artificial Intelligence

iQIYI Effect Advertising: Algorithm Architecture, Click‑Conversion Estimation, and Smart Bidding

The talk details iQIYI’s effect advertising system, describing its feed and in‑frame architecture, the oCPX billing model, multi‑stage recall‑ranking pipelines, real‑time feature engineering, online FM and Wide&Deep models for sparse conversion prediction, and a smart‑bidding mechanism that balances cost, quality, and volume.

Online Learningadvertising algorithmfeature engineering
0 likes · 11 min read
iQIYI Effect Advertising: Algorithm Architecture, Click‑Conversion Estimation, and Smart Bidding
DataFunTalk
DataFunTalk
Sep 25, 2019 · Artificial Intelligence

Practical Exploration of OCPC Advertising Algorithm at Phoenix New Media

This article presents a comprehensive overview of the OCPC (Optimized Cost Per Click) advertising algorithm deployed by Phoenix New Media, detailing its background, problem definition, two‑price mechanism, smart bidding, CVR estimation techniques, online learning architecture, challenges such as data sparsity and conversion delay, and future research directions.

CVR estimationOCPCOnline Learning
0 likes · 15 min read
Practical Exploration of OCPC Advertising Algorithm at Phoenix New Media
DataFunTalk
DataFunTalk
Feb 27, 2019 · Artificial Intelligence

Human‑Interactive Machine Translation: Research, Techniques, and Productization

This article reviews the current state of machine translation, explores the challenges of ambiguity, quality, and domain specificity, and presents human‑in‑the‑loop translation techniques—including attention‑enhanced models, transformer architectures, and online learning—while discussing practical productization and deployment considerations.

AI productizationHuman-in-the-LoopOnline Learning
0 likes · 16 min read
Human‑Interactive Machine Translation: Research, Techniques, and Productization
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 28, 2018 · Artificial Intelligence

Elastic Feature Scaling: Boosting Alibaba’s Online Recommendation CTR by 4%

This article describes how Ant Financial’s AI team redesigned TensorFlow to enable elastic feature scaling, introduced a Group‑Lasso optimizer and streaming frequency filtering, compressed models by 90%, and achieved significant CTR and efficiency gains in Alipay’s online recommendation system.

Online LearningTensorFlowfeature scaling
0 likes · 20 min read
Elastic Feature Scaling: Boosting Alibaba’s Online Recommendation CTR by 4%
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 21, 2018 · Artificial Intelligence

X-DeepLearning: Alibaba’s Open‑Source Framework for Large‑Scale Sparse Deep Learning

Alibaba's X‑DeepLearning (XDL) is an open‑source deep‑learning framework optimized for high‑dimensional sparse data, offering industrial‑grade distributed training, built‑in CTR/recommendation algorithms, structured compression, and online learning capabilities, with benchmark results demonstrating superior scalability and performance.

CTR predictionDeep LearningDistributed Training
0 likes · 18 min read
X-DeepLearning: Alibaba’s Open‑Source Framework for Large‑Scale Sparse Deep Learning
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 4, 2018 · Artificial Intelligence

Unlocking Elastic TensorFlow: Boosting Online Recommendation CTR by 30%

This article presents a comprehensive set of innovations—including elastic feature scaling, a Group Lasso optimizer, streaming frequency filtering, and graph‑cut model compression—that transform TensorFlow for large‑scale online learning, delivering significant CTR gains and up to 90% model size reduction in Alibaba's recommendation systems.

Online Learningfeature engineeringgroup lasso
0 likes · 19 min read
Unlocking Elastic TensorFlow: Boosting Online Recommendation CTR by 30%
Alibaba Cloud Developer
Alibaba Cloud Developer
Nov 16, 2018 · Artificial Intelligence

How Alibaba’s Search Engine Evolved Over a Decade of Double‑11: From Offline Models to Real‑Time AI

This article traces the ten‑year evolution of Alibaba’s e‑commerce search system, detailing four major stages—from the early Pora streaming engine to dual‑link real‑time architectures, the integration of deep and reinforcement learning, and the shift to large‑scale online deep learning—while highlighting the technical drivers and future AI‑enabled search vision.

Online LearningReinforcement Learninge‑commerce
0 likes · 16 min read
How Alibaba’s Search Engine Evolved Over a Decade of Double‑11: From Offline Models to Real‑Time AI
Meituan Technology Team
Meituan Technology Team
Nov 15, 2018 · Artificial Intelligence

Reinforcement Learning for Meituan's "Guess You Like" Recommendation Ranking

Meituan enhanced its homepage “Guess You Like” recommendation slot by modeling user‑item interactions as a Markov Decision Process and applying an improved DDPG reinforcement‑learning agent that adjusts the ranking trade‑off parameter, uses advantage‑based Q decomposition, shares actor‑critic weights, and runs in a real‑time TensorFlow pipeline, delivering consistent lifts in click‑through, dwell time, and depth.

DDPGMDP ModelingOnline Learning
0 likes · 21 min read
Reinforcement Learning for Meituan's "Guess You Like" Recommendation Ranking
DataFunTalk
DataFunTalk
Nov 7, 2018 · Artificial Intelligence

Evolution of Ele.me Recommendation Algorithms and Online Learning Practice

This article outlines the background of Ele.me's recommendation business, details the evolution of its recommendation algorithms from rule‑based models to deep learning and online learning, and explains the practical implementation of real‑time data pipelines, feature engineering, model training, and deployment.

Ele.meOnline Learningmachine learning
0 likes · 13 min read
Evolution of Ele.me Recommendation Algorithms and Online Learning Practice
DataFunTalk
DataFunTalk
Oct 12, 2018 · Artificial Intelligence

Market Mechanisms and Control Measures in Ele.me Food Delivery Recommendation Algorithms

The article presents a comprehensive overview of Ele.me's food‑delivery recommendation system, detailing its business model, platform goals, unique challenges, market‑driven efficiency mechanisms, control strategies, system architecture, model evolution, and online‑learning techniques used to balance short‑term performance with long‑term ecosystem health.

AIEle.meOnline Learning
0 likes · 15 min read
Market Mechanisms and Control Measures in Ele.me Food Delivery Recommendation Algorithms
Meitu Technology
Meitu Technology
Jun 25, 2018 · Artificial Intelligence

Meitu's Personalized Recommendation System: Architecture, Features, and Optimization Strategies

Meitu’s personalized recommendation platform for the Meipai app combines offline feature engineering, near‑real‑time streaming, and online serving to recall, estimate, and rank billions of short videos using multi‑modal content features, user profiling, online learning, cold‑start bandit strategies, and multi‑objective diversity optimization, delivering timely, diverse feeds across live, homepage, and video‑detail scenarios.

Online Learningcold startcontent diversity
0 likes · 17 min read
Meitu's Personalized Recommendation System: Architecture, Features, and Optimization Strategies
AntTech
AntTech
Jun 14, 2018 · Artificial Intelligence

A Local Online Learning Approach for Non-linear Data (SCW-LOL)

This paper introduces the SCW-LOL algorithm, a local online learning method based on Soft Confidence Weighted that extends a global model with multiple local classifiers, uses online K‑Means for sample assignment, provides theoretical error bounds, and demonstrates superior performance on ten benchmark datasets, especially for multi‑class classification.

Online LearningSCW algorithmdata mining
0 likes · 9 min read
A Local Online Learning Approach for Non-linear Data (SCW-LOL)
Alibaba Cloud Developer
Alibaba Cloud Developer
Apr 26, 2018 · Artificial Intelligence

How TensorFlowRS Supercharges Large‑Scale Search & Recommendation with 10×‑100× Speedups

This article describes TensorFlowRS, an Alibaba‑built extension of TensorFlow that tackles the massive compute and sparse‑feature challenges of search, advertising and recommendation by redesigning the parameter server, adding fail‑over, gradient‑compensation, online‑learning support, advanced training modes and visualisation, achieving up to 100× training speedup and improved model quality.

Distributed TrainingOnline LearningParameter Server
0 likes · 16 min read
How TensorFlowRS Supercharges Large‑Scale Search & Recommendation with 10×‑100× Speedups
Meituan Technology Team
Meituan Technology Team
Nov 23, 2017 · Artificial Intelligence

O2O Machine Learning Applications Seminar

The O2O Machine Learning Applications Seminar, featuring experts from Meituan‑Dianping and Alibaba, explores real‑world ML implementations for online‑to‑offline services, including online learning for search, Alibaba’s Ali Xiaomi intelligent assistant, deep‑learning‑driven recommendation systems, and advertising algorithms such as CTR and CVR optimization, sharing practical insights and best practices.

Artificial IntelligenceDeep LearningO2O
0 likes · 5 min read
O2O Machine Learning Applications Seminar
Tencent Advertising Technology
Tencent Advertising Technology
Jun 15, 2017 · Artificial Intelligence

Tencent Social Ads Data Mining Expert Q&A: Feature Engineering, Modeling, and Competition Insights

In a Q&A session, a Tencent social ads data mining expert addressed competition participants' questions on data delays, full‑set versus sliding‑window statistics, dataset authenticity, Bayesian smoothing, feature selection, handling missing values, large‑scale training, feature interactions, model stacking, online mini‑batch training, and provided reference resources.

Online LearningVowpal Wabbitcompetition
0 likes · 11 min read
Tencent Social Ads Data Mining Expert Q&A: Feature Engineering, Modeling, and Competition Insights
Tencent Advertising Technology
Tencent Advertising Technology
Jun 14, 2017 · Big Data

Techniques for Handling Large-Scale Competition Data: Sampling, Feature Processing, and External‑Memory Learning

This article presents practical strategies for processing massive competition datasets—including down‑sampling, streaming feature extraction, external‑memory learning, and tool recommendations—to overcome memory constraints and improve model building efficiency.

Online Learningdata samplingexternal memory learning
0 likes · 4 min read
Techniques for Handling Large-Scale Competition Data: Sampling, Feature Processing, and External‑Memory Learning
21CTO
21CTO
Apr 19, 2017 · Artificial Intelligence

How Alibaba Transformed E‑Commerce Search with Real‑Time AI and Reinforcement Learning

Alibaba’s e‑commerce search engine evolved over three years from offline batch models to a sophisticated AI-driven system that integrates real‑time feature ingestion, online learning, deep and reinforcement learning, enabling dynamic personalization and decision‑making that boosts conversion during high‑traffic events like Double 11.

AIOnline LearningReal‑Time Computing
0 likes · 15 min read
How Alibaba Transformed E‑Commerce Search with Real‑Time AI and Reinforcement Learning
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 3, 2017 · Artificial Intelligence

How DNN Breaks Feature Scaling Limits in Search Ranking

This article examines the challenges of high‑dimensional sparse features in search ranking, explains why traditional linear models struggle, and describes how deep neural networks with novel encoding schemes and online updates can dramatically improve CTR prediction and real‑time performance.

CTR predictionDNNDeep Learning
0 likes · 12 min read
How DNN Breaks Feature Scaling Limits in Search Ranking
Qunar Tech Salon
Qunar Tech Salon
Feb 24, 2016 · Artificial Intelligence

Overview and Architecture of Pora: A Real‑Time Personalization Analytics Platform

The article introduces Pora, a real‑time offline‑realtime analytics system for personalized search that combines high‑throughput stream processing, low‑latency computation, online learning algorithms, and a modular architecture to support continuous 24/7 operation and large‑scale performance optimizations.

AIOnline LearningReal-time analytics
0 likes · 6 min read
Overview and Architecture of Pora: A Real‑Time Personalization Analytics Platform
21CTO
21CTO
Jan 16, 2016 · Artificial Intelligence

How Alibaba’s Dual-Path Real-Time Computing Powers Search During Double 11

This article explains Alibaba’s dual‑link real‑time computing framework, detailing its micro‑ and macro‑level pipelines, key components such as Pora, iGraph and SP, online learning architectures, pointwise and pairwise ranking models, bandit‑based strategy optimization, PID‑controlled traffic balancing, and the impressive performance gains achieved during the Double 11 shopping festival.

AlibabaOnline LearningPID control
0 likes · 22 min read
How Alibaba’s Dual-Path Real-Time Computing Powers Search During Double 11
Architect
Architect
Jan 16, 2016 · Artificial Intelligence

Real‑Time Computing System for Alibaba Search: Architecture, Online Learning, and Strategy Optimization

The article presents Alibaba's real‑time computing platform for search, detailing its micro‑ and macro‑level architectures, online learning frameworks, point‑wise and pair‑wise ranking models, bandit‑based strategy optimization, and PID‑controlled traffic regulation, and reports significant performance gains during the Double‑11 shopping festival.

Online LearningPID controlReal‑Time Computing
0 likes · 22 min read
Real‑Time Computing System for Alibaba Search: Architecture, Online Learning, and Strategy Optimization
21CTO
21CTO
Sep 28, 2015 · Artificial Intelligence

How Meituan Built a Scalable AI‑Powered Recommendation Engine

This article details Meituan's end‑to‑end recommendation system, covering its four‑layer architecture, data sources, candidate‑generation strategies, fusion methods, and both linear and non‑linear re‑ranking models, while highlighting practical optimizations like AB testing and online learning.

MeituanOnline Learningdata pipelines
0 likes · 15 min read
How Meituan Built a Scalable AI‑Powered Recommendation Engine
21CTO
21CTO
Sep 8, 2015 · Artificial Intelligence

Inside Meituan’s Recommendation Engine: From Data to Real‑Time Ranking

This article outlines Meituan’s end‑to‑end recommendation system, describing its data layer, candidate‑generation triggers, fusion strategies, and machine‑learning‑based ranking models—including collaborative filtering, location‑based, query‑based, graph‑based methods, and both linear and non‑linear models—while highlighting practical optimizations such as AB testing, real‑time behavior handling, and fallback strategies.

MeituanOnline Learningcandidate generation
0 likes · 19 min read
Inside Meituan’s Recommendation Engine: From Data to Real‑Time Ranking
Baidu Tech Salon
Baidu Tech Salon
Mar 21, 2014 · Artificial Intelligence

Baidu's Large-Scale Machine Learning Technology: Enabling Trillion-Feature Processing with Minute-Level Model Updates

Baidu's Big Data Machine Learning team, led by Xia Fen, unveiled a suite of five novel algorithms that together allow trillion‑scale feature processing, minute‑level model updates, and up to thousand‑fold efficiency gains in training and inference, dramatically surpassing existing solutions such as Google's billion‑feature systems.

BaiduCTR predictionDeep Learning
0 likes · 6 min read
Baidu's Large-Scale Machine Learning Technology: Enabling Trillion-Feature Processing with Minute-Level Model Updates