Tagged articles
455 articles
Page 1 of 5
DataFunSummit
DataFunSummit
May 19, 2026 · Artificial Intelligence

Designing Next‑Gen Recommendation and Search with Agentic RAG Architecture

The article reviews cutting‑edge AI techniques for high‑concurrency, multimodal recommendation and search, detailing Alibaba Cloud's Agentic RAG evolution, Huawei Noah's LLM‑enhanced recommendation pipeline, and Baidu's generative ranking model GRAB, each with architecture diagrams, performance metrics, and real‑world deployment insights.

AI AgentsAgentic RAGGenerative Ranking
0 likes · 6 min read
Designing Next‑Gen Recommendation and Search with Agentic RAG Architecture
DataFunSummit
DataFunSummit
May 17, 2026 · Artificial Intelligence

How Agentic Architecture Powers Next‑Generation Recommendation and Search Systems

The article reviews cutting‑edge AI search and recommendation techniques—including Alibaba Cloud's Agentic RAG, Huawei Noah's LLM‑enhanced recommender, Baidu's generative ranking model GRAB, and Elasticsearch‑based vector RAG—detailing their challenges, architectural evolutions, performance gains, and real‑world deployment results.

AI searchAgentic RAGElasticsearch
0 likes · 6 min read
How Agentic Architecture Powers Next‑Generation Recommendation and Search Systems
DataFunSummit
DataFunSummit
May 8, 2026 · Artificial Intelligence

Agent Architecture in Action: Building Next‑Gen Recommendation and Search Systems

This article reviews cutting‑edge AI search and recommendation technologies, covering Alibaba Cloud's Agentic RAG architecture, Huawei Noah's LLM‑enhanced recommendation pipeline, and Baidu's generative ranking model GRAB, while detailing their design challenges, multi‑modal retrieval strategies, performance gains, and real‑world deployment results.

AI searchAgentic RAGGenerative Ranking
0 likes · 6 min read
Agent Architecture in Action: Building Next‑Gen Recommendation and Search Systems
DataFunSummit
DataFunSummit
May 5, 2026 · Artificial Intelligence

How Huawei Noah’s KAR Project Leverages LLMs to Advance Recommendation Systems

The article reviews the evolution of recommendation systems from deep learning to large language models, analyzes core challenges such as noisy implicit feedback and limited semantic understanding, and details Huawei Noah’s KAR solution that uses factorized prompting, multi‑expert adapters, and AI‑Agent architectures to achieve a 1.5% AUC lift and validated online A/B test results.

AI AgentAUCHuawei
0 likes · 5 min read
How Huawei Noah’s KAR Project Leverages LLMs to Advance Recommendation Systems
DataFunTalk
DataFunTalk
May 5, 2026 · Artificial Intelligence

Agent Architecture in Action: Building Next‑Gen Recommendation and Search Systems

This article reviews cutting‑edge AI search and recommendation techniques—including Alibaba Cloud's Agentic RAG, Huawei Noah's LLM‑enhanced recommendation pipeline, and Baidu's generative ranking model GRAB—detailing their architectural evolution, multimodal retrieval strategies, GPU acceleration, and measured performance gains.

AI searchAgentic RAGGPU Acceleration
0 likes · 6 min read
Agent Architecture in Action: Building Next‑Gen Recommendation and Search Systems
DataFunSummit
DataFunSummit
May 1, 2026 · Artificial Intelligence

How Agentic Architectures Power the Next‑Gen Recommendation and Search Systems

This article summarizes a technical ebook that analyzes the evolution of recommendation and search systems—from deep‑learning models to large‑language‑model agents—detailing multi‑agent RAG architectures, Huawei’s KAR knowledge adapters, Baidu’s generative ranking (GRAB), Elasticsearch vector search, and performance results such as a 1.5% AUC lift and GPU‑accelerated throughput gains.

ElasticsearchGenerative RankingMulti-Agent Architecture
0 likes · 6 min read
How Agentic Architectures Power the Next‑Gen Recommendation and Search Systems
Kuaishou Tech
Kuaishou Tech
Apr 24, 2026 · Artificial Intelligence

ICLR 2026: Kuaishou Tech Team’s Cutting‑Edge AI Research Highlights

This article reviews eight Kuaishou‑authored papers accepted at ICLR 2026, summarizing their problem statements, novel methods such as front‑door causal attribution, visual table retrieval, denoising rerankers, difficulty‑adaptive reasoning, diffusion code infilling, generative ordinal regression, multimodal video retrieval, e‑commerce dialogue benchmarks, and a new LLM creativity evaluator, together with reported experimental gains.

Causal AttributionICLR 2026Kuaishou
0 likes · 19 min read
ICLR 2026: Kuaishou Tech Team’s Cutting‑Edge AI Research Highlights
DataFunSummit
DataFunSummit
Apr 21, 2026 · Industry Insights

How AI Search & Recommendation Systems Beat Multi-Modal, High-Concurrency Hurdles

This article reviews cutting‑edge technical practices from Alibaba Cloud AI Search, Huawei Noah's recommendation platform, and Baidu's GRAB model, detailing how multi‑agent RAG architectures, large‑language‑model enhancements, and generative ranking overcome high‑concurrency, multi‑modal data, and feature‑engineering bottlenecks.

AI searchGenerative RankingMulti-Modal Retrieval
0 likes · 6 min read
How AI Search & Recommendation Systems Beat Multi-Modal, High-Concurrency Hurdles
Alimama Tech
Alimama Tech
Mar 26, 2026 · Industry Insights

How Alibaba’s Large User Model (LUM) Boosted CTR by 4.5% and Scaled to Billions of Parameters

The article analyzes the evolution from traditional modular recommendation models to a generative Large User Model (LUM), detailing its three‑stage paradigm, tokenization, training objectives, scaling‑law findings, offline and online experiments, and the AI‑infra innovations that enabled a 4.5% CTR lift in production.

CTR predictionGenerative ModelingRecommendation Systems
0 likes · 18 min read
How Alibaba’s Large User Model (LUM) Boosted CTR by 4.5% and Scaled to Billions of Parameters
Tencent Advertising Technology
Tencent Advertising Technology
Mar 23, 2026 · Industry Insights

Why Tencent’s $885K KDD Cup Challenge Could Redefine Recommendation Systems

The 2026 KDD Cup, powered by Tencent’s Advertising Algorithm Competition with an $885,000 prize pool, challenges participants to unify sequence modeling and feature interaction in large‑scale recommendation systems, offering academic publication paths, real‑world deployment opportunities, and strict latency constraints that push both research and engineering innovation.

AIKDD CupRecommendation Systems
0 likes · 16 min read
Why Tencent’s $885K KDD Cup Challenge Could Redefine Recommendation Systems
AI Explorer
AI Explorer
Mar 20, 2026 · Industry Insights

Key AI Breakthroughs and Market Moves on March 20 2026

On March 20 2026, Alibaba’s Qwen 3.5‑Max topped the LMArena blind‑test, OpenAI bought Astral to boost AI coding, Zhejiang University released a real‑time 4D world model, Meta’s Agent leaked data, and a series of AI‑driven innovations from Nvidia, robotics to drug discovery reshaped the industry.

AIAI design toolsAI hardware
0 likes · 7 min read
Key AI Breakthroughs and Market Moves on March 20 2026
Kuaishou Tech
Kuaishou Tech
Mar 4, 2026 · Artificial Intelligence

How LLMs Are Revolutionizing Reinforcement Learning for Recommendation Systems

This survey examines the emerging LLM‑RL collaborative recommendation paradigm, outlining its research background, five main collaboration patterns, standardized evaluation protocols, and the key challenges and future directions for building smarter, more robust recommender systems.

LLMRecommendation Systemsartificial intelligence
0 likes · 14 min read
How LLMs Are Revolutionizing Reinforcement Learning for Recommendation Systems
AI Explorer
AI Explorer
Mar 3, 2026 · Industry Insights

How Meta’s Social Graph Could Redefine E‑Commerce Recommendations

Meta is secretly testing an AI shopping assistant that leverages billions of users' social profiles, shifting recommendation logic from reactive behavior data to proactive identity‑driven suggestions, while raising significant privacy and ecosystem implications for e‑commerce.

AIMetaRecommendation Systems
0 likes · 6 min read
How Meta’s Social Graph Could Redefine E‑Commerce Recommendations
DataFunSummit
DataFunSummit
Feb 27, 2026 · Artificial Intelligence

How Large Language Models Are Revolutionizing Ad Recommendation and Solving Cold‑Start Problems

This article explains how advertising recommendation is evolving from traditional feature‑engineered models to LLM‑driven pipelines, detailing data‑infrastructure challenges, semantic upgrades with multimodal embeddings, case studies in short‑video ads, user cold‑start prompt engineering, and future directions for generative recommendation systems.

Ad TechLLMRecommendation Systems
0 likes · 12 min read
How Large Language Models Are Revolutionizing Ad Recommendation and Solving Cold‑Start Problems
DeWu Technology
DeWu Technology
Feb 11, 2026 · Artificial Intelligence

How Generative Models Transform Re‑ranking Architecture for Faster, More Diverse Recommendations

This article examines the evolution of re‑ranking systems from traditional pointwise models to a two‑stage generation‑evaluation framework, compares autoregressive and non‑autoregressive generative approaches, details inference speed optimizations with GPU and model‑server upgrades, and outlines a future end‑to‑end sequence generation architecture enhanced by reinforcement learning and contrastive learning.

AIGenerative ModelsInference Optimization
0 likes · 14 min read
How Generative Models Transform Re‑ranking Architecture for Faster, More Diverse Recommendations
Ximalaya Technology Team
Ximalaya Technology Team
Feb 11, 2026 · Artificial Intelligence

How Ximalaya Used Generative AI to Revolutionize Audio Recommendations

This article details Ximalaya's journey from traditional multi‑stage recommendation pipelines to generative AI‑driven models, covering business challenges, architectural and model differences, phased deployments, knowledge distillation, semantic ID encoding, decoder‑only strategies, extensive offline and online evaluations, and future research directions.

Encoder-DecoderRecommendation Systemsaudio recommendation
0 likes · 24 min read
How Ximalaya Used Generative AI to Revolutionize Audio Recommendations
Data Party THU
Data Party THU
Feb 9, 2026 · Artificial Intelligence

Aligning Collaborative Filtering with LLM Token Generation: The TCA4Rec Breakthrough

This paper introduces the TCA4Rec framework that directly aligns item‑level collaborative‑filtering preferences with token‑level objectives of large language models, presenting novel modules, extensive experiments, and analysis that demonstrate significant performance gains in generative recommendation tasks.

Generative RecommendationLLMRecommendation Systems
0 likes · 9 min read
Aligning Collaborative Filtering with LLM Token Generation: The TCA4Rec Breakthrough
Network Intelligence Research Center (NIRC)
Network Intelligence Research Center (NIRC)
Jan 25, 2026 · Artificial Intelligence

RecFlow Breaks DLRM Inference Bottleneck with Fine-Grained GPU Parallelism

RecFlow, a new inference engine from Beijing University of Posts and Telecommunications and Meituan, tackles the resource mismatch of DLRM models by coordinating embedding and DNN operators at the intra‑SM level and introducing interference‑aware adaptive scheduling and incremental batching, achieving up to 9.34× higher throughput on RTX 3090.

DLRMFine-grained parallelismGPU Acceleration
0 likes · 7 min read
RecFlow Breaks DLRM Inference Bottleneck with Fine-Grained GPU Parallelism
Kuaishou Tech
Kuaishou Tech
Jan 19, 2026 · Artificial Intelligence

How OneSug Revolutionizes E‑commerce Query Suggestion with End‑to‑End Generative Modeling

OneSug introduces an end‑to‑end generative framework that unifies recall, coarse‑ranking, and fine‑ranking for e‑commerce query suggestion, addressing the limitations of traditional multi‑stage cascades and dramatically improving relevance, efficiency, and business metrics in real‑world deployments.

Generative ModelsRecommendation Systemse‑commerce
0 likes · 10 min read
How OneSug Revolutionizes E‑commerce Query Suggestion with End‑to‑End Generative Modeling
DataFunTalk
DataFunTalk
Jan 4, 2026 · Artificial Intelligence

How Agentic RAG and Generative Ranking Are Redefining AI Search and Recommendation

This article summarizes three cutting‑edge AI techniques—Alibaba Cloud's Agentic RAG architecture for multimodal search, Huawei Noah's large‑model‑driven recommendation system evolution, and Baidu's generative ranking (GRAB) model for ads—detailing their challenges, designs, performance gains, and practical deployment insights.

AI searchGenerative RankingMulti-Agent Architecture
0 likes · 7 min read
How Agentic RAG and Generative Ranking Are Redefining AI Search and Recommendation
Network Intelligence Research Center (NIRC)
Network Intelligence Research Center (NIRC)
Dec 31, 2025 · Artificial Intelligence

Why AI Inference Is Slow and How Cutting‑Edge Tech Boosts It in Industrial Settings

The article analyzes the severe inference bottlenecks of large language models, CNNs, and recommendation systems and presents a suite of research‑driven accelerations—including token‑level pipeline parallelism (HPipe), KV‑cache clustering (ClusterAttn), quantization (QoKV), heterogeneous edge frameworks (DeepZoning, PICO), delay‑aware edge‑cloud scheduling (DECC), and operator choreography (RACE)—validated on real‑world industrial workloads.

AI inferenceRecommendation Systemsedge AI
0 likes · 16 min read
Why AI Inference Is Slow and How Cutting‑Edge Tech Boosts It in Industrial Settings
DataFunSummit
DataFunSummit
Dec 19, 2025 · Artificial Intelligence

How Agentic RAG, LLM‑Powered Recommendations, and Generative Ranking Transform AI Search and Ads

This article surveys cutting‑edge AI techniques—including Alibaba Cloud's Agentic RAG for multimodal search, Huawei Noah's LLM‑enhanced recommendation evolution, and Baidu's generative ranking (GRAB) for ads—detailing their architectures, optimization tricks, performance gains, and real‑world deployment results.

AI searchAgentic RAGGPU Acceleration
0 likes · 9 min read
How Agentic RAG, LLM‑Powered Recommendations, and Generative Ranking Transform AI Search and Ads
JD Retail Technology
JD Retail Technology
Dec 11, 2025 · Artificial Intelligence

How GIN-Based Cohort Modeling Boosts Cold-Start CTR Prediction by 2%

This article explains a SIGIR 2025 paper that tackles cold‑start click‑through‑rate prediction in JD's ad system by using a Graph Isomorphism Network‑based cohort modeling framework, detailing its three‑module architecture, extensive experiments on public and industrial datasets, and a live deployment that achieved a 2.13% CTR lift.

CTR predictionGinGraph Neural Network
0 likes · 9 min read
How GIN-Based Cohort Modeling Boosts Cold-Start CTR Prediction by 2%
DataFunTalk
DataFunTalk
Dec 2, 2025 · Artificial Intelligence

How Agentic RAG, LLM‑Powered Recommendation, and Generative Ranking Are Redefining AI Search

This article reviews three cutting‑edge AI search and recommendation techniques—Alibaba Cloud's Agentic RAG architecture, Huawei Noah's LLM‑enhanced recommendation pipeline, and Baidu's GRAB generative ranking model—detailing their design challenges, multi‑modal retrieval strategies, performance gains, and real‑world deployment results.

AI AgentsAI searchGenerative Ranking
0 likes · 8 min read
How Agentic RAG, LLM‑Powered Recommendation, and Generative Ranking Are Redefining AI Search
DataFunSummit
DataFunSummit
Nov 29, 2025 · Artificial Intelligence

How LLMs Are Transforming Long-Term Cross-Domain Interest Modeling for Recommendations

The Datafun Summit 2025 talk by JD’s algorithm engineer Tian Mingyang explains how generative AI is driving a paradigm shift in recommendation systems, detailing the limits of traditional models, the new dynamic cross‑domain inference chain technique, joint engineering‑algorithm optimizations, and the remaining challenges for future deployment.

AICross-Domain ModelingEngineering Optimization
0 likes · 32 min read
How LLMs Are Transforming Long-Term Cross-Domain Interest Modeling for Recommendations
DataFunTalk
DataFunTalk
Nov 25, 2025 · Artificial Intelligence

Unlocking Agentic RAG and Generative Ranking: AI Search & Recommendation Breakthroughs

This article summarizes cutting‑edge techniques from Alibaba Cloud AI Search’s Agentic RAG architecture, Huawei Noah’s LLM‑enhanced recommendation evolution, and Baidu’s GRAB generative ranking model, detailing multi‑agent retrieval, multimodal data handling, scaling laws, causal attention, and performance gains demonstrated through benchmarks and real‑world deployments.

AI searchAgentic RAGGenerative Ranking
0 likes · 8 min read
Unlocking Agentic RAG and Generative Ranking: AI Search & Recommendation Breakthroughs
DataFunSummit
DataFunSummit
Nov 22, 2025 · Artificial Intelligence

Breaking the Recommendation Filter Bubble: Alibaba 1688’s Inference‑Driven AI

Alibaba’s 1688 platform leverages inference‑based large language models to enhance recommendation discovery, addressing the filter‑bubble problem by analyzing long‑term buyer behavior, compressing extensive activity streams, generating nuanced demand queries, and integrating multimodal data and market trend agents to deliver more diverse, explainable product suggestions for B‑type buyers.

AIE‑commerceInference
0 likes · 23 min read
Breaking the Recommendation Filter Bubble: Alibaba 1688’s Inference‑Driven AI
DataFunSummit
DataFunSummit
Nov 9, 2025 · Artificial Intelligence

How Kuaishou Boosted Ad Performance with Multimodal LLMs: COPE & LEARN Frameworks

This article reviews Kuaishou's two‑year exploration of large‑model techniques in advertising, detailing the challenges of content‑domain ad estimation, the use of multimodal and LLM technologies to harness full‑scope user behavior and external knowledge, and the COPE and LEARN frameworks that delivered measurable business gains.

AdvertisingKnowledge TransferMultimodal AI
0 likes · 6 min read
How Kuaishou Boosted Ad Performance with Multimodal LLMs: COPE & LEARN Frameworks
JD Tech
JD Tech
Nov 6, 2025 · Artificial Intelligence

LLMs Revolutionize Recommendation Systems: From Generative Models to Production

This article surveys the evolution of generative recommendation systems powered by large language models, detailing their technical foundations, engineering challenges, recent breakthroughs, and future research directions, while highlighting why the paradigm shift is occurring now.

AI EngineeringGenerative RecommendationLLM
0 likes · 30 min read
LLMs Revolutionize Recommendation Systems: From Generative Models to Production
Zhihu Tech Column
Zhihu Tech Column
Nov 4, 2025 · Artificial Intelligence

How Multimodal Large Models Transform Recommendation Systems: From Tags to Embeddings

This article explores how multimodal large models like Qwen2.5‑VL enable high‑dimensional tag generation and universal embeddings for recommendation systems, detailing data synthesis, model training, quantization, fine‑tuning, and the resulting improvements in click‑through rate and exposure interaction.

EmbeddingMultimodal AIRecommendation Systems
0 likes · 17 min read
How Multimodal Large Models Transform Recommendation Systems: From Tags to Embeddings
Kuaishou Large Model
Kuaishou Large Model
Oct 31, 2025 · Artificial Intelligence

EMER: End-to-End Multi-Objective Ranking That Transforms Short-Video Recommendations

EMER, Kuaishou’s end‑to‑end multi‑objective ensemble ranking framework, replaces handcrafted scoring formulas with a transformer‑based model that learns comparative preferences, integrates normalized rank features, optimizes relative satisfaction and multi‑dimensional proxy metrics, and dynamically balances objectives via a self‑evolving advantage evaluator, delivering significant online gains.

Recommendation SystemsTransformermachine learning
0 likes · 17 min read
EMER: End-to-End Multi-Objective Ranking That Transforms Short-Video Recommendations
Kuaishou Tech
Kuaishou Tech
Oct 30, 2025 · Artificial Intelligence

How EMER Revolutionizes Short‑Video Ranking with End‑to‑End Multi‑Objective Learning

This article details the EMER framework—a Transformer‑based, end‑to‑end multi‑objective ranking system that replaces handcrafted formulas with a learnable AI model, introduces relative‑satisfaction signals and dynamic loss weighting, and demonstrates significant offline and online performance gains in Kuaishou's short‑video recommendation pipeline.

AIRecommendation Systemsmulti-objective learning
0 likes · 16 min read
How EMER Revolutionizes Short‑Video Ranking with End‑to‑End Multi‑Objective Learning
Ele.me Technology
Ele.me Technology
Oct 27, 2025 · Artificial Intelligence

How IAK Transforms Multi‑Domain Recommendation with Pre‑Training and Fine‑Tuning

This paper introduces IAK, a unified multi‑domain recommendation paradigm that treats the system as a large model, leveraging pre‑training and fine‑tuning with an information‑aware adaptive kernel to capture rapid user interest shifts while reducing training costs and improving online performance.

Recommendation Systemsfine‑tuninginformation bottleneck
0 likes · 18 min read
How IAK Transforms Multi‑Domain Recommendation with Pre‑Training and Fine‑Tuning
DataFunSummit
DataFunSummit
Oct 12, 2025 · Artificial Intelligence

How Kuaishou Uses Large Models to Supercharge Ad Targeting with COPE and LEARN

This article reviews Kuaishou's two‑year exploration of multimodal large‑model techniques for advertising, outlining challenges in content‑domain ad estimation, the COPE unified product representation framework, and the LEARN LLM knowledge‑transfer approach that together improve ad system performance.

AdvertisingKuaishouLLM
0 likes · 6 min read
How Kuaishou Uses Large Models to Supercharge Ad Targeting with COPE and LEARN
DataFunSummit
DataFunSummit
Oct 12, 2025 · Artificial Intelligence

How Baidu’s Generative Recall System (COBRA) Revolutionizes Ad Recommendations

This article details Baidu's generative recommendation ad recall framework, introducing the COBRA system and its three development stages—dense representation compression, sparse quantization with ID generation, and dense‑sparse cascading—highlighting coarse‑to‑fine inference, performance gains, long‑sequence extensions, online deployment, and future research directions.

COBRARecommendation Systemsad recall
0 likes · 18 min read
How Baidu’s Generative Recall System (COBRA) Revolutionizes Ad Recommendations
DataFunSummit
DataFunSummit
Oct 10, 2025 · Artificial Intelligence

How Kuaishou Boosted Ad Performance with Multimodal Large Models

This article reviews Kuaishou's two‑year exploration of large‑model techniques in advertising, outlining challenges in content‑domain ad estimation, introducing the COPE unified content representation framework and the LEARN LLM knowledge‑transfer approach, and showing how these innovations delivered tangible business gains.

AIAdvertisingKnowledge Transfer
0 likes · 5 min read
How Kuaishou Boosted Ad Performance with Multimodal Large Models
DataFunSummit
DataFunSummit
Oct 9, 2025 · Artificial Intelligence

How Kuaishou Boosted Ad Performance with Multimodal Large Models: COPE & LEARN

This article reviews Kuaishou's two‑year exploration of multimodal large‑model techniques for advertising, detailing challenges of fragmented user behavior, the COPE unified product representation framework, and the LEARN LLM knowledge‑transfer approach that together delivered measurable business gains.

AIAdvertisingKnowledge Transfer
0 likes · 6 min read
How Kuaishou Boosted Ad Performance with Multimodal Large Models: COPE & LEARN
Alimama Tech
Alimama Tech
Oct 1, 2025 · Artificial Intelligence

How RecIS Revolutionizes Large‑Scale Sparse‑Dense Recommendation Training

RecIS is an open‑source, PyTorch‑based unified framework designed for ultra‑large‑scale sparse‑dense computation in recommendation systems, offering a full solution for training models with massive samples, multimodal inputs, and large embeddings, and demonstrating significant performance gains over TensorFlow and TorchRec in production deployments.

PyTorchRecommendation Systemsdeep learning framework
0 likes · 24 min read
How RecIS Revolutionizes Large‑Scale Sparse‑Dense Recommendation Training
DataFunSummit
DataFunSummit
Sep 30, 2025 · Artificial Intelligence

How Kuaishou Uses Large Models to Boost Ad Performance with COPE and LEARN

This article outlines Kuaishou's two‑year exploration of large‑model techniques in advertising, detailing challenges of sparse cross‑domain data, the COPE unified product representation framework, and the LEARN LLM knowledge‑transfer approach that together improve ad system effectiveness.

COPELLMRecommendation Systems
0 likes · 6 min read
How Kuaishou Uses Large Models to Boost Ad Performance with COPE and LEARN
DataFunSummit
DataFunSummit
Sep 30, 2025 · Artificial Intelligence

How Kuaishou Boosted Ad Performance with Multimodal LLMs: COPE & LEARN Frameworks

Over the past two years, Kuaishou has leveraged multimodal large‑model techniques to overcome sparse advertising data, integrating full‑domain user behavior and external knowledge via the COPE unified product representation framework and the LEARN LLM knowledge‑transfer system, achieving measurable business gains.

KuaishouLLMRecommendation Systems
0 likes · 6 min read
How Kuaishou Boosted Ad Performance with Multimodal LLMs: COPE & LEARN Frameworks
HyperAI Super Neural
HyperAI Super Neural
Sep 30, 2025 · Artificial Intelligence

OnePiece: Applying LLM‑Style Reasoning to Item‑ID Sequences for Generative Recommendation

The article presents the OnePiece framework, which injects LLM‑style context engineering and latent reasoning into item‑ID based search‑and‑recommendation models, details the design choices, training tricks, attention analysis, and reports online gains of around 1% GMV and ad revenue, offering a thorough technical dissection of generative recommendation in industrial settings.

Context EngineeringGenerative RecommendationLLM Reasoning
0 likes · 31 min read
OnePiece: Applying LLM‑Style Reasoning to Item‑ID Sequences for Generative Recommendation
DataFunSummit
DataFunSummit
Sep 17, 2025 · Product Management

How AI Is Redefining Recommendation Strategies and Product Management Careers

This presentation explores AI-era recommendation strategies, defines strategy product roles and ability models, outlines the three generations of product managers, discusses AI-driven trends, workflow simplifications, 2024 observations, and offers practical guidance for career growth in product management.

AIRecommendation Systemslarge models
0 likes · 14 min read
How AI Is Redefining Recommendation Strategies and Product Management Careers
DataFunTalk
DataFunTalk
Sep 12, 2025 · Artificial Intelligence

How Large Language Models Are Transforming Health E‑Commerce Recommendations

This article explains how JD Health’s recommendation team integrates large‑model technologies—scaling CTR models, enhancing pipelines with LLMs, and adopting generative models—into e‑commerce recommendation systems, highlighting practical applications and technical challenges specific to the health‑commerce sector.

AICTR modelsRecommendation Systems
0 likes · 5 min read
How Large Language Models Are Transforming Health E‑Commerce Recommendations
DataFunSummit
DataFunSummit
Sep 9, 2025 · Artificial Intelligence

How Baidu’s GRAB Model Uses Scaling Laws to Transform Ad Ranking

This article explains Baidu's generative ranking model GRAB, detailing how scaling laws from large language models inspire a new recommendation paradigm, the model's architecture, custom attention mechanisms, training strategies, deployment optimizations, and the resulting business gains in CTR and revenue.

BaiduCTR predictionRecommendation Systems
0 likes · 22 min read
How Baidu’s GRAB Model Uses Scaling Laws to Transform Ad Ranking
JD Tech Talk
JD Tech Talk
Sep 9, 2025 · Artificial Intelligence

How JD’s Dynamic Re‑Ranking Model Boosted Search Relevance and Won SIGIR 2024

The author recounts how, by modeling user intent with a multi‑layer Gaussian‑based PODM‑MI framework and addressing a novel ‘sand‑glass’ bottleneck in RQ‑VAE semantic identifiers, JD’s search ranking achieved significant UCVR gains, annual order increases of over ten million, and a SIGIR 2024 paper acceptance.

E-commerce AIRecommendation SystemsSIGIR
0 likes · 8 min read
How JD’s Dynamic Re‑Ranking Model Boosted Search Relevance and Won SIGIR 2024
Data Party THU
Data Party THU
Aug 14, 2025 · Artificial Intelligence

How FilterLLM Turns One LLM Pass into Billion‑User Cold‑Start Recommendations

The article analyzes the FilterLLM approach, which augments a frozen LLM with billions of learnable user tokens to predict a full‑user interaction probability distribution in a single forward pass, dramatically speeding up cold‑start recommendation while preserving recommendation quality across multiple benchmarks.

AIFilterLLMLLM
0 likes · 8 min read
How FilterLLM Turns One LLM Pass into Billion‑User Cold‑Start Recommendations
Kuaishou Tech
Kuaishou Tech
Jul 29, 2025 · Artificial Intelligence

How Kuaishou’s 8 Groundbreaking Papers Are Shaping AI at KDD 2025

Eight Kuashou research papers covering recommendation systems, multi‑task learning, multimodal large models, large language models, and combinatorial optimization have been accepted to the premier AI data‑mining conference KDD 2025, highlighting the company’s cutting‑edge innovations and their potential impact on the field.

AIMultimodal LearningRecommendation Systems
0 likes · 18 min read
How Kuaishou’s 8 Groundbreaking Papers Are Shaping AI at KDD 2025
Kuaishou Tech
Kuaishou Tech
Jul 23, 2025 · Artificial Intelligence

Revolutionizing Cascade Ranking with LCRON: End-to-End Training for Ads

This article introduces LCRON, a novel end-to-end training framework for cascade ranking systems that aligns training objectives with overall recall, addresses stage interaction challenges, and demonstrates significant performance gains on public benchmarks and in Kuaishou’s commercial advertising platform.

AdvertisingRecommendation Systemscascade ranking
0 likes · 14 min read
Revolutionizing Cascade Ranking with LCRON: End-to-End Training for Ads
DataFunSummit
DataFunSummit
Jul 9, 2025 · Artificial Intelligence

How LAST Enables Real‑Time Learning for Re‑Ranking in E‑Commerce Recommendations

This article presents LAST, a novel Learning-at-Serving-Time approach that enables real‑time online learning for re‑ranking in industrial recommendation pipelines, eliminating feedback latency, detailing its architecture, challenges, experimental validation, and practical advantages over traditional online learning methods.

LAST algorithmRecommendation Systemsonline serving
0 likes · 12 min read
How LAST Enables Real‑Time Learning for Re‑Ranking in E‑Commerce Recommendations
DataFunTalk
DataFunTalk
Jul 3, 2025 · Artificial Intelligence

How Vivo’s Blue Heart XiaoV Leverages LLMs to Transform Conversational Recommendations

In an interview with Vivo AI engineer Liang Tianan, the article explores the challenges of post‑Q&A recommendation, the integration of large language models into recall, ranking and evaluation pipelines, and the engineering trade‑offs required to deliver high‑quality, diverse suggestions on mobile devices.

LLMMobile AIRecommendation Systems
0 likes · 15 min read
How Vivo’s Blue Heart XiaoV Leverages LLMs to Transform Conversational Recommendations
DataFunTalk
DataFunTalk
Jun 27, 2025 · Artificial Intelligence

How Generative AI is Revolutionizing Ad Recommendation Systems

Join Baidu senior algorithm engineer Ji Zhi at the DataFun Summit 2025 to explore how generative AI transforms ad recommendation recall, covering item representation, evolving solution architectures, long‑sequence challenges, and practical insights for building efficient large‑model recommendation systems.

AI researchAd TechBaidu
0 likes · 3 min read
How Generative AI is Revolutionizing Ad Recommendation Systems
Meituan Technology Team
Meituan Technology Team
May 15, 2025 · Artificial Intelligence

How Meituan’s MTGR Framework Achieved 65× Faster Inference with Scaling Laws

Meituan’s recommendation team introduced the MTGR framework, aligning traditional DLRM features with a unified HSTU‑based Transformer to explore scaling laws, delivering a 65‑fold FLOPs boost, 12% lower inference cost, and record gains in online CTR and order volume across its food‑delivery platform.

Inference OptimizationLarge-Scale TrainingMTGR
0 likes · 26 min read
How Meituan’s MTGR Framework Achieved 65× Faster Inference with Scaling Laws
AntTech
AntTech
May 15, 2025 · Artificial Intelligence

Live Deep Dive into Two Award‑Winning WSDM 2025 Papers on Popularity Bias in Recommendation Models and Graph‑Based Causal Inference

This announcement introduces a live session that will dissect two best‑paper award research works from WSDM 2025—one revealing how recommendation models amplify popularity bias through spectral analysis and proposing a lightweight regularizer, and the other presenting a graph disentangle causal model that integrates GNNs with structural causal models to improve causal inference on networked observational data.

Recommendation SystemsWSDM 2025causal inference
0 likes · 4 min read
Live Deep Dive into Two Award‑Winning WSDM 2025 Papers on Popularity Bias in Recommendation Models and Graph‑Based Causal Inference
JD Tech
JD Tech
May 6, 2025 · Artificial Intelligence

One4All Generative Recommendation Framework for CPS Advertising

This article reviews recent advances in applying large language models to CPS advertising recommendation, outlines business requirements and core technical challenges, proposes an extensible multi‑task generative framework with explicit intent perception and multi‑objective optimization, and presents offline and online performance gains along with future research directions.

AI OptimizationCPS advertisingGenerative Models
0 likes · 13 min read
One4All Generative Recommendation Framework for CPS Advertising
JD Tech Talk
JD Tech Talk
Apr 27, 2025 · Artificial Intelligence

Breaking the Hourglass Phenomenon of Residual Quantization: Enhancing the Upper Bound of Generative Retrieval

This paper investigates the "sandglass" phenomenon in residual‑quantized semantic identifiers for generative search and recommendation, analyzes its causes of path sparsity and long‑tail token distribution, and proposes heuristic and adaptive token‑removal methods that substantially improve model performance in e‑commerce scenarios.

Generative RetrievalRecommendation Systemsadaptive token removal
0 likes · 10 min read
Breaking the Hourglass Phenomenon of Residual Quantization: Enhancing the Upper Bound of Generative Retrieval
Alimama Tech
Alimama Tech
Apr 3, 2025 · Artificial Intelligence

UQABench: A Personalized QA Benchmark for Evaluating User Embeddings in LLM‑Driven Recommendation Systems

UQABench introduces the first benchmark for assessing high‑density user embeddings that serve as soft prompts in LLM‑driven recommendation, featuring a three‑stage pre‑train‑align‑evaluate pipeline, seven personalized QA tasks, and findings that transformer encoders, side‑information, simple linear adapters, and larger models markedly improve accuracy while cutting input tokens to about five percent.

AIBenchmarkLLM
0 likes · 12 min read
UQABench: A Personalized QA Benchmark for Evaluating User Embeddings in LLM‑Driven Recommendation Systems
Alimama Tech
Alimama Tech
Mar 28, 2025 · Artificial Intelligence

How Alibaba’s Taobao AI Models Revolutionize E‑Commerce Recommendations and Bidding

Alibaba’s Taobao Group unveiled its AIGX technology suite, including the RecGPT recommendation model, the AIGB generative bidding system, and a new AI‑generated video engine, detailing open‑source benchmarks, NeurIPS workshop participation, and measurable ROI improvements for e‑commerce advertising.

AIGenerative BiddingRecommendation Systems
0 likes · 5 min read
How Alibaba’s Taobao AI Models Revolutionize E‑Commerce Recommendations and Bidding
58 Tech
58 Tech
Mar 11, 2025 · Artificial Intelligence

Applying Large Language Models to Real Estate Recommendation: Case Studies and Optimization Techniques

This article presents a comprehensive case study on how large language models are integrated into 58.com’s real‑estate recommendation platform, detailing challenges, data adaptation, prompt and parameter optimizations, embedding generation, conversational recommendation, and future directions for multimodal and generative recommendation systems.

AI OptimizationEmbeddingPrompt Engineering
0 likes · 14 min read
Applying Large Language Models to Real Estate Recommendation: Case Studies and Optimization Techniques
Cognitive Technology Team
Cognitive Technology Team
Mar 6, 2025 · Artificial Intelligence

From Traditional Machine Learning to Deep Learning: A Comprehensive Guide to Algorithms, Feature Engineering, and Model Training

This article provides a step‑by‑step tutorial that walks readers through the fundamentals of traditional machine‑learning algorithms, feature‑engineering techniques, model training pipelines, evaluation metrics, and then advances to deep‑learning concepts such as MLPs, activation functions, transformers, and modern recommendation‑system models.

Deep LearningModel TrainingPython
0 likes · 63 min read
From Traditional Machine Learning to Deep Learning: A Comprehensive Guide to Algorithms, Feature Engineering, and Model Training
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 6, 2025 · Artificial Intelligence

From Linear Regression to Transformers: Mastering Machine Learning Foundations

This comprehensive guide walks readers through the evolution of machine learning, starting with basic linear models and feature engineering, progressing through logistic regression, decision trees, and deep learning architectures like MLPs, CNNs, RNNs, and transformers, and demonstrates practical implementations with code examples and evaluation metrics.

Deep LearningEvaluation MetricsRecommendation Systems
0 likes · 64 min read
From Linear Regression to Transformers: Mastering Machine Learning Foundations
DataFunSummit
DataFunSummit
Mar 5, 2025 · Artificial Intelligence

Evolution and Future Trends of Recommendation Systems: From Deep Learning to Large Language Models and AI Agents

This article reviews a decade of recommendation‑system research, outlines the shift from traditional listwise methods to deep‑learning models, discusses the impact of large language models and AI agents, and presents future directions such as multimodal interaction, responsible AI, cognitive modeling, and ecosystem integration.

Recommendation Systemsuser behavior modeling
0 likes · 31 min read
Evolution and Future Trends of Recommendation Systems: From Deep Learning to Large Language Models and AI Agents
Qunar Tech Salon
Qunar Tech Salon
Feb 17, 2025 · Artificial Intelligence

Evolution of Qunar Hotel Search Ranking: From LambdaMart to LambdaDNN and Multi‑Objective Optimization

The article details Qunar’s hotel search ranking system evolution, covering the shift from rule‑based sorting to LambdaMart, the adoption of LambdaDNN deep models, multi‑objective MMOE architectures, multi‑scenario integration, extensive feature engineering, and experimental results demonstrating significant offline and online performance gains.

Learning-to-RankRecommendation Systemsdeep-learning
0 likes · 36 min read
Evolution of Qunar Hotel Search Ranking: From LambdaMart to LambdaDNN and Multi‑Objective Optimization
DataFunSummit
DataFunSummit
Feb 14, 2025 · Artificial Intelligence

Building Large‑Scale Recommendation Systems with Big Data and Large Language Models on Alibaba Cloud AI Platform

This presentation details how Alibaba Cloud's AI platform integrates big‑data pipelines, feature‑store services, and large language model capabilities to construct high‑performance search‑recommendation architectures, covering system design, training and inference optimizations, LLM‑driven use cases, and open‑source RAG tooling.

AI PlatformBig DataDistributed Training
0 likes · 17 min read
Building Large‑Scale Recommendation Systems with Big Data and Large Language Models on Alibaba Cloud AI Platform
ByteDance Data Platform
ByteDance Data Platform
Feb 12, 2025 · Fundamentals

Why A/B Tests Fail in Recommendation Systems and How to Fix Them

This article examines the hidden complexities of A/B experiments in short‑video recommendation feeds, explains why traditional designs produce biased results due to learning, double‑sided, and network effects, and presents practical double‑sided and community‑randomized experiment frameworks to obtain unbiased strategy evaluations.

A/B testingCommunity randomizationDouble-sided effects
0 likes · 21 min read
Why A/B Tests Fail in Recommendation Systems and How to Fix Them
JD Retail Technology
JD Retail Technology
Feb 12, 2025 · Artificial Intelligence

Accelerating Generative Recommendation with NVIDIA TensorRT‑LLM in JD Advertising

JD Advertising accelerates its generative‑recall recommendation system by integrating NVIDIA TensorRT‑LLM, which simplifies the pipeline, injects LLM knowledge, scales to billions of parameters, and delivers over five‑fold throughput gains, one‑fifth the cost, and significant CTR improvements in both recommendation and search.

Inference OptimizationLLMRecommendation Systems
0 likes · 13 min read
Accelerating Generative Recommendation with NVIDIA TensorRT‑LLM in JD Advertising
DataFunTalk
DataFunTalk
Feb 6, 2025 · Artificial Intelligence

Why Graph Neural Networks Are Suitable for Recommendation Systems

Graph Neural Networks excel in recommendation systems because they can model complex user‑item relationships, capture high‑order interactions, adapt dynamically to real‑time behavior, propagate multi‑step information, enrich contextual embeddings, alleviate data sparsity, and improve long‑tail item coverage, with practical e‑commerce case studies available for download.

GNNRecommendation Systemsartificial intelligence
0 likes · 5 min read
Why Graph Neural Networks Are Suitable for Recommendation Systems
JD Retail Technology
JD Retail Technology
Jan 21, 2025 · Artificial Intelligence

Tech Insight: Selected JD Retail Technology Papers in Artificial Intelligence (2024)

Tech Insight highlights ten 2024 JD Retail Technology AI papers presented at top conferences—including CVPR, SIGIR, WWW, AAAI and IJCAI—that advance open‑vocabulary object detection, unified search‑recommendation, pre‑ranking consistency, diversity‑aware re‑ranking, a diversified product‑search dataset, graph‑based query classification, plug‑in CTR models, parallel ad‑ranking, trajectory‑based CTR stability, and task‑aware decoding for large language models.

CTR predictionComputer VisionE‑commerce
0 likes · 20 min read
Tech Insight: Selected JD Retail Technology Papers in Artificial Intelligence (2024)
ZhongAn Tech Team
ZhongAn Tech Team
Jan 19, 2025 · Artificial Intelligence

Weekly AI Digest Issue 11: Recommendation Algorithms, Video Generation Advances, and AGI Research

This issue of the weekly AI digest explores Xiaohongshu’s NoteLLM recommendation system, compares Chinese text generation in video AI across major platforms, highlights Alibaba’s Tongyi Wanxiang breakthroughs, discusses Keras founder François Chollet’s new AGI‑focused lab, and reviews Google’s Veo 2 and Imagen‑3 advancements.

AGIAIRecommendation Systems
0 likes · 11 min read
Weekly AI Digest Issue 11: Recommendation Algorithms, Video Generation Advances, and AGI Research
DataFunTalk
DataFunTalk
Jan 18, 2025 · Artificial Intelligence

Understanding Xiaohongshu’s Content Recommendation Mechanisms: NoteLLM and SSD

This article analyzes Xiaohongshu’s content recommendation system by reviewing two official papers, detailing the NoteLLM framework for interest discovery and the Sliding Spectrum Decomposition (SSD) method for diversified recommendations, and explaining their underlying models, loss functions, and experimental results.

DiversityLLMRecommendation Systems
0 likes · 13 min read
Understanding Xiaohongshu’s Content Recommendation Mechanisms: NoteLLM and SSD
Kuaishou Tech
Kuaishou Tech
Jan 17, 2025 · Artificial Intelligence

Kuaishou Achieves 7 Papers Accepted at AAAI 2025

Kuaishou has achieved a significant milestone with 7 papers accepted at AAAI 2025, covering diverse AI research areas including video processing, recommendation systems, and image restoration, demonstrating the company's strong research capabilities in artificial intelligence.

AAAI 2025Image RestorationKuaishou
0 likes · 10 min read
Kuaishou Achieves 7 Papers Accepted at AAAI 2025
JD Cloud Developers
JD Cloud Developers
Jan 14, 2025 · Artificial Intelligence

How Generative Recommendation Systems Transform E‑Commerce with LLMs

This article explains how large language models reshape recommendation systems by simplifying pipelines, integrating world knowledge, and leveraging scaling laws, and details the engineering steps for deploying generative recall models—including product encoding, user prompting, model training, TensorRT‑LLM optimization, and continuous performance improvements.

AI OptimizationGenerative RecommendationLLM
0 likes · 13 min read
How Generative Recommendation Systems Transform E‑Commerce with LLMs
Tencent Advertising Technology
Tencent Advertising Technology
Dec 27, 2024 · Artificial Intelligence

Tencent's AutoML Research for Advertising Recommendation Systems

This article outlines Tencent's AutoML research, presenting several recent papers that introduce novel neural architecture search, feature selection, pooling, embedding size, and hyper‑parameter optimization techniques to improve the efficiency, accuracy, and scalability of large‑scale advertising recommendation systems.

AutoMLEmbedding Size SearchNeural Architecture Search
0 likes · 10 min read
Tencent's AutoML Research for Advertising Recommendation Systems
Baobao Algorithm Notes
Baobao Algorithm Notes
Dec 18, 2024 · Artificial Intelligence

How STAR Enables Training‑Free Recommendations with Large Language Models

The article reviews the STAR framework, a training‑free recommendation approach that leverages large language model embeddings and collaborative co‑occurrence scores to retrieve and rank items, and evaluates its performance, hyper‑parameter effects, and ablation studies against existing LLM‑based recommender methods.

LLMRecommendation Systemsartificial intelligence
0 likes · 10 min read
How STAR Enables Training‑Free Recommendations with Large Language Models
Kuaishou Tech
Kuaishou Tech
Nov 30, 2024 · Artificial Intelligence

Kuaishou and Tsinghua University Win First Prize in Qian Weichang Chinese Information Processing Award for Content Recommendation Technology

Kuaishou and Tsinghua University were honored with the first‑place Qian Weichang Chinese Information Processing Science and Technology Award for their collaborative content recommendation project, which achieved international‑level innovations in explainable recommendation, bias correction, and edge intelligence, and has been applied widely in Kuaishou's platform and top academic conferences.

FairnessKuaishouRecommendation Systems
0 likes · 5 min read
Kuaishou and Tsinghua University Win First Prize in Qian Weichang Chinese Information Processing Award for Content Recommendation Technology
Baobao Algorithm Notes
Baobao Algorithm Notes
Nov 25, 2024 · Artificial Intelligence

How Non‑Autoregressive Generative Models Transform Recommendation Reranking

This article presents a KDD‑2024 accepted solution that replaces autoregressive generators with a non‑autoregressive model for video recommendation reranking, detailing the challenges, model architecture, novel loss function, extensive offline and online experiments, and practical Q&A from the conference.

KDD2024Recommendation SystemsReranking
0 likes · 11 min read
How Non‑Autoregressive Generative Models Transform Recommendation Reranking
NewBeeNLP
NewBeeNLP
Nov 14, 2024 · Artificial Intelligence

What’s Trending in Recommendation Systems at KDD 2024? A Comprehensive Paper Overview

The 30th SIGKDD conference in Barcelona featured 2,046 research papers with a 20% acceptance rate, and this article compiles the 59 recommendation‑system papers—covering large‑model recommenders, graph‑based methods, sequential models, fairness, privacy, advertising, debiasing, reinforcement learning and more—for researchers to explore the latest academic advances.

FairnessKDD2024Recommendation Systems
0 likes · 15 min read
What’s Trending in Recommendation Systems at KDD 2024? A Comprehensive Paper Overview
DataFunTalk
DataFunTalk
Nov 7, 2024 · Product Management

Strategy Product Definition, AI‑Era Trends, and Career Path in Recommendation Systems

This article introduces the concept and capability model of strategy products, outlines the three generations of product managers, presents a simplified talent development framework, discusses practical workflow, examines six AI‑era strategic product questions, and shares 2024 observations on recommendation performance and future skill development.

Career DevelopmentRecommendation Systemsartificial intelligence
0 likes · 13 min read
Strategy Product Definition, AI‑Era Trends, and Career Path in Recommendation Systems
Zhuanzhuan Tech
Zhuanzhuan Tech
Nov 6, 2024 · Artificial Intelligence

Multi-Task Learning for E-commerce Search: Overview, Practices, and Model Design in the Zhuanzhuan Scenario

This article reviews the necessity, benefits, and practical implementations of multi-task learning in e‑commerce search, detailing model selection, architecture extensions such as ESMM and ESM², and future directions for handling user behavior sequences and multi‑objective optimization.

Deep LearningESMMModel architecture
0 likes · 13 min read
Multi-Task Learning for E-commerce Search: Overview, Practices, and Model Design in the Zhuanzhuan Scenario
Baobao Algorithm Notes
Baobao Algorithm Notes
Oct 24, 2024 · Artificial Intelligence

How NoteLLM-2 Boosts Multimodal Recommendations with In-Content Learning

NoteLLM-2 introduces multimodal In-Content Learning and Late Fusion to overcome visual‑modality bias in end‑to‑end fine‑tuned large representation models, delivering significant gains over baseline multimodal LLMs and traditional retrieval methods in recommendation tasks.

AI researchRecommendation Systemscontrastive learning
0 likes · 11 min read
How NoteLLM-2 Boosts Multimodal Recommendations with In-Content Learning
Tencent Advertising Technology
Tencent Advertising Technology
Oct 14, 2024 · Artificial Intelligence

Generative Retrieval Based on Yuan Large Model: Implementation and Practice in Tencent Advertising

This paper presents the implementation and practice of generative retrieval based on Yuan large model in Tencent Advertising, addressing three key challenges: user intent capture, model alignment in advertising domain, and high-performance platform design under ROI constraints.

Generative RetrievalHigh‑performance computingModel Optimization
0 likes · 17 min read
Generative Retrieval Based on Yuan Large Model: Implementation and Practice in Tencent Advertising
DataFunSummit
DataFunSummit
Oct 5, 2024 · Artificial Intelligence

Optimizing TorchRec for Large‑Scale Recommendation Systems on PyTorch

This article details the performance‑focused optimizations applied to TorchRec, PyTorch's large‑scale recommendation system library, including CUDA graph capture, multithreaded kernel launches, pinned memory copies, and input‑distribution refinements that together achieve a 2.25× speedup on MLPerf DLRM‑DCNv2 across 16 DGX H100 nodes.

CUDA GraphDistributed TrainingGPU Optimization
0 likes · 11 min read
Optimizing TorchRec for Large‑Scale Recommendation Systems on PyTorch
NewBeeNLP
NewBeeNLP
Sep 9, 2024 · Artificial Intelligence

Can Real‑Time Learning at Serving Time Transform Recommendation Re‑ranking?

This article introduces LAST, a novel online learning approach that updates recommendation models instantly at serving time, addressing real‑time learning challenges, re‑ranking complexities, and demonstrating superior offline and online performance in industrial e‑commerce scenarios.

AILASTOnline Learning
0 likes · 12 min read
Can Real‑Time Learning at Serving Time Transform Recommendation Re‑ranking?
JD Retail Technology
JD Retail Technology
Aug 30, 2024 · Artificial Intelligence

GPU Optimization Practices for Training and Inference in JD Advertising Recommendation Systems

The article details JD Advertising's technical challenges and solutions for large‑scale sparse recommendation models, describing GPU‑focused storage, compute and I/O optimizations for both training and low‑latency inference, including distributed pipelines, heterogeneous deployment, batch aggregation, multi‑stream execution, and compiler extensions.

Distributed SystemsGPU OptimizationInference
0 likes · 13 min read
GPU Optimization Practices for Training and Inference in JD Advertising Recommendation Systems
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Aug 22, 2024 · Artificial Intelligence

How RECom Accelerates Recommendation Model Inference on GPUs

The RECom compiler introduces a subgraph‑parallel fusion technique and symbolic shape handling to dramatically speed up GPU inference of deep recommendation models with massive embedding columns, achieving up to 6.61× lower latency and 1.91× higher throughput than TensorFlow baselines, while eliminating redundant computations.

GPU OptimizationRecommendation Systemscompiler
0 likes · 10 min read
How RECom Accelerates Recommendation Model Inference on GPUs
Model Perspective
Model Perspective
Aug 18, 2024 · Fundamentals

How to Judge a Mathematical Model: 6 Practical Criteria for Success

This article outlines six essential criteria—accuracy, robustness, simplicity, explainability, generalization, and scalability—for evaluating the quality of mathematical models such as e‑commerce recommendation systems, helping readers assess whether a model is truly reliable or merely a flashy façade.

Model EvaluationRecommendation SystemsRobustness
0 likes · 3 min read
How to Judge a Mathematical Model: 6 Practical Criteria for Success
DataFunSummit
DataFunSummit
Aug 10, 2024 · Artificial Intelligence

Leveraging Large Language Models for Graph Recommendation System Optimization

This article reviews cutting‑edge research on integrating large language models with graph‑based recommendation systems, detailing four key strategies—LLM node embeddings, deep graph‑LLM fusion, model‑driven graph data training, and text‑modal enhancements—while analyzing representation learning, InfoNCE optimization, explainable recommendations, and extensive experimental validation.

InfoNCELLMRecommendation Systems
0 likes · 18 min read
Leveraging Large Language Models for Graph Recommendation System Optimization
DataFunTalk
DataFunTalk
Aug 5, 2024 · Artificial Intelligence

Enhancing Taobao Display Advertising with Multimodal Representations: Challenges, Approaches, and Insights

This article presents a comprehensive study on integrating multimodal image‑text representations into large‑scale e‑commerce advertising CTR models, introducing a semantic‑aware contrastive pre‑training (SCL) method and two application algorithms (SimTier and MAKE) that together achieve over 1 % GAUC improvement and significant online gains.

CTR predictionRecommendation Systemscontrastive learning
0 likes · 21 min read
Enhancing Taobao Display Advertising with Multimodal Representations: Challenges, Approaches, and Insights
DataFunSummit
DataFunSummit
Aug 4, 2024 · Artificial Intelligence

Graph Technology Overview and Applications – From GraphGPT to Graph Databases

This article presents a comprehensive overview of recent advances in graph technology, covering GraphGPT for large language models, knowledge transfer on complex graphs, financial fraud detection, telecom network optimization, graph foundation models, Baidu's multi‑domain recommendation, high‑availability graph databases, and Kuaishou's efficient recommendation architecture.

Recommendation Systemsfinancial fraud detectiongraph databases
0 likes · 4 min read
Graph Technology Overview and Applications – From GraphGPT to Graph Databases
Alimama Tech
Alimama Tech
Aug 2, 2024 · Artificial Intelligence

Multimodal Representations Boost Taobao Display Advertising CTR

Alibaba’s advertising team introduces semantic‑aware contrastive learning to pre‑train multimodal image‑text embeddings, integrates them via SimTier and MAKE into ID‑based CTR models, achieving up to 6.9% lift in Taobao display ad click‑through rates and improving long‑tail item performance.

CTR predictionMultimodal LearningRecommendation Systems
0 likes · 21 min read
Multimodal Representations Boost Taobao Display Advertising CTR
DataFunSummit
DataFunSummit
Jul 29, 2024 · Artificial Intelligence

Large Language Models for Recommendation Systems: Current Progress, Challenges, and Future Directions

This article reviews the state‑of‑the‑art applications of large language models in recommendation systems, summarizing background knowledge, recent advances such as LLM4Rec, various tuning strategies, agent‑based approaches, open research problems, and future directions for generative recommendation.

AIIn-Context LearningLLM
0 likes · 24 min read
Large Language Models for Recommendation Systems: Current Progress, Challenges, and Future Directions
Meituan Technology Team
Meituan Technology Team
Jul 25, 2024 · Artificial Intelligence

Selected Meituan Papers Accepted at KDD 2024: Summaries of Five Long Papers

Meituan’s five long papers accepted at KDD 2024 introduce a dual‑intent model for search‑recommendation, a joint auction mechanism for ads, a robust ATE estimator for heavy‑tailed metrics, a decision‑focused causal learning framework for marketing, and an efficient on‑demand order‑pooling system for real‑time courier assignments.

Controlled ExperimentsKDD 2024Recommendation Systems
0 likes · 12 min read
Selected Meituan Papers Accepted at KDD 2024: Summaries of Five Long Papers
NewBeeNLP
NewBeeNLP
Jul 22, 2024 · Artificial Intelligence

How Meta Scales User Modeling for Ads: Inside the SUM Framework

This article examines Meta's SUM (Scaling User Modeling) system, detailing its upstream‑downstream architecture, the SOAP online asynchronous serving platform, production optimizations, and extensive offline and online experiments that demonstrate significant gains in ad personalization performance.

Deep LearningMetaRecommendation Systems
0 likes · 19 min read
How Meta Scales User Modeling for Ads: Inside the SUM Framework
DataFunSummit
DataFunSummit
Jul 10, 2024 · Artificial Intelligence

Applying Large Language Models to Recommendation Systems at Ant Group

The article presents Ant Group's research on integrating large language models into recommendation pipelines, covering background challenges, knowledge extraction, teacher‑model distillation, efficient deployment, experimental results, and future directions to improve accuracy and reduce bias.

AILLMRecommendation Systems
0 likes · 13 min read
Applying Large Language Models to Recommendation Systems at Ant Group
NewBeeNLP
NewBeeNLP
Jul 5, 2024 · Artificial Intelligence

Unveiling Meta’s Wukong: How Scaling Laws Boost Large‑Scale Recommendation Performance

Meta’s new paper introduces the Wukong model, demonstrating that expanding dense‑layer parameters and computational FLOPs in large‑scale recommendation systems follows a clear scaling law, yielding consistent performance gains across massive internal datasets, with detailed analysis of feature modules, parameter impacts, and experimental results.

CTR modelsDeep LearningMeta
0 likes · 10 min read
Unveiling Meta’s Wukong: How Scaling Laws Boost Large‑Scale Recommendation Performance
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Jun 20, 2024 · Artificial Intelligence

Xiaohongshu 2024 Large Model Frontier Paper Sharing Live Event

On June 27, 2024, Xiaohongshu’s technical team will livestream a two‑hour session across WeChat Channels, Bilibili, Douyin and Xiaohongshu, showcasing six top‑conference papers on large‑model advances—including early‑stopping and fine‑grained self‑consistency, novel evaluation methods, negative‑sample‑assisted distillation, and LLM‑based note recommendation—followed by a Q&A and recruitment briefing.

AI researchModel EvaluationRecommendation Systems
0 likes · 12 min read
Xiaohongshu 2024 Large Model Frontier Paper Sharing Live Event
DataFunSummit
DataFunSummit
Jun 17, 2024 · Artificial Intelligence

Strategies for Reducing Cost and Improving Efficiency in Recommendation Systems with Alibaba Cloud PAI‑Rec

This article discusses how Alibaba Cloud’s AI platform PAI‑Rec reduces recommendation system costs and boosts efficiency by optimizing training resources, leveraging FeatureStore, EasyRec and TorchEasyRec frameworks, detailing workflow stages, feature consistency, GPU acceleration, componentized model configuration, and practical deployment timelines.

AI PlatformFeature StoreGPU Acceleration
0 likes · 14 min read
Strategies for Reducing Cost and Improving Efficiency in Recommendation Systems with Alibaba Cloud PAI‑Rec