Tagged articles
28 articles
Page 1 of 1
AI Engineer Programming
AI Engineer Programming
May 16, 2026 · Artificial Intelligence

How to Boost RAG Retrieval Quality: Real‑World Cost‑Benefit Analysis

This article examines practical ways to improve Retrieval‑Augmented Generation (RAG) retrieval quality—covering vector database choices, data chunking, embedding models, query expansion, and re‑ranking—while weighing performance gains against operational costs through multiple real‑world case studies.

LLMRAGcost-benefit
0 likes · 16 min read
How to Boost RAG Retrieval Quality: Real‑World Cost‑Benefit Analysis
Lao Guo's Learning Space
Lao Guo's Learning Space
May 6, 2026 · Artificial Intelligence

Why Your RAG Keeps Missing the Mark: Enterprise‑Level Pitfall Guide

This article examines why Retrieval‑Augmented Generation systems that work in demos often fail in production, detailing common pitfalls—from chunking and vector‑database selection to hybrid retrieval and re‑ranking—and offers concrete strategies, configuration tips, and a decision tree to build reliable enterprise‑grade RAG solutions.

Enterprise AIHybrid RetrievalRAG
0 likes · 12 min read
Why Your RAG Keeps Missing the Mark: Enterprise‑Level Pitfall Guide
Machine Heart
Machine Heart
Apr 8, 2026 · Artificial Intelligence

Can Generative Reasoning Re‑ranking Unlock New Gains for LLM‑Based Recommender Systems?

The article analyzes a recent paper that introduces a generative reasoning re‑ranker for LLM‑driven recommendation, detailing its SFT and RL training pipeline, semantic‑ID embedding, target vs. reject sampling strategies, and experimental gains of 2.4% Recall@5 and 1.3% NDCG@5 over the OneRec‑Think baseline.

Generative ReasoningLLMSupervised Fine‑Tuning
0 likes · 9 min read
Can Generative Reasoning Re‑ranking Unlock New Gains for LLM‑Based Recommender Systems?
Data STUDIO
Data STUDIO
Mar 9, 2026 · Artificial Intelligence

Boost RAG Accuracy from 60% to 94% with 11 Proven Strategies

This article dissects why naive Retrieval‑Augmented Generation (RAG) often yields only 60% accuracy, then presents eleven concrete ingestion, query, and hybrid techniques—complete with code samples, performance trade‑offs, and real‑world case studies—that together can raise RAG accuracy to 94% while outlining practical implementation roadmaps and common pitfalls.

EmbeddingKnowledge GraphLLM
0 likes · 31 min read
Boost RAG Accuracy from 60% to 94% with 11 Proven Strategies
DeWu Technology
DeWu Technology
Feb 11, 2026 · Artificial Intelligence

How Generative Models Transform Re‑ranking Architecture for Faster, More Diverse Recommendations

This article examines the evolution of re‑ranking systems from traditional pointwise models to a two‑stage generation‑evaluation framework, compares autoregressive and non‑autoregressive generative approaches, details inference speed optimizations with GPU and model‑server upgrades, and outlines a future end‑to‑end sequence generation architecture enhanced by reinforcement learning and contrastive learning.

AIGenerative ModelsInference Optimization
0 likes · 14 min read
How Generative Models Transform Re‑ranking Architecture for Faster, More Diverse Recommendations
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Oct 23, 2025 · Artificial Intelligence

Boost Your RAG Bot’s Accuracy: Hybrid Search, Query Rewriting, and Re‑ranking Explained

This article walks developers through three essential upgrades for Retrieval‑Augmented Generation systems—hybrid search combining vector and keyword retrieval, query rewriting to clarify conversational inputs, and re‑ranking with a cross‑encoder—providing step‑by‑step code examples using LangChain to dramatically improve answer quality.

AIHybrid SearchLangChain
0 likes · 9 min read
Boost Your RAG Bot’s Accuracy: Hybrid Search, Query Rewriting, and Re‑ranking Explained
JD Cloud Developers
JD Cloud Developers
Sep 9, 2025 · Artificial Intelligence

How JD’s PODM‑MI Framework Revolutionized E‑commerce Search Ranking

This article recounts a JD engineer’s journey from theory to practice, detailing the development of the PODM‑MI re‑ranking framework, its three‑layer distribution‑based design, the discovery of a novel SID bottleneck, and the resulting multi‑million‑order impact validated at SIGIR 2024.

E-commerce AISIGIRdistribution modeling
0 likes · 8 min read
How JD’s PODM‑MI Framework Revolutionized E‑commerce Search Ranking
DaTaobao Tech
DaTaobao Tech
Aug 25, 2025 · Artificial Intelligence

Mastering RAG: From Quick Start to Deep Optimization Strategies

This article dives into the practical implementation of Retrieval‑Augmented Generation (RAG), covering document chunking, semantic and reverse HyDE indexing, embedding, hybrid search, and re‑ranking techniques, and provides concrete code examples and optimization tips for building high‑performance AI applications.

EmbeddingHybrid SearchRAG
0 likes · 18 min read
Mastering RAG: From Quick Start to Deep Optimization Strategies
JD Tech
JD Tech
Jul 11, 2025 · Artificial Intelligence

How JD’s PODM‑MI Model Revolutionizes E‑Commerce Search Ranking

JD’s algorithm engineer recounts how his team transformed e‑commerce search by developing the PODM‑MI re‑ranking framework, uncovering a novel “hourglass” bottleneck in generative retrieval, and implementing lightweight solutions that boosted diversity, relevance, and order volume, culminating in a SIGIR publication.

Gaussian modelinge‑commercelarge-scale systems
0 likes · 8 min read
How JD’s PODM‑MI Model Revolutionizes E‑Commerce Search Ranking
JD Retail Technology
JD Retail Technology
Jul 11, 2025 · Artificial Intelligence

How JD’s PODM‑MI Model Boosted E‑commerce Search Diversity and Sales

JD’s algorithm engineer describes how a three‑layer PODM‑MI re‑ranking framework, combining Gaussian preference modeling, mutual‑information optimization, and utility‑matrix fusion, overcame the hourglass bottleneck in generative retrieval, dramatically improving search diversity, user experience, and generating over ten million additional orders.

AIe‑commercelarge-scale systems
0 likes · 9 min read
How JD’s PODM‑MI Model Boosted E‑commerce Search Diversity and Sales
DataFunSummit
DataFunSummit
Jul 9, 2025 · Artificial Intelligence

How LAST Enables Real‑Time Learning for Re‑Ranking in E‑Commerce Recommendations

This article presents LAST, a novel Learning-at-Serving-Time approach that enables real‑time online learning for re‑ranking in industrial recommendation pipelines, eliminating feedback latency, detailing its architecture, challenges, experimental validation, and practical advantages over traditional online learning methods.

LAST algorithmRecommendation Systemsonline serving
0 likes · 12 min read
How LAST Enables Real‑Time Learning for Re‑Ranking in E‑Commerce Recommendations
JavaEdge
JavaEdge
Oct 2, 2024 · Artificial Intelligence

Boost RAG Retrieval Accuracy with Contextual Embeddings and BM25

This article presents a contextual retrieval technique that combines contextual embeddings and contextual BM25 to reduce RAG miss rates by up to 67%, explains the underlying methods, implementation steps, cost considerations, experimental results, and practical deployment guidance.

AIBM25Contextual Retrieval
0 likes · 17 min read
Boost RAG Retrieval Accuracy with Contextual Embeddings and BM25
NewBeeNLP
NewBeeNLP
Sep 9, 2024 · Artificial Intelligence

Can Real‑Time Learning at Serving Time Transform Recommendation Re‑ranking?

This article introduces LAST, a novel online learning approach that updates recommendation models instantly at serving time, addressing real‑time learning challenges, re‑ranking complexities, and demonstrating superior offline and online performance in industrial e‑commerce scenarios.

AILASTOnline Learning
0 likes · 12 min read
Can Real‑Time Learning at Serving Time Transform Recommendation Re‑ranking?
DataFunTalk
DataFunTalk
Aug 31, 2024 · Artificial Intelligence

Preference‑Oriented Diversity Model Based on Mutual Information for E‑commerce Search Re‑ranking (SIGIR 2024)

This paper, accepted at SIGIR 2024, introduces PODM‑MI, a preference‑oriented diversity re‑ranking model for e‑commerce search that jointly optimizes accuracy and diversity by modeling user intent with multivariate Gaussian distributions and maximizing mutual information between user preferences and candidate items.

E-commerce SearchUser Preference ModelingVariational Inference
0 likes · 11 min read
Preference‑Oriented Diversity Model Based on Mutual Information for E‑commerce Search Re‑ranking (SIGIR 2024)
JD Tech
JD Tech
Aug 26, 2024 · Artificial Intelligence

Preference-oriented Diversity Model Based on Mutual Information for E-commerce Search Re-ranking (SIGIR 2024)

This article presents the SIGIR 2024 accepted PODM‑MI model, which uses variational inference and mutual‑information maximization to jointly optimize relevance and diversity in JD e‑commerce search re‑ranking, demonstrating significant gains in both user conversion and result diversity through extensive online experiments.

DiversityE-commerce SearchPreference Modeling
0 likes · 11 min read
Preference-oriented Diversity Model Based on Mutual Information for E-commerce Search Re-ranking (SIGIR 2024)
JD Tech Talk
JD Tech Talk
Aug 26, 2024 · Artificial Intelligence

Preference-oriented Diversity Model Based on Mutual Information for E-commerce Re-ranking (SIGIR 2024)

The paper proposes PODM‑MI, a mutual‑information‑driven, preference‑oriented diversity model that jointly optimizes accuracy and diversity in e‑commerce search re‑ranking by modeling user preferences with multivariate Gaussian distributions and adapting rankings via a learnable utility matrix, showing significant gains in JD's main search experiments.

AIDiversityE-commerce Search
0 likes · 12 min read
Preference-oriented Diversity Model Based on Mutual Information for E-commerce Re-ranking (SIGIR 2024)
JD Cloud Developers
JD Cloud Developers
Aug 26, 2024 · Artificial Intelligence

Boosting E‑Commerce Re‑ranking Diversity and Accuracy with Mutual‑Information

This paper introduces PODM‑MI, a preference‑oriented diversity model that jointly optimizes relevance and diversity in e‑commerce search re‑ranking by leveraging variational inference and mutual‑information, demonstrating significant gains in both user conversion and result variety on JD.com.

DiversityE-commerce SearchPreference Modeling
0 likes · 10 min read
Boosting E‑Commerce Re‑ranking Diversity and Accuracy with Mutual‑Information
JD Retail Technology
JD Retail Technology
Aug 26, 2024 · Artificial Intelligence

Preference-oriented Diversity Model Based on Mutual Information for E-commerce Search Re-ranking (SIGIR 2024)

This article introduces PODM‑MI, a preference‑oriented diversity model that uses mutual information and variational Gaussian representations to jointly optimize accuracy and diversity in e‑commerce search re‑ranking, and reports significant online A/B test improvements on JD.com.

DiversityPreference Modelinge‑commerce
0 likes · 10 min read
Preference-oriented Diversity Model Based on Mutual Information for E-commerce Search Re-ranking (SIGIR 2024)
Sohu Tech Products
Sohu Tech Products
Dec 6, 2023 · Artificial Intelligence

Real-time Controllable Multi-Objective Re-ranking Models for Taobao Feed Recommendation

The paper introduces a real‑time controllable, multi‑objective re‑ranking framework for Taobao’s feed recommendation that combines actor‑critic reinforcement learning with hypernetworks to instantly adjust objective weights, handling diverse media and cold‑start constraints while delivering higher click‑through, diversity, and cold‑start ratios with only 20‑25 ms latency.

AlibabaReal-time ControlRecommendation Systems
0 likes · 34 min read
Real-time Controllable Multi-Objective Re-ranking Models for Taobao Feed Recommendation
DataFunTalk
DataFunTalk
Nov 14, 2023 · Artificial Intelligence

Real-Time Controllable Multi-Objective Re‑ranking for Taobao Feed

This article presents a comprehensive study of a controllable multi‑objective re‑ranking model for Taobao's information‑flow recommendation, detailing the challenges of complex feed scenarios, three modeling paradigms (V1‑V3), an actor‑critic reinforcement learning framework with hypernet‑generated weights, and extensive online evaluation results.

Real-time ControlRecommendation Systemshypernetworks
0 likes · 31 min read
Real-Time Controllable Multi-Objective Re‑ranking for Taobao Feed
Meituan Technology Team
Meituan Technology Team
Jun 16, 2022 · Artificial Intelligence

Edge AI Re‑ranking in Meituan/Dianping Search: Architecture, Algorithms, and Deployment

Meituan/Dianping’s edge‑AI re‑ranking system moves large‑scale deep‑learning models onto users’ devices, using dense networks and cloud‑served embeddings, advanced feedback‑sequence and multi‑view attention models, and aggressive compression to deliver real‑time, privacy‑preserving search personalization that boosts click‑through rates by up to 0.43 %.

Model Deploymentedge AImobile search
0 likes · 25 min read
Edge AI Re‑ranking in Meituan/Dianping Search: Architecture, Algorithms, and Deployment
Tencent Cloud Developer
Tencent Cloud Developer
Apr 7, 2022 · Artificial Intelligence

Re‑ranking in Recommendation Systems: Architecture, Techniques, and Efficiency

The article surveys the re‑ranking stage of modern recommendation pipelines, detailing its architecture after recall and precise ranking, and examining how shuffling and diversity improve user experience, while multi‑task fusion, context‑aware learning‑to‑rank, real‑time online learning, and traffic‑control strategies balance accuracy, efficiency, and business responsiveness.

DiversityOnline LearningReal-Time
0 likes · 15 min read
Re‑ranking in Recommendation Systems: Architecture, Techniques, and Efficiency
DataFunTalk
DataFunTalk
Feb 10, 2022 · Artificial Intelligence

Evolution of Re‑ranking Techniques in Kuaishou Short‑Video Recommendation System

This article details the technical evolution of Kuaishou's short‑video recommendation pipeline, focusing on sequence re‑ranking, multi‑content mixing, and on‑device re‑ranking, and explains how transformer‑based models, generator‑evaluator frameworks, and reinforcement‑learning strategies are employed to maximize overall sequence value, user engagement, and revenue.

KuaishouSequence Modelingmulti-content mixing
0 likes · 15 min read
Evolution of Re‑ranking Techniques in Kuaishou Short‑Video Recommendation System
DataFunSummit
DataFunSummit
Nov 19, 2021 · Artificial Intelligence

Sliding Spectrum Decomposition (SSD) for Diversified Recommendation in Re‑ranking

This article reviews the Sliding Spectrum Decomposition (SSD) model presented by Xiaohongshu at KDD 2021, explaining how it incorporates sliding‑window diversity into the re‑ranking stage, combines content‑based and collaborative‑filtering embeddings via the CB2CF framework, and demonstrates its effectiveness through offline and online A/B experiments.

DiversityEmbeddingSSD
0 likes · 14 min read
Sliding Spectrum Decomposition (SSD) for Diversified Recommendation in Re‑ranking
DataFunTalk
DataFunTalk
Apr 17, 2021 · Artificial Intelligence

Personalized Re-ranking for Recommendation (ResSys'19)

This article introduces a personalized re‑ranking model for recommendation systems, explaining the limitations of traditional point‑wise ranking, describing the PRM architecture with input, encoding, and output layers using multi‑head attention and pre‑trained personalization features, and presenting experimental results and future extensions.

CTRTransformerattention
0 likes · 7 min read
Personalized Re-ranking for Recommendation (ResSys'19)
DataFunTalk
DataFunTalk
Dec 17, 2020 · Artificial Intelligence

Context‑Aware Re‑ranking in Industrial Recommendation Systems: Design and Practice of a List Retrieval System

The article presents a comprehensive study of re‑ranking in large‑scale industrial recommendation pipelines, identifies four key challenges—context awareness, permutation specificity, computational complexity, and business constraints—and proposes a two‑stage List Retrieval System that combines fast sequence search and a generative re‑ranking network with a deep context‑wise model, achieving significant online gains across multiple Taobao feed scenarios.

Context-AwareDeep LearningIndustrial AI
0 likes · 28 min read
Context‑Aware Re‑ranking in Industrial Recommendation Systems: Design and Practice of a List Retrieval System
Meituan Technology Team
Meituan Technology Team
Apr 16, 2020 · Artificial Intelligence

Transformer Applications in Meituan Search Ranking: Practice and Experience

Meituan’s search ranking system integrates Transformer‑based models across feature engineering, behavior sequence modeling, and re‑ranking, adapting AutoInt‑style embeddings and multi‑stage attention mechanisms to boost QV_CTR and NDCG, while outlining future enhancements with BERT, graph neural networks, and reinforcement learning.

MeituanTransformerbehavior modeling
0 likes · 16 min read
Transformer Applications in Meituan Search Ranking: Practice and Experience
iQIYI Technical Product Team
iQIYI Technical Product Team
Jun 28, 2019 · Artificial Intelligence

Watchdog Team's TOP1 Solution for the iQIYI & ACMMM2019 Multimodal Video Person Recognition Challenge

Watchdog team won TOP1 in iQIYI & ACMMM2019 multimodal video person recognition challenge using pre‑extracted multimodal features, a 2048‑dim classifier with BCE loss, re‑ranking, DALI‑accelerated re‑detection, fine‑tuned InsightFace, and multi‑model ensembling achieving ~91% test accuracy.

Multimodal Learningfeature fusionmodel ensemble
0 likes · 12 min read
Watchdog Team's TOP1 Solution for the iQIYI & ACMMM2019 Multimodal Video Person Recognition Challenge