Tag

information retrieval

0 views collected around this technical thread.

Instant Consumer Technology Team
Instant Consumer Technology Team
Jun 4, 2025 · Artificial Intelligence

Unlocking Retrieval-Augmented Generation: Theory, Practice, and Future Trends

This comprehensive article examines Retrieval‑Augmented Generation (RAG), covering its historical evolution, core theory, implementation variants, practical code examples, diverse applications, current controversies, and future research directions within the AI and NLP landscape.

Artificial IntelligenceGenerative ModelsNatural Language Processing
0 likes · 21 min read
Unlocking Retrieval-Augmented Generation: Theory, Practice, and Future Trends
Baidu Geek Talk
Baidu Geek Talk
Apr 7, 2025 · Artificial Intelligence

COBRA: Unified Generative Recommendations with Cascaded Sparse-Dense Representations

COBRA, Baidu’s new generative retrieval framework, unifies sparse ID generation and dense vector encoding through a cascaded architecture that first predicts hierarchical IDs then refines them into dense representations, achieving state‑of‑the‑art recall, NDCG and conversion gains across public benchmarks and large‑scale advertising production.

AICobragenerative recommendation
0 likes · 13 min read
COBRA: Unified Generative Recommendations with Cascaded Sparse-Dense Representations
Architect
Architect
Mar 22, 2025 · Artificial Intelligence

Understanding and Mitigating Failures in Retrieval‑Augmented Generation (RAG) Systems

Retrieval‑augmented generation (RAG) combines external knowledge retrieval with large language models to improve answer accuracy, but it often suffers from retrieval mismatches, algorithmic flaws, chunking issues, embedding biases, inefficiencies, generation errors, reasoning limits, formatting problems, system‑level failures, and high resource costs, which this article analyzes and offers solutions for.

AI ReliabilityLLMRAG
0 likes · 32 min read
Understanding and Mitigating Failures in Retrieval‑Augmented Generation (RAG) Systems
Baidu Tech Salon
Baidu Tech Salon
Mar 21, 2025 · Artificial Intelligence

Semantic Embedding with Large Language Models: A Comprehensive Survey

This survey reviews the evolution of semantic embedding—from Word2vec and GloVe to BERT, Sentence‑BERT, and recent contrastive methods—then examines how large language models improve embeddings via synthetic data generation and backbone architectures, detailing techniques such as contrastive prompting, in‑context learning, knowledge distillation, and discussing resource, privacy, and interpretability challenges.

In-Context LearningNLPcontrastive learning
0 likes · 27 min read
Semantic Embedding with Large Language Models: A Comprehensive Survey
JD Tech
JD Tech
Feb 5, 2025 · Artificial Intelligence

Tech Insight: Highlights of Ten JD Retail Technology Papers Published in Top AI Conferences (2024)

Tech Insight presents concise overviews of ten JD retail technology papers accepted at top AI conferences in 2024, covering topics such as open‑vocabulary object detection, multi‑scenario ranking, diversity‑aware re‑ranking, a diversified product search dataset, semi‑supervised query classification, plug‑in CTR models, and methods to mitigate LLM hallucinations.

AIRankingcomputer vision
0 likes · 17 min read
Tech Insight: Highlights of Ten JD Retail Technology Papers Published in Top AI Conferences (2024)
JD Retail Technology
JD Retail Technology
Jan 21, 2025 · Artificial Intelligence

Tech Insight: Selected JD Retail Technology Papers in Artificial Intelligence (2024)

Tech Insight highlights ten 2024 JD Retail Technology AI papers presented at top conferences—including CVPR, SIGIR, WWW, AAAI and IJCAI—that advance open‑vocabulary object detection, unified search‑recommendation, pre‑ranking consistency, diversity‑aware re‑ranking, a diversified product‑search dataset, graph‑based query classification, plug‑in CTR models, parallel ad‑ranking, trajectory‑based CTR stability, and task‑aware decoding for large language models.

Artificial IntelligenceCTR predictionLarge Language Models
0 likes · 20 min read
Tech Insight: Selected JD Retail Technology Papers in Artificial Intelligence (2024)
JD Tech
JD Tech
Dec 14, 2024 · Artificial Intelligence

Generative Retrieval for E‑commerce Search: Lexical and Semantic ID Approaches

This article presents a comprehensive study of generative retrieval for large‑scale e‑commerce search, comparing lexical‑based and Semantic‑ID‑based methods, introducing a Query‑to‑MultiSpan framework, analyzing the sand‑glass distribution problem in residual quantization, and proposing heuristic and adaptive solutions to improve recall and efficiency.

AILarge Language Modelse-commerce search
0 likes · 20 min read
Generative Retrieval for E‑commerce Search: Lexical and Semantic ID Approaches
Aikesheng Open Source Community
Aikesheng Open Source Community
Nov 12, 2024 · Artificial Intelligence

ChatDBA: An AI‑Powered Database Fault Diagnosis Assistant Using Large Language Models

ChatDBA is a conversational AI system built by Shanghai Aikesheng that employs large language models and Retrieval‑Augmented Generation to help database administrators diagnose faults, learn domain knowledge, and generate or optimize SQL, with a redesigned architecture that addresses early‑stage shortcomings and outlines future enhancements.

ChatDBADatabase AIFault Diagnosis
0 likes · 10 min read
ChatDBA: An AI‑Powered Database Fault Diagnosis Assistant Using Large Language Models
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Jul 29, 2024 · Artificial Intelligence

Scaling Laws for Dense Retrieval: Empirical Study of Model Size, Training Data, and Annotation Quality

The award‑winning study shows that dense retrieval performance follows precise power‑law scaling with model size, training data quantity, and annotation quality, introduces contrast entropy for evaluation, validates joint scaling formulas on MS MARCO and T2Ranking, and uses cost models to guide budget‑optimal resource allocation.

annotation qualitycontrast entropydense retrieval
0 likes · 13 min read
Scaling Laws for Dense Retrieval: Empirical Study of Model Size, Training Data, and Annotation Quality
Cognitive Technology Team
Cognitive Technology Team
May 25, 2024 · Artificial Intelligence

Google’s Generative AI Search Tools and Their Impact on the Web: A “Toilet Theory” Perspective

The article examines Google’s new generative AI search suite, its “AI Overview” feature, and the resulting “zero‑click” phenomenon, arguing that while the technology offers convenience, it may simplify web content, undermine publishers, and reshape how users and platforms interact with information.

GoogleSearchWeb Ecosystem
0 likes · 9 min read
Google’s Generative AI Search Tools and Their Impact on the Web: A “Toilet Theory” Perspective
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Apr 28, 2024 · Artificial Intelligence

Generative Dense Retrieval: Memory Can Be a Burden

The paper introduces Generative Dense Retrieval (GDR), a two‑stage retrieval framework that first maps queries to memory‑efficient document‑cluster identifiers and then uses dense vectors to locate individual documents, achieving higher recall and better scalability than traditional generative retrieval while incurring modest latency and capacity trade‑offs.

Memory Mechanismgenerative dense retrievalinformation retrieval
0 likes · 13 min read
Generative Dense Retrieval: Memory Can Be a Burden
DataFunTalk
DataFunTalk
Mar 15, 2024 · Artificial Intelligence

Application of Agent Technology in Voice Assistant Scenarios

Senior algorithm engineer Qi Jianwei from Xiaomi presents a comprehensive overview of building a large‑model‑centric Agent framework for voice assistants, covering prompt design, information retrieval, RAG processes, and future optimization directions to enhance performance and stability.

Voice Assistantagentinformation retrieval
0 likes · 2 min read
Application of Agent Technology in Voice Assistant Scenarios
php中文网 Courses
php中文网 Courses
Feb 18, 2024 · Backend Development

Implementing Information Retrieval and SEO with PHP

This article explains the fundamentals of information retrieval and search engine optimization and provides practical PHP code examples for keyword search, full‑text search, and common SEO techniques such as keyword, internal, and external link optimization.

Full-Text SearchPHPWeb Optimization
0 likes · 7 min read
Implementing Information Retrieval and SEO with PHP
政采云技术
政采云技术
Dec 19, 2023 · Backend Development

Principles and Simple Implementation of a Search Engine in Go

This article explains the fundamental concepts of search engine technology—including forward and inverted indexes, tokenizers, stop words, synonym handling, ranking algorithms, and NLP integration—and provides a concise Go implementation with code examples and performance testing.

GoInverted IndexNLP
0 likes · 21 min read
Principles and Simple Implementation of a Search Engine in Go
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Dec 4, 2023 · Artificial Intelligence

Xiaohongshu Search Engine Innovations Presented at SIGIR-AP 2023

At SIGIR‑AP 2023 in Beijing, Xiaohongshu’s technical team unveiled four key innovations—advanced user‑intent analysis via multi‑stage LLM pre‑training, multimodal vector retrieval, generative inverted‑index enhancements, and a three‑stage relevance‑ranking pipeline with knowledge distillation—to tackle high multi‑intent, long‑tail, and multimodal search challenges for its 260 million‑user platform.

Artificial IntelligenceSIGIR-APVector Retrieval
0 likes · 13 min read
Xiaohongshu Search Engine Innovations Presented at SIGIR-AP 2023
php中文网 Courses
php中文网 Courses
Aug 31, 2023 · Backend Development

Implementing Information Retrieval and SEO with PHP

This article explains the fundamentals of information retrieval and search engine optimization, demonstrating how to implement keyword and full‑text search using PHP and MySQL, and presenting practical PHP techniques for keyword, internal, and external link optimization to improve website visibility.

Full-Text Searchbackend developmentinformation retrieval
0 likes · 6 min read
Implementing Information Retrieval and SEO with PHP
360 Tech Engineering
360 Tech Engineering
Aug 16, 2023 · Artificial Intelligence

Improving ChatGPT Real‑time Accuracy with Document Retrieval: A Practical Approach

This article examines ChatGPT's limitations in real‑time information and answer accuracy, then proposes a retrieval‑augmented method that combines up‑to‑date document search with large language models to deliver more reliable and current responses across various scenarios.

AIChatGPTNLP
0 likes · 7 min read
Improving ChatGPT Real‑time Accuracy with Document Retrieval: A Practical Approach
Architect
Architect
May 29, 2023 · Artificial Intelligence

Understanding Embeddings and Vector Databases for LLM Applications

This article explains what embeddings and vector databases are, how they are generated with models like OpenAI's Ada, why they enable semantic search and help overcome large language model token limits, and demonstrates a practical workflow for retrieving relevant document chunks using cosine similarity.

EmbeddingsLLMSemantic Search
0 likes · 7 min read
Understanding Embeddings and Vector Databases for LLM Applications
360 Quality & Efficiency
360 Quality & Efficiency
May 26, 2023 · Artificial Intelligence

Enhancing ChatGPT Real‑Time Accuracy through Document Retrieval: A Practical Approach

The article examines ChatGPT's limitations in timeliness and factual accuracy, especially for security‑related queries, and proposes a method that combines external document search with the model to deliver up‑to‑date, reliable answers across intelligent‑assistant scenarios.

Artificial IntelligenceChatGPTNLP
0 likes · 8 min read
Enhancing ChatGPT Real‑Time Accuracy through Document Retrieval: A Practical Approach
Baidu Geek Talk
Baidu Geek Talk
Mar 13, 2023 · Artificial Intelligence

Recent Advances in Sparse and Dense Retrieval for Search Engines

The article surveys recent academic advances in both sparse inverted‑index and dense semantic retrieval for large‑scale search, highlighting key papers on decision frameworks, benchmarks, sparse lexical models, dual encoders, and hybrid techniques, while discussing challenges such as single‑vector limits and proposing multi‑view and hybrid solutions.

PretrainingRankingSearch Engines
0 likes · 12 min read
Recent Advances in Sparse and Dense Retrieval for Search Engines