Understanding Retrieval‑Augmented Generation (RAG): Concepts, Types, and Development
Retrieval‑Augmented Generation (RAG) enhances large language models by fetching up‑to‑date external knowledge before generation, mitigating knowledge‑cutoff limits and hallucinations. It combines a retrieval step (full‑text, vector, or graph based) with a generation step, and it has evolved from naive single‑method approaches into advanced, modular, graph‑based, and agentic systems that support adaptive, multi‑hop reasoning and point toward intelligent, multimodal pipelines.
Retrieval‑Augmented Generation (RAG) is a technique that enhances large language models (LLMs) by retrieving external knowledge before generation, thereby improving accuracy, working around the knowledge cutoff, and reducing hallucinations.
The article first explains two fundamental problems of LLMs: knowledge cutoff (the model only knows what was present in its training data) and hallucination (producing plausible‑looking but factually incorrect text). RAG addresses both issues by allowing the model to consult up‑to‑date documents or structured data.
RAG consists of two main components:
Retrieval: query external sources such as knowledge bases, vector databases, or web‑search APIs using full‑text, vector, or graph retrieval methods.
Generation: feed the retrieved information to the LLM to produce a grounded answer.
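The two components above can be sketched in a few lines of Python. This is a toy illustration, not a production pipeline: `fake_llm` stands in for a real LLM call, the knowledge base is a plain list of strings, and retrieval is naive word overlap.

```python
# Minimal sketch of the retrieve-then-generate RAG flow.
KNOWLEDGE_BASE = [
    "RAG retrieves external documents before generation.",
    "BM25 is a classic full-text ranking function.",
    "Vector search matches queries to documents by embedding similarity.",
]

def retrieve(query: str, k: int = 2) -> list[str]:
    """Rank documents by naive word overlap with the query (toy scorer)."""
    words = set(query.lower().split())
    scored = sorted(
        KNOWLEDGE_BASE,
        key=lambda doc: len(words & set(doc.lower().split())),
        reverse=True,
    )
    return scored[:k]

def fake_llm(prompt: str) -> str:
    # A real system would call an LLM API here.
    return "Grounded answer based on:\n" + prompt

def rag_answer(query: str) -> str:
    # Generation step: retrieved passages are prepended to the question.
    context = "\n---\n".join(retrieve(query))
    prompt = f"Context:\n{context}\nQuestion: {query}"
    return fake_llm(prompt)

print(rag_answer("How does vector search work?"))
```

In a real deployment the retriever would query a vector database or search API, and `fake_llm` would be a chat-completion call with the context injected into the prompt.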
From a historical perspective, RAG has evolved through several stages:
1. Naive RAG
Uses a single retrieval method (e.g., TF‑IDF, BM25, or vector search) to fetch documents and directly augment the LLM. It is simple to implement but suffers from limited semantic understanding and sub‑optimal output quality.
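A naive TF‑IDF retriever of the kind described here can be written in pure Python. This is a brevity‑first sketch: BM25 would additionally apply term‑frequency saturation and document‑length normalization, and production systems use tuned libraries rather than hand‑rolled scoring.

```python
import math
from collections import Counter

DOCS = [
    "the cat sat on the mat",
    "dogs chase cats in the yard",
    "stock prices rose sharply today",
]

def tfidf_vectors(docs: list[str]) -> list[dict[str, float]]:
    """Build one sparse TF-IDF vector (term -> weight) per document."""
    n = len(docs)
    tokenized = [d.split() for d in docs]
    df = Counter(t for toks in tokenized for t in set(toks))  # document frequency
    vectors = []
    for toks in tokenized:
        tf = Counter(toks)
        vectors.append({t: tf[t] * math.log(n / df[t]) for t in tf})
    return vectors

def cosine(a: dict[str, float], b: dict[str, float]) -> float:
    dot = sum(a[t] * b.get(t, 0.0) for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def naive_retrieve(query: str, docs: list[str], k: int = 1) -> list[str]:
    # Vectorize the query alongside the corpus so it shares the same IDF stats.
    vecs = tfidf_vectors(docs + [query])
    qvec, dvecs = vecs[-1], vecs[:-1]
    ranked = sorted(range(len(docs)),
                    key=lambda i: cosine(qvec, dvecs[i]), reverse=True)
    return [docs[i] for i in ranked[:k]]
```

The weaknesses the article notes show up immediately: "dogs chase cats" never matches a query about "cat" because lexical methods have no notion of semantic similarity, which is what vector (embedding) retrieval adds.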
2. Advanced RAG
Improves the three phases of retrieval (pre‑retrieval, retrieval, post‑retrieval). Techniques include document enrichment, index optimization, query rewriting, fine‑tuned embeddings, reranking, and context compression, leading to more accurate and relevant results.
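The three phases can be illustrated with toy stand‑ins. The synonym table and word‑overlap reranker below are hypothetical placeholders: in practice query rewriting is usually done by an LLM, reranking by a cross‑encoder model, and compression by a summarizer or token‑budget filter.

```python
def rewrite_query(query: str) -> str:
    """Pre-retrieval: expand the query (toy synonym table stands in for an LLM)."""
    synonyms = {"buy": "buy purchase", "car": "car automobile"}
    return " ".join(synonyms.get(w, w) for w in query.split())

def rerank(query: str, candidates: list[str], k: int = 3) -> list[str]:
    """Post-retrieval: re-score an overfetched candidate list with a
    finer-grained scorer (here: word overlap) and keep only the top k."""
    q = set(query.split())
    return sorted(candidates,
                  key=lambda d: len(q & set(d.split())), reverse=True)[:k]

def compress(passages: list[str], budget: int = 200) -> str:
    """Post-retrieval: trim the concatenated context to a character budget
    so the prompt stays within the LLM's context window."""
    return "\n".join(passages)[:budget]
```

The overall pattern is overfetch‑then‑filter: retrieve more candidates than needed with a cheap scorer, then spend compute on the smaller set before it reaches the LLM.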
3. Modular RAG
Adopts a component‑based architecture where retrieval, storage, routing, and generation are independent, reusable modules. This enables mixed‑retrieval strategies, API integration, and greater flexibility for different domains.
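One way to realize this component‑based design in Python is structural typing: any class with a matching `retrieve` method is a valid module, and a router dispatches between them. The keyword‑based routing rule below is a deliberately simple stand‑in for whatever routing logic (classifier, LLM, config) a real system would use.

```python
from typing import Protocol

class Retriever(Protocol):
    """Interface every pluggable retrieval module must satisfy."""
    def retrieve(self, query: str) -> list[str]: ...

class KeywordRetriever:
    def __init__(self, docs: list[str]):
        self.docs = docs
    def retrieve(self, query: str) -> list[str]:
        q = set(query.lower().split())
        return [d for d in self.docs if q & set(d.lower().split())]

class Router:
    """Routes a query to one of several interchangeable retriever modules."""
    def __init__(self, routes: dict[str, Retriever], default: Retriever):
        self.routes, self.default = routes, default
    def retrieve(self, query: str) -> list[str]:
        for keyword, retriever in self.routes.items():
            if keyword in query.lower():
                return retriever.retrieve(query)
        return self.default.retrieve(query)

# Usage: route code questions to one corpus, everything else to another.
code_docs = KeywordRetriever(["add function implementation in python"])
text_docs = KeywordRetriever(["rag retrieves documents before generation"])
router = Router({"code": code_docs}, default=text_docs)
```

Because the router itself satisfies the `Retriever` protocol, routers can nest, and a module can be swapped (say, BM25 for a vector store) without touching the rest of the pipeline.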
4. Graph RAG
Introduces graph‑based indexes to support multi‑hop reasoning and richer contextual information, especially useful for structured or relational data.
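Multi‑hop retrieval over a graph index can be sketched as a bounded breadth‑first expansion that collects relation triples as textual context. The tiny hard‑coded graph below is illustrative; a real Graph RAG system would query a graph database or an LLM‑extracted entity graph.

```python
from collections import deque

# Toy knowledge graph: entity -> list of (relation, neighbor) edges.
GRAPH = {
    "Marie Curie": [("discovered", "Radium"), ("spouse", "Pierre Curie")],
    "Radium": [("used_in", "Radiotherapy")],
    "Pierre Curie": [("field", "Physics")],
}

def multi_hop_context(start: str, max_hops: int = 2) -> list[str]:
    """Breadth-first expansion up to max_hops from the query entity,
    collecting relation triples as textual context for the generator."""
    context: list[str] = []
    seen = {start}
    frontier = deque([(start, 0)])
    while frontier:
        node, depth = frontier.popleft()
        if depth == max_hops:
            continue  # do not expand beyond the hop budget
        for relation, neighbor in GRAPH.get(node, []):
            context.append(f"{node} --{relation}--> {neighbor}")
            if neighbor not in seen:
                seen.add(neighbor)
                frontier.append((neighbor, depth + 1))
    return context
```

A question like "what is the discovery of Marie Curie used for?" needs the two‑hop chain Marie Curie → Radium → Radiotherapy, which single‑shot document retrieval often misses but graph traversal surfaces directly.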
5. Agentic RAG
Leverages LLM‑based agents that can dynamically decide whether to retrieve, which tools to use (search engines, calculators, etc.), and how to process retrieved results. This adds intelligent decision‑making and higher retrieval accuracy for complex, multi‑domain queries.
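The decide‑then‑act loop can be sketched with rule‑based stand‑ins. Everything here is a placeholder: a real agentic RAG system would let an LLM (for example via ReAct‑style prompting) make the retrieve/tool/answer decision and would call real tool APIs.

```python
def calculator(expr: str) -> str:
    """Toy tool: evaluates only 'a + b' expressions."""
    a, _, b = expr.partition("+")
    return str(float(a) + float(b))

def search(query: str) -> str:
    """Toy tool standing in for a web-search or knowledge-base API call."""
    return f"[search results for: {query}]"

def agent(query: str) -> str:
    """Rule-based stand-in for an LLM agent deciding whether to retrieve
    and which tool to invoke."""
    if "+" in query and any(ch.isdigit() for ch in query):
        return calculator(query)          # arithmetic -> calculator tool
    if query.lower().startswith(("who", "what", "when", "latest")):
        return search(query)              # factual/recent -> retrieval
    return "Answered from parametric knowledge: " + query
```

The key difference from earlier stages is that retrieval becomes conditional: the agent may skip it entirely, chain multiple tools, or re‑retrieve after inspecting intermediate results.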
The article also presents a comparative table summarizing the characteristics, advantages, and typical use‑cases of each RAG type.
Future directions highlighted include:
Intelligent RAG: deeper integration of agentic capabilities to make retrieval more adaptive.
Data diversification: unifying heterogeneous data formats (text, graphs, code, images) within a single RAG pipeline.
Overall, the piece serves as a comprehensive guide for developers and researchers interested in improving LLM performance through retrieval‑augmented techniques.
Tencent Cloud Developer
Official Tencent Cloud community account that brings together developers, shares practical tech insights, and fosters an influential tech exchange community.