Tagged articles

Retrieval-Augmented Generation

165 articles · Page 1 of 2

Jun 29, 2026 · Artificial Intelligence

Managing LLM Hallucinations: Strategies, Metrics, and Layered Controls

The article examines why large language models hallucinate, categorizes factual, faithfulness, and reasoning hallucinations, critiques existing benchmarks, and proposes a layered governance framework—including training‑time RLHF/DPO, retrieval‑augmented generation, post‑generation verification, uncertainty quantification, and compliance considerations—to mitigate risks in production systems.

Evaluation · Hallucination · LLM

0 likes · 13 min read

DataFunTalk

Jun 26, 2026 · Artificial Intelligence

Building an Enterprise‑Grade RAG 2.0 System: Architecture, Challenges, and Best Practices

This article examines how large‑model shortcomings such as hallucination, staleness, and data‑privacy risks are mitigated by Retrieval‑Augmented Generation, and walks through a layered enterprise‑grade RAG 2.0 design—including offline document parsing, multi‑turn query rewriting, hybrid vector‑plus‑full‑text retrieval, two‑stage ranking, knowledge filtering, and prompt‑driven generation—while sharing concrete model choices, evaluation metrics, and lessons learned.

Document Parsing · Enterprise AI · Hybrid Retrieval

0 likes · 23 min read

Coder Trainee

Jun 20, 2026 · Artificial Intelligence

Java RAG Tutorial: Vector Search and Knowledge‑Base Integration

This article explains how to equip a Java application with Retrieval‑Augmented Generation (RAG) so large language models can access private PDFs, Word files, and internal documents, covering the core architecture, two implementation paths using LangChain4j and Spring AI, vector‑store options, and practical tuning techniques.

Java · LangChain4j · RAG

0 likes · 12 min read

PaperAgent

Jun 18, 2026 · Artificial Intelligence

How FlowRAG Evolves GraphRAG to Let Evidence Chains Flow Automatically

The article examines FlowRAG, a new variant of GraphRAG that shifts retrieval from similarity‑based text chunk ranking to constructing explicit, frequency‑aware reasoning paths, detailing its three‑step design, benchmark improvements, efficiency gains, and ablation results that reveal how it mitigates entity sparsity and noise propagation.

Dual-Granularity Activation · FlowRAG · Frequency-aware weighting

0 likes · 8 min read

DeepHub IMBA

Jun 16, 2026 · Artificial Intelligence

10 Essential LangChain & LangGraph Concepts Every AI Engineer Must Master

The article outlines ten core concepts—State, Node, Chain vs Graph, Routing, Retrieval, Structured Output, Streaming, Memory, Checkpointing, and Human‑in‑the‑Loop—explaining why they are crucial for building reliable, scalable AI agents and showing concrete Python examples for each.

AI Agents · LangChain · LangGraph

0 likes · 11 min read

ZhiKe AI

Jun 15, 2026 · Artificial Intelligence

Why AI Hallucinates and How Retrieval-Augmented Generation Gives It a Research Assistant

Retrieval-Augmented Generation (RAG) equips large language models with a three‑step "retrieve‑augment‑generate" workflow, turning closed‑book AI into an open‑book system that lowers hallucinations, updates knowledge in real time, and improves answer accuracy, though it still faces retrieval errors and reasoning limits.

AI hallucination · Enterprise AI · Retrieval-Augmented Generation

0 likes · 5 min read

AI Engineer Programming

Jun 14, 2026 · Artificial Intelligence

10 RAG Architectures Every AI Engineer Should Master

The article debunks the claim that Retrieval‑Augmented Generation is obsolete, explains why huge context windows are impractical, and systematically presents ten RAG patterns—from basic Naïve RAG to advanced Graph and Multimodal RAG—detailing their trade‑offs, costs, and suitable use cases.

AI Architecture · Embedding Models · RAG

0 likes · 16 min read

DataFunTalk

Jun 13, 2026 · Artificial Intelligence

Building an Enterprise‑Grade RAG 2.0 System: Architecture, Challenges, and Best Practices

This article examines the practical challenges of deploying Retrieval‑Augmented Generation (RAG) in enterprise settings, detailing the modular architecture, offline and online pipelines, hybrid retrieval, multi‑stage ranking, knowledge filtering, and two‑stage generation techniques that together improve search completeness, ranking quality, and answer accuracy.

Enterprise AI · Hybrid Search · Knowledge Graph

0 likes · 21 min read

DataFunTalk

Jun 10, 2026 · Artificial Intelligence

Building an Enterprise‑Grade RAG 2.0 System: Architecture, Challenges, and Practices

This article analyses the enterprise‑level RAG 2.0 solution, covering its background problems, layered architecture, offline and online pipelines, document parsing, multi‑turn query rewriting, hybrid vector‑plus‑BM25 retrieval, ranking models such as RRF, ColBERT and cross‑encoder, knowledge filtering, two‑stage generation with FoRAG, and practical evaluation metrics.

Document Parsing · Enterprise AI · Hybrid Retrieval

0 likes · 22 min read

DataFunSummit

Jun 9, 2026 · Artificial Intelligence

From Poor RAG Performance to Production‑Ready Systems: A Deep Technical Walkthrough

The article dissects why early RAG deployments suffer from low recall, hallucinations and runaway costs, then presents a step‑by‑step diagnostic framework, hybrid search architecture, knowledge‑engineering tricks, caching and routing strategies, and explores advanced GraphRAG and Agentic RAG techniques to build reliable, enterprise‑grade solutions.

Agentic RAG · GraphRAG · Hybrid Search

0 likes · 20 min read

DataFunSummit

Jun 6, 2026 · Artificial Intelligence

From Traffic Links to Task Management: 1688’s Agentic AI Evolution

The article details how 1688 transformed its platform from a traditional intent‑matching traffic hub into an Agentic AI system that understands business tasks, outlining a three‑step implementation of knowledge, trajectory and environment redesign, dual‑track evolution, novel evaluation methods, and the emerging role of product managers as evaluation engineers.

Agentic AI · Large Language Model · Retrieval-Augmented Generation

0 likes · 13 min read

AI Engineer Programming

Jun 5, 2026 · Artificial Intelligence

Multi‑Hop Reasoning vs Document Parsing: Comparing GraphRAG, LightRAG, AgenticRAG and RAGFlow

The article analyzes the classic vector RAG pipeline, highlights its shortcomings for multi‑hop reasoning and global theme inference, and then systematically compares four open‑source frameworks—GraphRAG, LightRAG, AgenticRAG and RAGFlow—detailing their design choices, processing stages, trade‑offs, limitations, and practical selection guidance for production use.

AgenticRAG · GraphRAG · Knowledge Graph

0 likes · 17 min read

Java Architect Handbook

Jun 3, 2026 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and Why It Matters for LLM Interviews

The article explains Retrieval‑Augmented Generation (RAG), why large language models suffer from hallucination, knowledge cutoff, domain gaps and traceability issues, and how RAG’s offline‑online pipeline, comparison with fine‑tuning and long‑context approaches, and emerging trends like Agentic and Graph‑RAG can be discussed in technical interviews.

AI interview · Large Language Model · Prompt Engineering

0 likes · 12 min read

DeepHub IMBA

May 31, 2026 · Artificial Intelligence

Chunking Strategies for Video RAG: Pause‑Based, Sliding‑Window, and LLM‑Driven Methods

The article examines how to chunk transcribed video text for Retrieval‑Augmented Generation, comparing pause‑based, overlapping‑window, length‑based fallback, and LLM‑driven topic chunking methods, and shows how combining fine‑grained and thematic chunks yields a multi‑layered pipeline that improves context coverage for both precise and broad queries.

Chunking · LLM · RAG

0 likes · 8 min read

PaperAgent

May 28, 2026 · Artificial Intelligence

AgenticRAG Delivers 5.9× Recall Boost in Enterprise Retrieval – Real‑World Pre‑Production Results

The article analyzes Microsoft’s AgenticRAG, a tool‑based RAG framework that lets LLMs control retrieval, showing up to a 5.9× recall improvement over standard methods, reduced need for fine‑tuning, and practical design insights from pre‑production deployment.

AgenticRAG · Claude · GPT-5-mini

0 likes · 12 min read

DataFunTalk

May 24, 2026 · Artificial Intelligence

Engineering and Algorithm Innovations for RAG Engines in Office Scenarios

The article analyzes the challenges of deploying large language models in enterprise settings and presents a modular Retrieval‑Augmented Generation (RAG) solution that combines document parsing, multi‑turn query rewriting, hybrid vector‑plus‑BM25 retrieval, two‑stage ranking (RRF, ColBERT, cross‑encoder) and knowledge‑filtered prompt engineering to achieve more comprehensive search, better ranking and more accurate answers.

Document Parsing · Hybrid Retrieval · Knowledge Filtering

0 likes · 22 min read

Tencent Tech

May 20, 2026 · Artificial Intelligence

The Three Evolutions of AI Engineering: Prompt, Context, and Harness

This article analyzes the progressive stages of AI‑driven software engineering—Prompt Engineering, Context Engineering, and Harness Engineering—illustrating how each addresses specific challenges, presenting real‑world experiments from OpenAI and Anthropic, and outlining a roadmap for engineers to master the new paradigm.

AI Agents · Harness Engineering · Prompt Engineering

0 likes · 19 min read

SuanNi

May 20, 2026 · Artificial Intelligence

AI‑Powered Research Workflow: When to Trust the Tools and When to Supervise

The article surveys AI‑assisted research across the full lifecycle—creation, writing, validation, and dissemination—detailing the capabilities of prompt engineering, retrieval‑augmented generation, training‑free agents and hybrid methods, reporting benchmark numbers, failure modes, and governance challenges that dictate when human oversight remains essential.

AI research automation · Governance · Prompt Engineering

0 likes · 17 min read

Tech Minimalism

May 16, 2026 · Artificial Intelligence

One‑page guide to the three RAG architectures: Classic, Graph, and Agentic

The article explains why plain large language models cannot answer internal company questions, introduces Retrieval‑Augmented Generation (RAG) as a solution, and compares three RAG variants—Classic, Graph, and Agentic—detailing their workflows, strengths, limitations, and how to choose the right one for a given problem.

Agentic RAG · Classic RAG · Graph RAG

0 likes · 17 min read

Lao Guo's Learning Space

May 12, 2026 · Artificial Intelligence

Demystifying the Core Technologies Behind ChatGPT, GPT‑4, and DeepSeek

This article breaks down the key algorithms that power large‑language models—Transformer, Mixture‑of‑Experts, Flash Attention, KV‑Cache, Multi‑Token Prediction, quantization, Chain‑of‑Thought and Retrieval‑Augmented Generation—explaining how each contributes to the performance of ChatGPT, GPT‑4 and DeepSeek.

Chain-of-Thought · Flash Attention · KV cache

0 likes · 10 min read

James' Growth Diary

May 12, 2026 · Artificial Intelligence

GraphRAG Deep Dive: Boost Multi‑Hop Reasoning Accuracy from 50% to 85% with Knowledge Graphs

This article explains why traditional vector RAG loses relational information, how GraphRAG reconstructs entity‑relationship triples into a knowledge graph, and provides step‑by‑step code, performance benchmarks, retrieval modes, and practical tips that raise multi‑hop reasoning accuracy from around 50% to 85%.

GraphRAG · Knowledge Graph · LangChain

0 likes · 14 min read

Linyb Geek Road

May 12, 2026 · Artificial Intelligence

10 Open‑Source Tools Cutting AI Agent Costs Ten‑Fold: Prompt Compression, Memory Management, Model Routing

The article explains how AI agents become expensive because they ingest massive, irrelevant context and shows ten open‑source projects—LLMLingua, mem0, LiteLLM, LlamaIndex + Chroma, Letta, Guidance, Aider, tiktoken + ttok—that compress prompts, manage memory, route models dynamically, add retrieval‑augmented generation, and enforce token budgeting, collectively reducing daily token usage by millions and slashing costs dramatically.

AI Agents · Memory Management · Retrieval-Augmented Generation

0 likes · 17 min read

James' Growth Diary

May 9, 2026 · Artificial Intelligence

Agentic RAG Deep Dive: Letting the Agent Decide When and How Often to Retrieve

The article analyzes the shortcomings of traditional one‑shot RAG pipelines, introduces four Agentic RAG patterns that let an LLM‑driven agent control retrieval strategy, source selection, query rewriting and retry limits, and provides concrete TypeScript implementations with LangGraph, code snippets, and practical pitfalls.

Agentic RAG · LLM · LangGraph

0 likes · 16 min read

Lao Guo's Learning Space

May 6, 2026 · Artificial Intelligence

Why Your RAG Keeps Missing the Mark: Enterprise‑Level Pitfall Guide

This article examines why Retrieval‑Augmented Generation systems that work in demos often fail in production, detailing common pitfalls—from chunking and vector‑database selection to hybrid retrieval and re‑ranking—and offers concrete strategies, configuration tips, and a decision tree to build reliable enterprise‑grade RAG solutions.

Chunking · Enterprise AI · Hybrid Retrieval

0 likes · 12 min read

DataFunSummit

May 4, 2026 · Artificial Intelligence

Inside Alibaba Cloud AI Search: Agentic RAG Architecture and Multi‑Agent Techniques

Alibaba Cloud AI Search tackles high‑concurrency, multimodal, and multi‑hop queries by evolving its Agentic RAG architecture from a single agent to a coordinated multi‑agent system that integrates planning, retrieval, and generation, leverages hybrid vector‑text‑DB‑graph recall, GPU‑accelerated indexing, quantization, NL2SQL, and multimodal search, with performance data and real‑world case studies.

AI Search · Agentic RAG · Alibaba Cloud

0 likes · 6 min read

DataFunTalk

May 4, 2026 · Artificial Intelligence

Engineering and Algorithm Innovations for RAG Engines in Office Applications

This article analyzes the challenges and practical solutions of building a Retrieval‑Augmented Generation (RAG) system for office scenarios, covering background issues, modular architecture, offline and online pipelines, hybrid retrieval, ranking models, knowledge filtering, prompt design, and two‑stage generation techniques.

AI · Document Parsing · Hybrid Retrieval

0 likes · 22 min read

DataFunSummit

May 3, 2026 · Artificial Intelligence

From Flawed to Production-Ready: Deep Dive into Building Enterprise-Grade RAG Systems

The article analyzes why early RAG deployments often fall short, dissects the most common technical pain points—from document parsing to vector overload—and presents a systematic roadmap that includes hybrid search, reranking, GraphRAG, Agentic RAG, model selection, scalability tricks, and security controls for robust B‑side production.

Agentic RAG · Enterprise AI · GraphRAG

0 likes · 20 min read

AI Engineer Programming

May 3, 2026 · Artificial Intelligence

From Single Retrieval to Autonomous Reasoning: Understanding Agentic RAG

The article analyzes why traditional Retrieval‑Augmented Generation fails on multi‑hop, vague, or multi‑source queries and explains how Agentic RAG uses an LLM‑driven agent loop to make dynamic retrieval decisions, outlining its architecture, suitable scenarios, and limitations.

AI reasoning · Agentic RAG · LLM Agents

0 likes · 7 min read

Spring Full-Stack Practical Cases

May 3, 2026 · Artificial Intelligence

9 Advanced Retrieval‑Augmented Generation (RAG) Architectures Explained

This article introduces Retrieval‑Augmented Generation (RAG) and systematically details nine distinct RAG architectures—standard, conversational with memory, corrective (CRAG), adaptive, self‑RAG, fusion, HyDE, agentic, and Graph RAG—highlighting their workflows, real‑world examples, advantages, and trade‑offs.

AI Architecture · GraphRAG · LLM

0 likes · 17 min read

MaGe Linux Operations

Apr 28, 2026 · Artificial Intelligence

Why Your RAG Performance Is Poor: Common Issues and Optimization Strategies

This article systematically analyzes why Retrieval‑Augmented Generation pipelines often underperform—covering embedding model selection, chunking strategies, hybrid retrieval, reranking, context window waste, evaluation metrics, and a detailed troubleshooting checklist—while providing concrete code examples and best‑practice recommendations for engineers.

Chunking · Embedding · Evaluation

0 likes · 19 min read

PMTalk Product Manager Community

Apr 28, 2026 · Artificial Intelligence

First Principle for Agent Product Managers: Choosing Between Single Agent, Multi‑Agent Collaboration, and Workflow

The article presents a decision framework for AI product managers, mapping workflow determinism and context certainty to four technical patterns—traditional RPA + AI, single Agent + RAG/knowledge graph, end‑to‑end RL Agent, and multi‑Agent collaboration—each with concrete use‑case examples and selection guidelines.

AI Agents · Multi-Agent Systems · RPA

0 likes · 6 min read

AI Illustrated Series

Apr 27, 2026 · Artificial Intelligence

Comprehensive RAG Interview Q&A: 22 In-Depth Questions and Answers

This extensive interview guide covers 22 core RAG questions, detailing the definition, workflow, embedding selection, vector database choices, retrieval optimization, multi‑turn handling, context compression, evaluation metrics, knowledge‑graph integration, operational challenges, Agentic and hybrid RAG, document update strategies, similarity algorithms, and hallucination mitigation, providing concrete examples and practical advice for AI interview preparation.

AI interview · Embedding · RAG

0 likes · 29 min read

DataFunTalk

Apr 26, 2026 · Artificial Intelligence

Building an Enterprise‑Grade RAG 2.0 System: Architecture, Challenges, and Best Practices

This article analyses the practical construction of an enterprise‑level Retrieval‑Augmented Generation (RAG) 2.0 system, covering background issues of large models, a modular architecture, layered offline/online pipelines, hybrid retrieval, ranking strategies, prompt engineering, and deployment insights drawn from China Mobile’s production experience.

Enterprise AI · Hybrid Retrieval · Prompt Engineering

0 likes · 22 min read

DataFunSummit

Apr 22, 2026 · Artificial Intelligence

From Flawed RAG to Production‑Ready: Deep Dive into Scaling Retrieval‑Augmented Generation

This expert roundtable dissects why RAG often fails in production—low recall, hallucinations, cost overruns—and walks through concrete diagnostics, hybrid search designs, knowledge‑engineering tricks, GraphRAG and Agentic RAG advances, plus practical deployment, security, and cost‑optimization guidelines.

AI Deployment · Agentic RAG · Hybrid Search

0 likes · 20 min read

MeowKitty Programming

Apr 21, 2026 · Backend Development

2026 AI Priorities for Java Developers: Structured Output, RAG, and Observability

While many Java teams chase flashy AI demos and agents, the real 2026 focus has shifted to engineering concerns—ensuring model outputs reliably map to Java objects, integrating Retrieval‑Augmented Generation into robust data pipelines, and adding observability so AI services can be monitored and debugged like traditional back‑end components.

AI · LangChain4j · Observability

0 likes · 7 min read

AI Architect Hub

Apr 20, 2026 · Artificial Intelligence

Why LLMs Need RAG: Overcoming Core Limitations and Building Scalable AI Solutions

This article analyzes the fundamental shortcomings of large language models for enterprise use, explains how Retrieval‑Augmented Generation (RAG) bridges those gaps through a detailed offline‑online workflow, and explores emerging trends that will shape the next generation of intelligent AI architectures.

AI Architecture · Enterprise AI · Future AI

0 likes · 10 min read

Big Data and Microservices

Apr 20, 2026 · Artificial Intelligence

Why AI Hallucinates and How RAG Turns It into an Open‑Book Test

The article explains why large language models often fabricate facts, introduces Retrieval‑Augmented Generation (RAG) as a way to ground responses with external data, walks through its four‑step workflow, showcases practical use cases, and highlights the limitations and best practices for deploying RAG.

AI · Hallucination · Knowledge Base

0 likes · 12 min read

AI Architect Hub

Apr 19, 2026 · Artificial Intelligence

Mastering RAG: From Data Cleaning to Vector DBs in AI Applications

This article introduces the second stage of a large‑model application series, detailing the value of Retrieval‑Augmented Generation (RAG), its architecture, and a step‑by‑step outline covering data cleaning, text chunking, vectorization, vector‑DB selection, recall strategies, reranking, and prompt construction.

AI · LLM · Prompt Engineering

0 likes · 4 min read

Su San Talks Tech

Apr 19, 2026 · Artificial Intelligence

Boost Enterprise RAG: Data Pipeline Tricks, Hybrid Search & Rerank

To make Retrieval‑Augmented Generation reliable in production, the article outlines five key engineering tactics—semantic chunking with metadata, hybrid vector‑keyword search, two‑stage retrieval with reranking, query rewriting and expansion, and dynamic result evaluation—each illustrated with concrete examples and code snippets.

AI Engineering · Hybrid Search · Metadata

0 likes · 10 min read

Machine Learning Algorithms & Natural Language Processing

Apr 17, 2026 · Artificial Intelligence

When RAG Retrieves the Right Docs but Still Answers Wrong: Insights from Saarland University (ACL 2026)

The article explains why conventional Retrieval‑Augmented Generation often produces incorrect answers despite retrieving relevant documents, introduces the Disco‑RAG framework that adds a structured reading step using argument trees and relation graphs, and shows how this three‑step approach dramatically improves performance on long‑document and ambiguous‑question benchmarks without any model training.

Disco-RAG · RAG · Retrieval-Augmented Generation

0 likes · 13 min read

DataFunSummit

Apr 17, 2026 · Artificial Intelligence

Why RAG Projects Fail: Real‑World Pitfalls and Proven Solutions

This article dissects the hype‑versus‑reality gap of Retrieval‑Augmented Generation in enterprises, exposing low recall, hallucinations, and cost overruns, then offers a systematic diagnosis, hybrid search, reranking, security controls, and advanced GraphRAG and Agentic RAG strategies to achieve reliable production deployments.

Enterprise AI · LLM · RAG

0 likes · 17 min read

DataFunTalk

Apr 15, 2026 · Artificial Intelligence

Building a Production‑Ready RAG System for Enterprise Knowledge Work

This article analyzes the challenges and practical solutions of deploying Retrieval‑Augmented Generation (RAG) in an enterprise office setting, covering background problems, modular architecture, offline and online pipelines, hybrid retrieval, multi‑stage ranking, knowledge filtering, prompt engineering, and model selection to achieve accurate, reliable answers.

Enterprise AI · Hybrid Retrieval · RAG

0 likes · 21 min read

IT Services Circle

Apr 14, 2026 · Artificial Intelligence

What Is RAG? A Complete Guide to Retrieval‑Augmented Generation for AI Engineers

This article explains Retrieval‑Augmented Generation (RAG), covering why large language models need external knowledge, the full offline‑and‑online workflow, document chunking, embedding evolution, vector database choices, multi‑path retrieval, evaluation metrics, hallucination types, and practical strategies to mitigate them.

AI evaluation · Embedding · RAG

0 likes · 55 min read

Spring Full-Stack Practical Cases

Apr 11, 2026 · Artificial Intelligence

Master AI Fundamentals: Tokens, Context Windows, Temperature, Hallucinations & RAG

This article breaks down five essential AI concepts—tokens, context windows, temperature settings, hallucinations, and retrieval‑augmented generation—explaining how they work, why they matter, and how to apply them effectively when building or using large language model applications.

AI Fundamentals · Hallucination · Retrieval-Augmented Generation

0 likes · 12 min read

Bighead's Algorithm Notes

Apr 7, 2026 · Artificial Intelligence

AutoHypo-Fin: Tsinghua's Web-Mining Method to Auto-Generate and Backtest Market Hypotheses

AutoHypo‑Fin is an end‑to‑end framework that harvests large‑scale web financial data, extracts entities via large language models, builds a temporal knowledge graph, uses retrieval‑augmented generation and statistical backtesting to automatically create, test, and iteratively optimize trading hypotheses, achieving superior risk‑adjusted returns compared with baseline strategies in experiments from 2019‑2024.

AutoHypo-Fin · Knowledge Graph · LLM

0 likes · 11 min read

IT Services Circle

Apr 6, 2026 · Artificial Intelligence

Mastering RAG Interview Questions: A Complete Retrieval Optimization Blueprint

This article breaks down the full RAG retrieval pipeline—from query understanding and rewriting, through hybrid retrieval and reranking, to chunking, context compression, and dynamic routing—providing concrete techniques, formulas, and performance metrics to help candidates ace interview questions on RAG systems.

Cross-Encoder · Hard Negative Mining · Hybrid Retrieval

0 likes · 16 min read

DataFunSummit

Apr 1, 2026 · Artificial Intelligence

Why RAG Fails in Production and How to Fix It: Expert Insights

This article analyzes why Retrieval‑Augmented Generation (RAG) often underperforms in enterprise production, identifies eight common pitfalls—from document parsing to token costs—and offers a systematic roadmap of diagnostics, hybrid search, reranking, and deployment strategies presented by leading AI experts.

AI · Enterprise · RAG

0 likes · 18 min read

AI Step-by-Step

Mar 29, 2026 · Artificial Intelligence

How RAG Quickly Gives Your Agent Real Business Knowledge

The article explains why agents often lack business understanding, describes Retrieval‑Augmented Generation (RAG) as the fastest way to provide correct, up‑to‑date business context, outlines eight practical RAG patterns, and offers a step‑by‑step checklist for building enterprise‑ready agents.

Agent · Enterprise AI · GraphRAG

0 likes · 10 min read

Data Party THU

Mar 23, 2026 · Artificial Intelligence

Boosting RAG Performance: Query Translation & Decomposition Techniques

The article explains two emerging RAG query‑optimization approaches—query translation and query decomposition—detailing fan‑out retrieval, reciprocal rank fusion, HyDE, step‑back prompting, and chain‑of‑thought retrieval, and shows how combining them can improve relevance and latency in LLM‑augmented systems.

LLM · Query Optimization · RAG

0 likes · 9 min read

Woodpecker Software Testing

Mar 22, 2026 · Artificial Intelligence

How to Test Retrieval‑Augmented Generation Systems: Practical Strategies for 2024

This article explains why traditional API, assertion, and UI testing fail for Retrieval‑Augmented Generation (RAG) systems, and presents a four‑step, evidence‑driven testing framework—including golden test sets, dual‑track validation, chaos engineering, and continuous trust dashboards—to ensure factual reliability and operational robustness in real‑world deployments.

Fact Checking · LLM · OpenTelemetry

0 likes · 8 min read

Data Party THU

Mar 21, 2026 · Artificial Intelligence

Why Bigger Context Windows Hurt LLMs and How RAG Still Wins

The article explains that expanding LLM context windows leads to attention dilution and retrieval collapse, degrading answer quality, and argues that Retrieval‑Augmented Generation remains essential because it preserves signal density through focused retrieval and selective prompting.

AI Architecture · Attention Dilution · LLM

0 likes · 8 min read

PaperAgent

Mar 19, 2026 · Artificial Intelligence

How MDER‑DR Boosts Multi‑Hop KG QA with Entity‑Centric Summaries

The article presents the MDER‑DR two‑stage framework that tackles semantic loss in knowledge‑graph triple indexing by generating context‑aware entity summaries and using an LLM‑driven decompose‑parse retrieval loop, achieving up to 66% performance gains on multi‑hop question answering benchmarks.

Entity Summarization · KG QA · Knowledge Graph

0 likes · 5 min read

Tech Freedom Circle

Mar 19, 2026 · Artificial Intelligence

Failed Alibaba Interview: The 4 RAG Modules and 6 Design Principles You Need

The article dissects a failed Alibaba second‑round interview where the candidate answered only “vector‑search‑enhanced” for a RAG design, and then presents a systematic, four‑module RAG architecture together with six design principles, detailed indexing, query understanding, multi‑path recall, and context generation techniques to help candidates demonstrate comprehensive technical depth.

AI Architecture · Knowledge Graph · Multi‑Path Recall

0 likes · 22 min read

Woodpecker Software Testing

Mar 17, 2026 · Artificial Intelligence

Why Direct Prompting Beats LLM Knowledge‑Base Agents for Test‑Case Generation

The article explains that feeding requirements directly in the prompt yields far better test‑case designs than using an LLM‑powered knowledge‑base agent, because the model processes the full context without the loss and fragmentation introduced by retrieval‑augmented generation.

AI Agents · Knowledge Base · LLM

0 likes · 6 min read

Data STUDIO

Mar 9, 2026 · Artificial Intelligence

Boost RAG Accuracy from 60% to 94% with 11 Proven Strategies

This article dissects why naive Retrieval‑Augmented Generation (RAG) often yields only 60% accuracy, then presents eleven concrete ingestion, query, and hybrid techniques—complete with code samples, performance trade‑offs, and real‑world case studies—that together can raise RAG accuracy to 94% while outlining practical implementation roadmaps and common pitfalls.

Embedding · Knowledge Graph · LLM

0 likes · 31 min read

DataFunSummit

Feb 25, 2026 · Artificial Intelligence

Why RAG Fails in Production and How to Fix It: Expert Insights

This article summarizes a DataFun‑hosted roundtable where leading AI experts dissect the gap between RAG’s promise and real‑world deployment, exposing low recall, hallucinations, and cost overruns, then present systematic diagnostics, evaluation metrics, hybrid search, and engineering best practices to reliably operationalize RAG in enterprise settings.

Enterprise AI · Hybrid Search · LLM

0 likes · 18 min read

Qborfy AI

Feb 18, 2026 · Artificial Intelligence

How Retrieval‑Augmented Generation (RAG) Supercharges LLM Answers – Complete Guide & Code

This article explains Retrieval‑Augmented Generation (RAG), detailing its offline knowledge‑base construction and online retrieval‑enhanced generation workflow, comparing it with traditional and fine‑tuned models, and providing step‑by‑step LangChain implementations, advanced techniques, and practical use‑case demos.

Embedding Models · Hybrid Search · LangChain

0 likes · 16 min read

DataFunTalk

Feb 11, 2026 · Artificial Intelligence

Why Most RAG Deployments Fail and How to Build a Production‑Ready RAG System

This round‑table dissects the gap between RAG’s hype and real‑world production, exposing common pitfalls such as low recall, hallucinations and cost overruns, and then delivers a systematic diagnostic framework, hybrid search strategies, fine‑tuning rules, and practical best‑practice roadmaps for building reliable enterprise RAG solutions.

Agentic RAG · Hybrid Search · LLM

0 likes · 20 min read

Amazon Cloud Developers

Feb 10, 2026 · Artificial Intelligence

How RAG‑MCP Cuts Prompt Tokens by Up to 74% While Boosting Accuracy

This article presents a rigorous, multi‑dimensional evaluation of the RAG‑MCP framework versus a full‑tool MCP approach on Amazon Bedrock, showing up to 74% token reduction, higher tool‑selection accuracy, lower latency, and better scalability for large tool sets.

Amazon Bedrock · LLM · RAG

0 likes · 21 min read

AI2ML AI to Machine Learning

Feb 7, 2026 · Artificial Intelligence

Why the ‘Skills’ Approach Is the Third Major Compromise Shaping Enterprise AI in 2026

The article argues that embracing the Skills paradigm, a lightweight, low‑cost alternative to large‑scale model training, represents the third major compromise in the large‑model era, balancing reduced emergence and planning hallucinations against increased stability and engineering efficiency for enterprise AI deployments.

Agentic AI · Enterprise AI · Mixture of Experts

0 likes · 8 min read

Data STUDIO

Jan 27, 2026 · Artificial Intelligence

How Python RAG Architectures Can Tame Large‑Model Hallucinations: A Complete Guide to 9 Designs

This article explains why large‑language‑model hallucinations are risky, introduces Retrieval‑Augmented Generation (RAG) as a remedy, and walks through nine Python‑based RAG architectures—standard, conversational, corrective, adaptive, fusion, HyDE, self‑RAG, agentic, and graph RAG—detailing their workflows, code examples, strengths, weaknesses, and a decision‑making map for selecting the right design.

AI hallucination · LangChain · Python

0 likes · 29 min read

PaperAgent

Jan 13, 2026 · Artificial Intelligence

How C2LLM Redefines Code Retrieval with Attention‑Based Pooling

The article introduces C2LLM, a contrastive code LLM series that replaces mean and EOS pooling with a multi‑head attention pooling module, achieving top scores on the MTEB‑Code benchmark across 12 tasks and demonstrating cost‑effective, high‑precision code retrieval for both production and AI agent applications.

Code Embedding · Large Language Model · MTEB-Code

0 likes · 8 min read

Sohu Tech Products

Jan 7, 2026 · Artificial Intelligence

Master Retrieval-Augmented Generation (RAG): Concepts, Benefits, Implementation

This article explains Retrieval‑Augmented Generation (RAG), its dual‑stage architecture that combines parametric LLM knowledge with external non‑parametric data, outlines its technical evolution, discusses why it outperforms pure LLMs, and provides a step‑by‑step guide with toolchain choices, evaluation metrics, and future challenges.

AI · Knowledge Base · LLM

0 likes · 14 min read

PaperAgent

Jan 5, 2026 · Artificial Intelligence

How QuCo‑RAG Replaces Model Confidence with Objective Evidence to Cut Hallucinations

QuCo‑RAG introduces a dynamic retrieval‑augmented generation framework that quantifies uncertainty using pre‑training corpus statistics, replacing unreliable model confidence with objective frequency and co‑occurrence evidence, achieving millisecond‑level hallucination detection, superior multi‑hop QA performance, and cross‑model transferability across various LLMs.

Dynamic Retrieval · LLM · Retrieval-Augmented Generation

0 likes · 9 min read

Mingyi World Elasticsearch

Dec 28, 2025 · Artificial Intelligence

Building an Elasticsearch‑Powered RAG Q&A System: Theory and Full Code Walkthrough

This article walks through the principles of Retrieval‑Augmented Generation (RAG) and provides a complete Python implementation using Elasticsearch, covering document chunking, semantic embedding, bulk indexing, hybrid BM25‑vector search, RRF result fusion, prompt design, LLM invocation, and a practical demo.

Elasticsearch · Hybrid Search · Prompt Engineering

0 likes · 9 min read

PaperAgent

Dec 12, 2025 · Artificial Intelligence

How BookRAG Redefines Long-Document Retrieval with Hierarchical Indexing

BookRAG introduces a hierarchical, structure‑aware indexing method that combines tree‑based document representation with graph‑based entity linking and an agent‑driven retrieval pipeline, achieving up to 71.2% recall improvement on multimodal long‑document benchmarks while cutting token usage and latency dramatically.

Agent Retrieval · LLM · Long Document QA

0 likes · 7 min read

Open Source Tech Hub

Dec 5, 2025 · Artificial Intelligence

From Neurons to GPT: A Complete Timeline of AI Evolution and Future Trends

This comprehensive article traces AI from its biological roots and early computers through the birth of artificial intelligence, the rise of machine learning, the emergence of large language models, multimodal agents, and finally explores current breakthroughs, practical applications, and future directions.

Agents · Prompt Engineering · Retrieval-Augmented Generation

0 likes · 39 min read

Architect's Guide

Nov 24, 2025 · Artificial Intelligence

Building Java LLM Applications with LangChain4j: A Hands‑On Guide

This tutorial walks through the fundamentals of large language models, prompt engineering, and word embeddings, then shows how to set up a LangChain‑based LLM stack in Java using LangChain4j, covering core modules, memory, retrieval, chains, agents, and complete code examples.

AI Agents · Java · LLM

0 likes · 15 min read

JD Tech Talk

Nov 21, 2025 · Artificial Intelligence

Mastering Chunking Strategies for Retrieval‑Augmented Generation

This article explains why effective chunking is crucial for RAG performance, compares seven major chunking strategies—including fixed‑size, semantic, recursive, document‑structure, agent‑driven, sentence, and paragraph methods—and offers practical guidance on selecting and optimizing chunks for real‑world AI applications.

AI · Chunking · RAG

0 likes · 10 min read

Wu Shixiong's Large Model Academy

Nov 20, 2025 · Artificial Intelligence

How to Build a Quantifiable Data Quality Framework for Dynamic Incremental RAG

This article explains why static RAG metrics don’t apply to dynamic pipelines, introduces five essential dimensions—Parseability, Deduplication, Relevance, Chunk Quality, and Freshness—and shows how to combine them into a weighted score that enables monitoring, alerts, and continuous improvement of dynamic RAG systems.

Data Quality · Dynamic RAG · Metrics

0 likes · 10 min read

Data Thinking Notes

Nov 16, 2025 · Artificial Intelligence

How AI Agents Transform Automation: Architecture, Challenges & Future Trends

This comprehensive overview examines AI agents powered by large language models, detailing their definition, core components, architectural patterns, key technologies such as prompt engineering and retrieval‑augmented generation, diverse application domains, current challenges, security solutions, and emerging research directions.

Multi-Agent Systems · Prompt Engineering · Retrieval-Augmented Generation

0 likes · 81 min read

Alibaba Cloud Big Data AI Platform

Nov 4, 2025 · Artificial Intelligence

How Alibaba Cloud’s PAI Powers Cutting‑Edge LLM Research at EMNLP 2025

EMNLP 2025 in Suzhou will feature Alibaba Cloud’s AI platform PAI presenting four accepted papers on knowledge distillation, small‑model reasoning, distilled reasoning models, and an automated RAG benchmark framework, alongside exhibition demos, networking events, and recruitment opportunities for AI talent.

AI platform · EMNLP 2025 · Retrieval-Augmented Generation

0 likes · 10 min read

Wu Shixiong's Large Model Academy

Nov 2, 2025 · Artificial Intelligence

Why Document Parsing Is the Real Bottleneck in RAG Projects (And How to Fix It)

The article explains that in Retrieval‑Augmented Generation projects the hardest challenge lies in robust document parsing—handling PDFs, PPTs, scanned contracts, OCR errors, and preserving structure—to ensure high‑quality retrieval and avoid hallucinations.

AI · OCR · RAG

0 likes · 10 min read

Amazon Cloud Developers

Oct 31, 2025 · Artificial Intelligence

Build Accurate AI Apps Without Complex RAG: Introducing Amazon Nova Web Grounding

Amazon Nova Web Grounding is an out‑of‑the‑box RAG tool for Amazon Bedrock that automatically retrieves and cites up‑to‑date information, reduces hallucinations, and lets developers create accurate, real‑time AI applications using simple Python code.

AI Applications · Amazon Bedrock · Amazon Nova

0 likes · 8 min read

DataFunSummit

Oct 30, 2025 · Artificial Intelligence

How Multimodal Large Models Are Revolutionizing Document Processing and OCR

This article explores how the explosion of unstructured data exposes the limits of traditional OCR and shows how emerging multimodal large language models provide end‑to‑end document understanding, reduce pipeline complexity, cut training costs, enable hybrid retrieval‑augmented generation, and drive real‑world industry deployments.

AI · Document processing · Large Language Model

0 likes · 28 min read

Xuanwu Backend Tech Stack

Oct 22, 2025 · Artificial Intelligence

How Rerank Transforms Retrieval‑Augmented Generation for Accurate AI Answers

This article explains the limitations of basic Retrieval‑Augmented Generation (RAG), introduces Rerank technology as a two‑step refinement process, compares dual‑encoder and cross‑encoder methods, and reviews popular Rerank models to help developers build more precise AI‑driven retrieval systems.

Information Retrieval · RAG · Rerank

0 likes · 10 min read

JD Tech Talk

Oct 21, 2025 · Backend Development

How Backend Engineers Are Breaking Through AI with RAG Architectures

This article details a backend developer's two‑year AI journey, the challenges of rapid model advances, and how applying microservice principles to Retrieval‑Augmented Generation (RAG) creates a scalable, multi‑agent platform for insurance knowledge, memory, and intelligent agents.

Backend AI · Knowledge Base · RAG

0 likes · 11 min read

AI Large Model Application Practice

Oct 13, 2025 · Artificial Intelligence

How to Tame LLM Agents: Proven Strategies to Reduce Uncertainty and Boost Reliability

This article outlines practical techniques—including prompt engineering, domain fine‑tuning, retrieval‑augmented generation, structured outputs, workflow constraints, model parameter control, behavior rules, risk‑based AI participation, and comprehensive governance—to curb the unpredictability of large language model agents in enterprise settings.

AI Agent · AI Governance · LLM

0 likes · 18 min read

DataFunSummit

Oct 9, 2025 · Artificial Intelligence

Why AI Coding Agents Still Struggle: Context Limits, Knowledge Gaps, and the Road to Human‑Like Assistants

This talk examines the core challenges facing AI coding agents—limited context windows, knowledge accumulation, and software‑engineering complexity—while outlining practical solutions such as context providing, RAG, fine‑tuning, online learning, feedback loops, and multi‑agent collaboration to move toward truly human‑like, continuously learning coding assistants.

AI coding · Feedback Loop · Retrieval-Augmented Generation

0 likes · 24 min read

JD Cloud Developers

Sep 28, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Work?

This article explains Retrieval‑Augmented Generation (RAG), an AI framework that combines traditional information retrieval with large language models, covering its core workflow—from knowledge preparation, data cleaning, and metadata extraction to query preprocessing, vector retrieval, reranking, information integration, and final LLM generation, while also reviewing common embedding models and vector databases.

LLM · RAG · Retrieval-Augmented Generation

0 likes · 13 min read

Tech Freedom Circle

Sep 25, 2025 · Artificial Intelligence

How RAGFlow’s Agent Engine Turns Retrieval into a Problem‑Solving AI

This article explains how RAGFlow upgrades a traditional RAG system from a passive question‑answer engine to an active problem‑solving agent by integrating the ReAct reasoning‑action‑observation loop, a visual canvas workflow, and a modular component‑tool ecosystem, with concrete Python implementations and code examples.

AI Agents · Python · ReAct

0 likes · 16 min read

Tech Freedom Circle

Sep 25, 2025 · Artificial Intelligence

RAGFlow Primer Part 1: Introduction and Concept Deep Dive

This article provides a comprehensive technical overview of RAGFlow, an industrial‑grade Retrieval‑Augmented Generation platform, detailing its architecture, core components such as DeepDoc, intelligent chunking, embedding integration, multi‑stage retrieval, and agent workflow, while comparing it with traditional RAG shortcomings.

DeepDoc · Intelligent Chunking · Knowledge Base

0 likes · 32 min read

Tech Freedom Circle

Sep 25, 2025 · Artificial Intelligence

Inside RAGFlow: How Its Microservice Architecture Powers an Enterprise‑Grade Retrieval‑Augmented Generation Platform

This article provides a detailed technical walkthrough of RAGFlow's architecture, covering its microservice design, directory layout, layered structure, cloud‑native deployment, core modules such as DeepDoc, RAG engine, Agent system, and web UI, as well as multi‑tenant isolation, streaming responses, asynchronous task handling, concurrency controls, scalability strategies, and a complete request‑lifecycle example for document upload.

AI Architecture · DeepDoc · Docker Compose

0 likes · 26 min read

Huolala Tech

Sep 24, 2025 · Artificial Intelligence

How CID-GraphRAG Boosts Multi‑Turn AI Customer Service with Dual‑Layer Retrieval

The article introduces CID-GraphRAG, a novel framework that combines intent‑driven graphs with semantic similarity search to improve multi‑turn intelligent customer service, detailing its architecture, dual‑layer retrieval mechanism, evaluation against baseline models, and future research directions.

AI · Dialogue Systems · LLM

0 likes · 14 min read

Data Thinking Notes

Sep 21, 2025 · Artificial Intelligence

From RAG to DeepSearch & DeepResearch: How AI Is Mastering Knowledge Retrieval

Amid the rapid rise of generative AI, this article examines the limitations of large language models and explains how Retrieval‑Augmented Generation (RAG), followed by the advanced paradigms DeepSearch and DeepResearch, progressively enhance knowledge handling through dynamic retrieval, multi‑agent reasoning, and autonomous research capabilities.

AI Knowledge Management · DeepResearch · DeepSearch

0 likes · 16 min read

DataFunTalk

Sep 19, 2025 · Artificial Intelligence

How Tencent’s Large Language Models Transform Business with RAG, GraphRAG, and Agents

This article examines Tencent's large language model deployments across diverse business scenarios, detailing how Retrieval‑Augmented Generation, GraphRAG, and autonomous agents boost model intelligence, improve user experience, and enable advanced content generation, understanding, and multi‑step reasoning.

Autonomous Agents · GraphRAG · Retrieval-Augmented Generation

0 likes · 4 min read

Architecture & Thinking

Sep 12, 2025 · Artificial Intelligence

How Knowledge Graphs Turn Large Language Models into Trustworthy Experts

Integrating structured knowledge graphs with generative AI provides traceable, explainable, and high‑precision reasoning across domains such as medicine, finance, and law, through techniques like Retrieval‑Augmented Generation, graph neural networks, and adaptive planning, dramatically reducing hallucinations and boosting expert‑level performance.

AI hallucination · Graph Neural Network · Knowledge Graph

0 likes · 12 min read

Architects Research Society

Sep 10, 2025 · Artificial Intelligence

From Vectors to Graphs to Hybrids: The Evolution of AI Knowledge Representation

This article explores the three stages of AI knowledge representation—vector embeddings, graph‑based structures, and the emerging hybrid approach that combines vectors, graphs, and large language models—to illustrate how modern Retrieval‑Augmented Generation systems achieve both semantic similarity and precise relational reasoning.

AI · Retrieval-Augmented Generation · graph databases

0 likes · 3 min read

Instant Consumer Technology Team

Sep 5, 2025 · Artificial Intelligence

Why Context Engineering Is the Next Frontier for Large Language Models

This article surveys over 1,400 papers to define context engineering as a systematic discipline that structures retrieval, memory, tools, and multi‑agent coordination for LLMs, highlighting the critical asymmetry between understanding long contexts and generating equally complex outputs.

LLM evaluation · Memory Management · Retrieval-Augmented Generation

0 likes · 8 min read

Alibaba Cloud Developer

Sep 1, 2025 · Artificial Intelligence

Mastering RAG: From Chunking to Hybrid Search for Better AI Retrieval

This article delves into the implementation details and optimization strategies of Retrieval‑Augmented Generation (RAG), covering document chunking, index enhancement, embedding, hybrid search, and re‑ranking, and provides practical code examples to help developers move from quick deployment to deep performance tuning.

AI · Chunking · Embedding

0 likes · 19 min read

Tencent Technical Engineering

Aug 29, 2025 · Artificial Intelligence

How Retrieval‑Augmented Generation Evolves into Autonomous AI Agents

This article examines the limitations of large language models' internal knowledge, explains how retrieval‑augmented generation (RAG) and tool‑augmented generation address these limits, and traces the evolution from simple retrieve‑then‑generate pipelines to autonomous, multi‑modal AI agents.

AI Agents · LLM · RAG

0 likes · 20 min read

Tech Freedom Circle

Aug 26, 2025 · Artificial Intelligence

How to Optimize RAG for Alibaba Interviews? 7 Golden Rules Explained

This article provides a step‑by‑step technical guide to optimizing Retrieval‑Augmented Generation (RAG) for interview scenarios, covering query rewriting, HyDE, fallback strategies, routing and prompt routing, multi‑representation indexing, hybrid retrieval, re‑ranking, self‑RAG, generation control, performance benchmarking, and a practical checklist with concrete code examples and metrics.

AI interview · Hybrid Retrieval · Index Optimization

0 likes · 30 min read

DaTaobao Tech

Aug 25, 2025 · Artificial Intelligence

Mastering RAG: From Quick Start to Deep Optimization Strategies

This article dives into the practical implementation of Retrieval‑Augmented Generation (RAG), covering document chunking, semantic and reverse HyDE indexing, embedding, hybrid search, and re‑ranking techniques, and provides concrete code examples and optimization tips for building high‑performance AI applications.

Chunking · Embedding · Hybrid Search

0 likes · 18 min read

Volcano Engine Developer Services

Aug 21, 2025 · Artificial Intelligence

Why Prompt Engineering Isn’t Enough: The Rise of Context Engineering and RAG

Since last year, the debate over “Prompt Engineering” has split between practitioners who favor “Context Engineering” for building scalable agent systems and scholars who treat Prompt Engineering as a broad umbrella term, highlighting the need to dynamically construct and manage context for reliable, extensible AI applications.

AI Agents · LLM · Prompt Engineering

0 likes · 33 min read

Data Party THU

Aug 17, 2025 · Artificial Intelligence

Why Do Large Language Models Hallucinate? Unpacking the Probabilistic Roots and Fixes

Large language models often generate confident but false statements—a phenomenon called hallucination—because they predict the next token based on statistical patterns rather than factual understanding, and this article explains the underlying mechanisms and practical mitigation strategies.

Hallucination · LLM · RLHF

0 likes · 11 min read

AsiaInfo Technology: New Tech Exploration

Jul 30, 2025 · Artificial Intelligence

How MCP‑RAG Overcomes Prompt Inflation for Massive LLM Service Calls

This article analyzes the prompt‑inflation bottleneck that arises when large language models (LLMs) must handle thousands of Model Context Protocol (MCP) services, and introduces the MCP‑RAG architecture—a retrieval‑augmented generation solution that builds a metadata knowledge base and intelligent retrieval layer to enable precise, efficient MCP service discovery at scale.

AI · LLM · MCP

0 likes · 21 min read

JD Tech

Jul 29, 2025 · Artificial Intelligence

How Causal Inference Meets Large Language Models to Revolutionize E‑commerce Pricing

This article describes a QCon talk that combines causal inference with large language models to build a retrieval‑augmented generation pricing system for e‑commerce, detailing the three‑step algorithm, LLM‑driven modeling challenges, process‑reward tree search, reinforcement‑learning fine‑tuning, and experimental gains in accuracy and speed.

Retrieval-Augmented Generation · causal inference · e‑commerce pricing

0 likes · 17 min read

AI2ML AI to Machine Learning

Jul 24, 2025 · Artificial Intelligence

Exploring Recent Large‑Model Agent Papers: Insights and Analyses

This article reviews a series of recent research papers on large‑model agents, covering topics such as reinforcement‑learning‑driven ML agents, premise‑critique ability of LLMs, long‑term tool‑augmented LLM evaluation, agentic RAG, set‑based retrieval for multi‑hop QA, mobile VLM agents, and broader surveys of LLM applications, summarizing each work’s problem statement, prior approaches, novel contributions, experimental results, limitations, and future directions.

Agentic AI · LLM evaluation · Retrieval-Augmented Generation

0 likes · 46 min read

DataFunTalk

Jul 21, 2025 · Artificial Intelligence

From Prompt Engineering to Context Engineering: Transforming LLM Interactions

This article traces the evolution from prompt engineering to context engineering, detailing technical milestones, core concepts, practical strategies, and future trends that together reshape large language model applications and enable sophisticated AI agents across diverse domains.

Memory Management · Prompt Engineering · Retrieval-Augmented Generation

0 likes · 35 min read

Instant Consumer Technology Team

Jul 14, 2025 · Artificial Intelligence

9 Essential Technologies for Building Scalable AI Agents

An in‑depth guide outlines the nine core technologies—ranging from autonomous agent fundamentals and multi‑agent collaboration to workflow orchestration, retrieval‑augmented generation, fine‑tuning, function calling, model context protocols, agent‑to‑agent communication, and AI‑driven UI—required to design, deploy, and scale enterprise‑grade AI agents.

AI Agents · Function Calling · Model Context Protocol

0 likes · 9 min read
