Tagged articles
145 articles
Page 1 of 2
Tencent Tech
Tencent Tech
May 20, 2026 · Artificial Intelligence

The Three Evolutions of AI Engineering: Prompt, Context, and Harness

This article analyzes the progressive stages of AI‑driven software engineering—Prompt Engineering, Context Engineering, and Harness Engineering—illustrating how each addresses specific challenges, presenting real‑world experiments from OpenAI and Anthropic, and outlining a roadmap for engineers to master the new paradigm.

AI AgentsContext EngineeringHarness Engineering
0 likes · 19 min read
The Three Evolutions of AI Engineering: Prompt, Context, and Harness
SuanNi
SuanNi
May 20, 2026 · Artificial Intelligence

AI‑Powered Research Workflow: When to Trust the Tools and When to Supervise

The article surveys AI‑assisted research across the full lifecycle—creation, writing, validation, and dissemination—detailing the capabilities of prompt engineering, retrieval‑augmented generation, training‑free agents and hybrid methods, reporting benchmark numbers, failure modes, and governance challenges that dictate when human oversight remains essential.

AI research automationPrompt EngineeringRetrieval Augmented Generation
0 likes · 17 min read
AI‑Powered Research Workflow: When to Trust the Tools and When to Supervise
Lao Guo's Learning Space
Lao Guo's Learning Space
May 12, 2026 · Artificial Intelligence

Demystifying the Core Technologies Behind ChatGPT, GPT‑4, and DeepSeek

This article breaks down the key algorithms that power large‑language models—Transformer, Mixture‑of‑Experts, Flash Attention, KV‑Cache, Multi‑Token Prediction, quantization, Chain‑of‑Thought and Retrieval‑Augmented Generation—explaining how each contributes to the performance of ChatGPT, GPT‑4 and DeepSeek.

Flash AttentionKV cacheMixture of Experts
0 likes · 10 min read
Demystifying the Core Technologies Behind ChatGPT, GPT‑4, and DeepSeek
James' Growth Diary
James' Growth Diary
May 12, 2026 · Artificial Intelligence

GraphRAG Deep Dive: Boost Multi‑Hop Reasoning Accuracy from 50% to 85% with Knowledge Graphs

This article explains why traditional vector RAG loses relational information, how GraphRAG reconstructs entity‑relationship triples into a knowledge graph, and provides step‑by‑step code, performance benchmarks, retrieval modes, and practical tips that raise multi‑hop reasoning accuracy from around 50% to 85%.

GraphRAGKnowledge GraphLangChain
0 likes · 14 min read
GraphRAG Deep Dive: Boost Multi‑Hop Reasoning Accuracy from 50% to 85% with Knowledge Graphs
James' Growth Diary
James' Growth Diary
May 9, 2026 · Artificial Intelligence

Agentic RAG Deep Dive: Letting the Agent Decide When and How Often to Retrieve

The article analyzes the shortcomings of traditional one‑shot RAG pipelines, introduces four Agentic RAG patterns that let an LLM‑driven agent control retrieval strategy, source selection, query rewriting and retry limits, and provides concrete TypeScript implementations with LangGraph, code snippets, and practical pitfalls.

Agentic RAGLLMLangGraph
0 likes · 16 min read
Agentic RAG Deep Dive: Letting the Agent Decide When and How Often to Retrieve
Lao Guo's Learning Space
Lao Guo's Learning Space
May 6, 2026 · Artificial Intelligence

Why Your RAG Keeps Missing the Mark: Enterprise‑Level Pitfall Guide

This article examines why Retrieval‑Augmented Generation systems that work in demos often fail in production, detailing common pitfalls—from chunking and vector‑database selection to hybrid retrieval and re‑ranking—and offers concrete strategies, configuration tips, and a decision tree to build reliable enterprise‑grade RAG solutions.

Enterprise AIHybrid RetrievalRAG
0 likes · 12 min read
Why Your RAG Keeps Missing the Mark: Enterprise‑Level Pitfall Guide
DataFunSummit
DataFunSummit
May 4, 2026 · Artificial Intelligence

Inside Alibaba Cloud AI Search: Agentic RAG Architecture and Multi‑Agent Techniques

Alibaba Cloud AI Search tackles high‑concurrency, multimodal, and multi‑hop queries by evolving its Agentic RAG architecture from a single agent to a coordinated multi‑agent system that integrates planning, retrieval, and generation, leverages hybrid vector‑text‑DB‑graph recall, GPU‑accelerated indexing, quantization, NL2SQL, and multimodal search, with performance data and real‑world case studies.

AI searchAgentic RAGAlibaba Cloud
0 likes · 6 min read
Inside Alibaba Cloud AI Search: Agentic RAG Architecture and Multi‑Agent Techniques
DataFunTalk
DataFunTalk
May 4, 2026 · Artificial Intelligence

Engineering and Algorithm Innovations for RAG Engines in Office Applications

This article analyzes the challenges and practical solutions of building a Retrieval‑Augmented Generation (RAG) system for office scenarios, covering background issues, modular architecture, offline and online pipelines, hybrid retrieval, ranking models, knowledge filtering, prompt design, and two‑stage generation techniques.

AIDocument ParsingHybrid Retrieval
0 likes · 22 min read
Engineering and Algorithm Innovations for RAG Engines in Office Applications
DataFunSummit
DataFunSummit
May 3, 2026 · Artificial Intelligence

From Flawed to Production-Ready: Deep Dive into Building Enterprise-Grade RAG Systems

The article analyzes why early RAG deployments often fall short, dissects the most common technical pain points—from document parsing to vector overload—and presents a systematic roadmap that includes hybrid search, reranking, GraphRAG, Agentic RAG, model selection, scalability tricks, and security controls for robust B‑side production.

Agentic RAGEnterprise AIFine-tuning
0 likes · 20 min read
From Flawed to Production-Ready: Deep Dive into Building Enterprise-Grade RAG Systems
Spring Full-Stack Practical Cases
Spring Full-Stack Practical Cases
May 3, 2026 · Artificial Intelligence

9 Advanced Retrieval‑Augmented Generation (RAG) Architectures Explained

This article introduces Retrieval‑Augmented Generation (RAG) and systematically details nine distinct RAG architectures—standard, conversational with memory, corrective (CRAG), adaptive, self‑RAG, fusion, HyDE, agentic, and Graph RAG—highlighting their workflows, real‑world examples, advantages, and trade‑offs.

AI ArchitectureGraphRAGLLM
0 likes · 17 min read
9 Advanced Retrieval‑Augmented Generation (RAG) Architectures Explained
MaGe Linux Operations
MaGe Linux Operations
Apr 28, 2026 · Artificial Intelligence

Why Your RAG Performance Is Poor: Common Issues and Optimization Strategies

This article systematically analyzes why Retrieval‑Augmented Generation pipelines often underperform—covering embedding model selection, chunking strategies, hybrid retrieval, reranking, context window waste, evaluation metrics, and a detailed troubleshooting checklist—while providing concrete code examples and best‑practice recommendations for engineers.

EmbeddingHybrid RetrievalRAG
0 likes · 19 min read
Why Your RAG Performance Is Poor: Common Issues and Optimization Strategies
PMTalk Product Manager Community
PMTalk Product Manager Community
Apr 28, 2026 · Artificial Intelligence

First Principle for Agent Product Managers: Choosing Between Single Agent, Multi‑Agent Collaboration, and Workflow

The article presents a decision framework for AI product managers, mapping workflow determinism and context certainty to four technical patterns—traditional RPA + AI, single Agent + RAG/knowledge graph, end‑to‑end RL Agent, and multi‑Agent collaboration—each with concrete use‑case examples and selection guidelines.

AI AgentsRPARetrieval Augmented Generation
0 likes · 6 min read
First Principle for Agent Product Managers: Choosing Between Single Agent, Multi‑Agent Collaboration, and Workflow
AI Illustrated Series
AI Illustrated Series
Apr 27, 2026 · Artificial Intelligence

Comprehensive RAG Interview Q&A: 22 In-Depth Questions and Answers

This extensive interview guide covers 22 core RAG questions, detailing the definition, workflow, embedding selection, vector database choices, retrieval optimization, multi‑turn handling, context compression, evaluation metrics, knowledge‑graph integration, operational challenges, Agentic and hybrid RAG, document update strategies, similarity algorithms, and hallucination mitigation, providing concrete examples and practical advice for AI interview preparation.

AI InterviewEmbeddingKnowledge Retrieval
0 likes · 29 min read
Comprehensive RAG Interview Q&A: 22 In-Depth Questions and Answers
DataFunTalk
DataFunTalk
Apr 26, 2026 · Artificial Intelligence

Building an Enterprise‑Grade RAG 2.0 System: Architecture, Challenges, and Best Practices

This article analyses the practical construction of an enterprise‑level Retrieval‑Augmented Generation (RAG) 2.0 system, covering background issues of large models, a modular architecture, layered offline/online pipelines, hybrid retrieval, ranking strategies, prompt engineering, and deployment insights drawn from China Mobile’s production experience.

Enterprise AIHybrid RetrievalPrompt Engineering
0 likes · 22 min read
Building an Enterprise‑Grade RAG 2.0 System: Architecture, Challenges, and Best Practices
DataFunSummit
DataFunSummit
Apr 22, 2026 · Artificial Intelligence

From Flawed RAG to Production‑Ready: Deep Dive into Scaling Retrieval‑Augmented Generation

This expert roundtable dissects why RAG often fails in production—low recall, hallucinations, cost overruns—and walks through concrete diagnostics, hybrid search designs, knowledge‑engineering tricks, GraphRAG and Agentic RAG advances, plus practical deployment, security, and cost‑optimization guidelines.

AI deploymentAgentic RAGHybrid Search
0 likes · 20 min read
From Flawed RAG to Production‑Ready: Deep Dive into Scaling Retrieval‑Augmented Generation
MeowKitty Programming
MeowKitty Programming
Apr 21, 2026 · Backend Development

2026 AI Priorities for Java Developers: Structured Output, RAG, and Observability

While many Java teams chase flashy AI demos and agents, the real 2026 focus has shifted to engineering concerns—ensuring model outputs reliably map to Java objects, integrating Retrieval‑Augmented Generation into robust data pipelines, and adding observability so AI services can be monitored and debugged like traditional back‑end components.

AILangChain4jObservability
0 likes · 7 min read
2026 AI Priorities for Java Developers: Structured Output, RAG, and Observability
AI Architect Hub
AI Architect Hub
Apr 20, 2026 · Artificial Intelligence

Why LLMs Need RAG: Overcoming Core Limitations and Building Scalable AI Solutions

This article analyzes the fundamental shortcomings of large language models for enterprise use, explains how Retrieval‑Augmented Generation (RAG) bridges those gaps through a detailed offline‑online workflow, and explores emerging trends that will shape the next generation of intelligent AI architectures.

AI ArchitectureEnterprise AIFuture AI
0 likes · 10 min read
Why LLMs Need RAG: Overcoming Core Limitations and Building Scalable AI Solutions
Big Data and Microservices
Big Data and Microservices
Apr 20, 2026 · Artificial Intelligence

Why AI Hallucinates and How RAG Turns It into an Open‑Book Test

The article explains why large language models often fabricate facts, introduces Retrieval‑Augmented Generation (RAG) as a way to ground responses with external data, walks through its four‑step workflow, showcases practical use cases, and highlights the limitations and best practices for deploying RAG.

AIKnowledge BaseLLM
0 likes · 12 min read
Why AI Hallucinates and How RAG Turns It into an Open‑Book Test
AI Architect Hub
AI Architect Hub
Apr 19, 2026 · Artificial Intelligence

Mastering RAG: From Data Cleaning to Vector DBs in AI Applications

This article introduces the second stage of a large‑model application series, detailing the value of Retrieval‑Augmented Generation (RAG), its architecture, and a step‑by‑step outline covering data cleaning, text chunking, vectorization, vector‑DB selection, recall strategies, reranking, and prompt construction.

AILLMPrompt Engineering
0 likes · 4 min read
Mastering RAG: From Data Cleaning to Vector DBs in AI Applications
Su San Talks Tech
Su San Talks Tech
Apr 19, 2026 · Artificial Intelligence

Boost Enterprise RAG: Data Pipeline Tricks, Hybrid Search & Rerank

To make Retrieval‑Augmented Generation reliable in production, the article outlines five key engineering tactics—semantic chunking with metadata, hybrid vector‑keyword search, two‑stage retrieval with reranking, query rewriting and expansion, and dynamic result evaluation—each illustrated with concrete examples and code snippets.

AI EngineeringHybrid SearchQuery Rewriting
0 likes · 10 min read
Boost Enterprise RAG: Data Pipeline Tricks, Hybrid Search & Rerank
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Apr 17, 2026 · Artificial Intelligence

When RAG Retrieves the Right Docs but Still Answers Wrong: Insights from Saarland University (ACL 2026)

The article explains why conventional Retrieval‑Augmented Generation often produces incorrect answers despite retrieving relevant documents, introduces the Disco‑RAG framework that adds a structured reading step using argument trees and relation graphs, and shows how this three‑step approach dramatically improves performance on long‑document and ambiguous‑question benchmarks without any model training.

Disco-RAGRAGRetrieval Augmented Generation
0 likes · 13 min read
When RAG Retrieves the Right Docs but Still Answers Wrong: Insights from Saarland University (ACL 2026)
DataFunSummit
DataFunSummit
Apr 17, 2026 · Artificial Intelligence

Why RAG Projects Fail: Real‑World Pitfalls and Proven Solutions

This article dissects the hype‑versus‑reality gap of Retrieval‑Augmented Generation in enterprises, exposing low recall, hallucinations, and cost overruns, then offers a systematic diagnosis, hybrid search, reranking, security controls, and advanced GraphRAG and Agentic RAG strategies to achieve reliable production deployments.

Enterprise AILLMRAG
0 likes · 17 min read
Why RAG Projects Fail: Real‑World Pitfalls and Proven Solutions
DataFunTalk
DataFunTalk
Apr 15, 2026 · Artificial Intelligence

Building a Production‑Ready RAG System for Enterprise Knowledge Work

This article analyzes the challenges and practical solutions of deploying Retrieval‑Augmented Generation (RAG) in an enterprise office setting, covering background problems, modular architecture, offline and online pipelines, hybrid retrieval, multi‑stage ranking, knowledge filtering, prompt engineering, and model selection to achieve accurate, reliable answers.

Enterprise AIHybrid RetrievalRAG
0 likes · 21 min read
Building a Production‑Ready RAG System for Enterprise Knowledge Work
IT Services Circle
IT Services Circle
Apr 14, 2026 · Artificial Intelligence

What Is RAG? A Complete Guide to Retrieval‑Augmented Generation for AI Engineers

This article explains Retrieval‑Augmented Generation (RAG), covering why large language models need external knowledge, the full offline‑and‑online workflow, document chunking, embedding evolution, vector database choices, multi‑path retrieval, evaluation metrics, hallucination types, and practical strategies to mitigate them.

AI EvaluationEmbeddingRAG
0 likes · 55 min read
What Is RAG? A Complete Guide to Retrieval‑Augmented Generation for AI Engineers
Spring Full-Stack Practical Cases
Spring Full-Stack Practical Cases
Apr 11, 2026 · Artificial Intelligence

Master AI Fundamentals: Tokens, Context Windows, Temperature, Hallucinations & RAG

This article breaks down five essential AI concepts—tokens, context windows, temperature settings, hallucinations, and retrieval‑augmented generation—explaining how they work, why they matter, and how to apply them effectively when building or using large language model applications.

AI fundamentalsContext WindowRetrieval Augmented Generation
0 likes · 12 min read
Master AI Fundamentals: Tokens, Context Windows, Temperature, Hallucinations & RAG
Bighead's Algorithm Notes
Bighead's Algorithm Notes
Apr 7, 2026 · Artificial Intelligence

AutoHypo-Fin: Tsinghua's Web-Mining Method to Auto-Generate and Backtest Market Hypotheses

AutoHypo‑Fin is an end‑to‑end framework that harvests large‑scale web financial data, extracts entities via large language models, builds a temporal knowledge graph, uses retrieval‑augmented generation and statistical backtesting to automatically create, test, and iteratively optimize trading hypotheses, achieving superior risk‑adjusted returns compared with baseline strategies in experiments from 2019‑2024.

AutoHypo-FinKnowledge GraphLLM
0 likes · 11 min read
AutoHypo-Fin: Tsinghua's Web-Mining Method to Auto-Generate and Backtest Market Hypotheses
IT Services Circle
IT Services Circle
Apr 6, 2026 · Artificial Intelligence

Mastering RAG Interview Questions: A Complete Retrieval Optimization Blueprint

This article breaks down the full RAG retrieval pipeline—from query understanding and rewriting, through hybrid retrieval and reranking, to chunking, context compression, and dynamic routing—providing concrete techniques, formulas, and performance metrics to help candidates ace interview questions on RAG systems.

Cross-EncoderHard Negative MiningHybrid Retrieval
0 likes · 16 min read
Mastering RAG Interview Questions: A Complete Retrieval Optimization Blueprint
DataFunSummit
DataFunSummit
Apr 1, 2026 · Artificial Intelligence

Why RAG Fails in Production and How to Fix It: Expert Insights

This article analyzes why Retrieval‑Augmented Generation (RAG) often underperforms in enterprise production, identifies eight common pitfalls—from document parsing to token costs—and offers a systematic roadmap of diagnostics, hybrid search, reranking, and deployment strategies presented by leading AI experts.

AIEnterpriseRAG
0 likes · 18 min read
Why RAG Fails in Production and How to Fix It: Expert Insights
AI Step-by-Step
AI Step-by-Step
Mar 29, 2026 · Artificial Intelligence

How RAG Quickly Gives Your Agent Real Business Knowledge

The article explains why agents often lack business understanding, describes Retrieval‑Augmented Generation (RAG) as the fastest way to provide correct, up‑to‑date business context, outlines eight practical RAG patterns, and offers a step‑by‑step checklist for building enterprise‑ready agents.

AgentEnterprise AIGraphRAG
0 likes · 10 min read
How RAG Quickly Gives Your Agent Real Business Knowledge
Data Party THU
Data Party THU
Mar 23, 2026 · Artificial Intelligence

Boosting RAG Performance: Query Translation & Decomposition Techniques

The article explains two emerging RAG query‑optimization approaches—query translation and query decomposition—detailing fan‑out retrieval, reciprocal rank fusion, HyDE, step‑back prompting, and chain‑of‑thought retrieval, and shows how combining them can improve relevance and latency in LLM‑augmented systems.

LLMRAGRetrieval Augmented Generation
0 likes · 9 min read
Boosting RAG Performance: Query Translation & Decomposition Techniques
Woodpecker Software Testing
Woodpecker Software Testing
Mar 22, 2026 · Artificial Intelligence

How to Test Retrieval‑Augmented Generation Systems: Practical Strategies for 2024

This article explains why traditional API, assertion, and UI testing fail for Retrieval‑Augmented Generation (RAG) systems, and presents a four‑step, evidence‑driven testing framework—including golden test sets, dual‑track validation, chaos engineering, and continuous trust dashboards—to ensure factual reliability and operational robustness in real‑world deployments.

Fact CheckingLLMOpenTelemetry
0 likes · 8 min read
How to Test Retrieval‑Augmented Generation Systems: Practical Strategies for 2024
Data Party THU
Data Party THU
Mar 21, 2026 · Artificial Intelligence

Why Bigger Context Windows Hurt LLMs and How RAG Still Wins

The article explains that expanding LLM context windows leads to attention dilution and retrieval collapse, degrading answer quality, and argues that Retrieval‑Augmented Generation remains essential because it preserves signal density through focused retrieval and selective prompting.

AI ArchitectureAttention DilutionLLM
0 likes · 8 min read
Why Bigger Context Windows Hurt LLMs and How RAG Still Wins
PaperAgent
PaperAgent
Mar 19, 2026 · Artificial Intelligence

How MDER‑DR Boosts Multi‑Hop KG QA with Entity‑Centric Summaries

The article presents the MDER‑DR two‑stage framework that tackles semantic loss in knowledge‑graph triple indexing by generating context‑aware entity summaries and using an LLM‑driven decompose‑parse retrieval loop, achieving up to 66% performance gains on multi‑hop question answering benchmarks.

Entity SummarizationKG QAKnowledge Graph
0 likes · 5 min read
How MDER‑DR Boosts Multi‑Hop KG QA with Entity‑Centric Summaries
Tech Freedom Circle
Tech Freedom Circle
Mar 19, 2026 · Artificial Intelligence

Failed Alibaba Interview: The 4 RAG Modules and 6 Design Principles You Need

The article dissects a failed Alibaba second‑round interview where the candidate answered only “vector‑search‑enhanced” for a RAG design, and then presents a systematic, four‑module RAG architecture together with six design principles, detailed indexing, query understanding, multi‑path recall, and context generation techniques to help candidates demonstrate comprehensive technical depth.

AI ArchitectureKnowledge GraphMulti‑Path Recall
0 likes · 22 min read
Failed Alibaba Interview: The 4 RAG Modules and 6 Design Principles You Need
Data STUDIO
Data STUDIO
Mar 9, 2026 · Artificial Intelligence

Boost RAG Accuracy from 60% to 94% with 11 Proven Strategies

This article dissects why naive Retrieval‑Augmented Generation (RAG) often yields only 60% accuracy, then presents eleven concrete ingestion, query, and hybrid techniques—complete with code samples, performance trade‑offs, and real‑world case studies—that together can raise RAG accuracy to 94% while outlining practical implementation roadmaps and common pitfalls.

EmbeddingKnowledge GraphLLM
0 likes · 31 min read
Boost RAG Accuracy from 60% to 94% with 11 Proven Strategies
DataFunSummit
DataFunSummit
Feb 25, 2026 · Artificial Intelligence

Why RAG Fails in Production and How to Fix It: Expert Insights

This article summarizes a DataFun‑hosted roundtable where leading AI experts dissect the gap between RAG’s promise and real‑world deployment, exposing low recall, hallucinations, and cost overruns, then present systematic diagnostics, evaluation metrics, hybrid search, and engineering best practices to reliably operationalize RAG in enterprise settings.

Enterprise AIHybrid SearchLLM
0 likes · 18 min read
Why RAG Fails in Production and How to Fix It: Expert Insights
Qborfy AI
Qborfy AI
Feb 18, 2026 · Artificial Intelligence

How Retrieval‑Augmented Generation (RAG) Supercharges LLM Answers – Complete Guide & Code

This article explains Retrieval‑Augmented Generation (RAG), detailing its offline knowledge‑base construction and online retrieval‑enhanced generation workflow, comparing it with traditional and fine‑tuned models, and providing step‑by‑step LangChain implementations, advanced techniques, and practical use‑case demos.

Hybrid SearchLangChainPrompt Engineering
0 likes · 16 min read
How Retrieval‑Augmented Generation (RAG) Supercharges LLM Answers – Complete Guide & Code
DataFunTalk
DataFunTalk
Feb 11, 2026 · Artificial Intelligence

Why Most RAG Deployments Fail and How to Build a Production‑Ready RAG System

This round‑table dissects the gap between RAG’s hype and real‑world production, exposing common pitfalls such as low recall, hallucinations and cost overruns, and then delivers a systematic diagnostic framework, hybrid search strategies, fine‑tuning rules, and practical best‑practice roadmaps for building reliable enterprise RAG solutions.

Agentic RAGFine-tuningHybrid Search
0 likes · 20 min read
Why Most RAG Deployments Fail and How to Build a Production‑Ready RAG System
AI2ML AI to Machine Learning
AI2ML AI to Machine Learning
Feb 7, 2026 · Artificial Intelligence

Why the ‘Skills’ Approach Is the Third Major Compromise Shaping Enterprise AI in 2026

The article argues that embracing the Skills paradigm— a lightweight, low‑cost alternative to large‑scale model training—represents the third major compromise in the large‑model era, balancing reduced emergence and planning hallucinations against increased stability and engineering efficiency for enterprise AI deployments.

Agentic AIEnterprise AIMixture of Experts
0 likes · 8 min read
Why the ‘Skills’ Approach Is the Third Major Compromise Shaping Enterprise AI in 2026
Data STUDIO
Data STUDIO
Jan 27, 2026 · Artificial Intelligence

How Python RAG Architectures Can Tame Large‑Model Hallucinations: A Complete Guide to 9 Designs

This article explains why large‑language‑model hallucinations are risky, introduces Retrieval‑Augmented Generation (RAG) as a remedy, and walks through nine Python‑based RAG architectures—standard, conversational, corrective, adaptive, fusion, HyDE, self‑RAG, agentic, and graph RAG—detailing their workflows, code examples, strengths, weaknesses, and a decision‑making map for selecting the right design.

AI hallucinationLangChainPython
0 likes · 29 min read
How Python RAG Architectures Can Tame Large‑Model Hallucinations: A Complete Guide to 9 Designs
PaperAgent
PaperAgent
Jan 13, 2026 · Artificial Intelligence

How C2LLM Redefines Code Retrieval with Attention‑Based Pooling

Introducing C2LLM, a contrastive code LLM series that replaces mean and EOS pooling with a multi‑head attention pooling module, achieving top scores on the MTEB‑Code benchmark across 12 tasks and demonstrating cost‑effective, high‑precision code retrieval for both production and AI agent applications.

MTEB-CodeRetrieval Augmented Generationattention pooling
0 likes · 8 min read
How C2LLM Redefines Code Retrieval with Attention‑Based Pooling
Sohu Tech Products
Sohu Tech Products
Jan 7, 2026 · Artificial Intelligence

Master Retrieval-Augmented Generation (RAG): Concepts, Benefits, Implementation

This article explains Retrieval‑Augmented Generation (RAG), its dual‑stage architecture that combines parametric LLM knowledge with external non‑parametric data, outlines its technical evolution, discusses why it outperforms pure LLMs, and provides a step‑by‑step guide with toolchain choices, evaluation metrics, and future challenges.

AIKnowledge BaseLLM
0 likes · 14 min read
Master Retrieval-Augmented Generation (RAG): Concepts, Benefits, Implementation
PaperAgent
PaperAgent
Jan 5, 2026 · Artificial Intelligence

How QuCo‑RAG Replaces Model Confidence with Objective Evidence to Cut Hallucinations

QuCo‑RAG introduces a dynamic retrieval‑augmented generation framework that quantifies uncertainty using pre‑training corpus statistics, replacing unreliable model confidence with objective frequency and co‑occurrence evidence, achieving millisecond‑level hallucination detection, superior multi‑hop QA performance, and cross‑model transferability across various LLMs.

Dynamic RetrievalLLMRetrieval Augmented Generation
0 likes · 9 min read
How QuCo‑RAG Replaces Model Confidence with Objective Evidence to Cut Hallucinations
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Dec 28, 2025 · Artificial Intelligence

Building an Elasticsearch‑Powered RAG Q&A System: Theory and Full Code Walkthrough

This article walks through the principles of Retrieval‑Augmented Generation (RAG) and provides a complete Python implementation using Elasticsearch, covering document chunking, semantic embedding, bulk indexing, hybrid BM25‑vector search, RRF result fusion, prompt design, LLM invocation, and a practical demo.

ElasticsearchHybrid SearchPrompt Engineering
0 likes · 9 min read
Building an Elasticsearch‑Powered RAG Q&A System: Theory and Full Code Walkthrough
PaperAgent
PaperAgent
Dec 12, 2025 · Artificial Intelligence

How BookRAG Redefines Long-Document Retrieval with Hierarchical Indexing

BookRAG introduces a hierarchical, structure‑aware indexing method that combines tree‑based document representation with graph‑based entity linking and an agent‑driven retrieval pipeline, achieving up to 71.2% recall improvement on multimodal long‑document benchmarks while cutting token usage and latency dramatically.

Agent RetrievalHierarchical IndexingLLM
0 likes · 7 min read
How BookRAG Redefines Long-Document Retrieval with Hierarchical Indexing
Open Source Tech Hub
Open Source Tech Hub
Dec 5, 2025 · Artificial Intelligence

From Neurons to GPT: A Complete Timeline of AI Evolution and Future Trends

This comprehensive article traces AI from its biological roots and early computers through the birth of artificial intelligence, the rise of machine learning, the emergence of large language models, multimodal agents, and finally explores current breakthroughs, practical applications, and future directions.

Fine-tuningPrompt EngineeringRetrieval Augmented Generation
0 likes · 39 min read
From Neurons to GPT: A Complete Timeline of AI Evolution and Future Trends
Architect's Guide
Architect's Guide
Nov 24, 2025 · Artificial Intelligence

Building Java LLM Applications with LangChain4j: A Hands‑On Guide

This tutorial walks through the fundamentals of large language models, prompt engineering, and word embeddings, then shows how to set up a LangChain‑based LLM stack in Java using LangChain4j, covering core modules, memory, retrieval, chains, agents, and complete code examples.

AI AgentsJavaLLM
0 likes · 15 min read
Building Java LLM Applications with LangChain4j: A Hands‑On Guide
JD Tech Talk
JD Tech Talk
Nov 21, 2025 · Artificial Intelligence

Mastering Chunking Strategies for Retrieval‑Augmented Generation

This article explains why effective chunking is crucial for RAG performance, compares seven major chunking strategies—including fixed‑size, semantic, recursive, document‑structure, agent‑driven, sentence, and paragraph methods—and offers practical guidance on selecting and optimizing chunks for real‑world AI applications.

AIRAGRetrieval Augmented Generation
0 likes · 10 min read
Mastering Chunking Strategies for Retrieval‑Augmented Generation
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Nov 20, 2025 · Artificial Intelligence

How to Build a Quantifiable Data Quality Framework for Dynamic Incremental RAG

This article explains why static RAG metrics don’t apply to dynamic pipelines, introduces five essential dimensions—Parseability, Deduplication, Relevance, Chunk Quality, and Freshness—and shows how to combine them into a weighted score that enables monitoring, alerts, and continuous improvement of dynamic RAG systems.

Data QualityDynamic RAGRetrieval Augmented Generation
0 likes · 10 min read
How to Build a Quantifiable Data Quality Framework for Dynamic Incremental RAG
Data Thinking Notes
Data Thinking Notes
Nov 16, 2025 · Artificial Intelligence

How AI Agents Transform Automation: Architecture, Challenges & Future Trends

This comprehensive overview examines AI agents powered by large language models, detailing their definition, core components, architectural patterns, key technologies such as prompt engineering and retrieval‑augmented generation, diverse application domains, current challenges, security solutions, and emerging research directions.

Prompt EngineeringRetrieval Augmented GenerationSecurity
0 likes · 81 min read
How AI Agents Transform Automation: Architecture, Challenges & Future Trends
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Nov 4, 2025 · Artificial Intelligence

How Alibaba Cloud’s PAI Powers Cutting‑Edge LLM Research at EMNLP 2025

EMNLP 2025 in Suzhou will feature Alibaba Cloud’s AI platform PAI presenting four accepted papers on knowledge distillation, small‑model reasoning, distilled reasoning models, and an automated RAG benchmark framework, alongside exhibition demos, networking events, and recruitment opportunities for AI talent.

AI PlatformEMNLP 2025Retrieval Augmented Generation
0 likes · 10 min read
How Alibaba Cloud’s PAI Powers Cutting‑Edge LLM Research at EMNLP 2025
DataFunSummit
DataFunSummit
Oct 30, 2025 · Artificial Intelligence

How Multimodal Large Models Are Revolutionizing Document Processing and OCR

This article explores how the explosion of unstructured data exposes the limits of traditional OCR and shows how emerging multimodal large language models provide end‑to‑end document understanding, reduce pipeline complexity, cut training costs, enable hybrid retrieval‑augmented generation, and drive real‑world industry deployments.

AIDocument ProcessingOCR
0 likes · 28 min read
How Multimodal Large Models Are Revolutionizing Document Processing and OCR
Xuanwu Backend Tech Stack
Xuanwu Backend Tech Stack
Oct 22, 2025 · Artificial Intelligence

How Rerank Transforms Retrieval‑Augmented Generation for Accurate AI Answers

This article explains the limitations of basic Retrieval‑Augmented Generation (RAG), introduces Rerank technology as a two‑step refinement process, compares dual‑encoder and cross‑encoder methods, and reviews popular Rerank models to help developers build more precise AI‑driven retrieval systems.

RAGRerankRetrieval Augmented Generation
0 likes · 10 min read
How Rerank Transforms Retrieval‑Augmented Generation for Accurate AI Answers
JD Tech Talk
JD Tech Talk
Oct 21, 2025 · Backend Development

How Backend Engineers Are Breaking Through AI with RAG Architectures

This article details a backend developer's two‑year AI journey, the challenges of rapid model advances, and how applying microservice principles to Retrieval‑Augmented Generation (RAG) creates a scalable, multi‑agent platform for insurance knowledge, memory, and intelligent agents.

Backend AIKnowledge BaseRAG
0 likes · 11 min read
How Backend Engineers Are Breaking Through AI with RAG Architectures
AI Large Model Application Practice
AI Large Model Application Practice
Oct 13, 2025 · Artificial Intelligence

How to Tame LLM Agents: Proven Strategies to Reduce Uncertainty and Boost Reliability

This article outlines practical techniques—including prompt engineering, domain fine‑tuning, retrieval‑augmented generation, structured outputs, workflow constraints, model parameter control, behavior rules, risk‑based AI participation, and comprehensive governance—to curb the unpredictability of large language model agents in enterprise settings.

AI AgentAI GovernanceLLM
0 likes · 18 min read
How to Tame LLM Agents: Proven Strategies to Reduce Uncertainty and Boost Reliability
DataFunSummit
DataFunSummit
Oct 9, 2025 · Artificial Intelligence

Why AI Coding Agents Still Struggle: Context Limits, Knowledge Gaps, and the Road to Human‑Like Assistants

This talk examines the core challenges facing AI coding agents—limited context windows, knowledge accumulation, and software‑engineering complexity—while outlining practical solutions such as context providing, RAG, fine‑tuning, online learning, feedback loops, and multi‑agent collaboration to move toward truly human‑like, continuously learning coding assistants.

AI CodingCoding AgentFeedback Loop
0 likes · 24 min read
Why AI Coding Agents Still Struggle: Context Limits, Knowledge Gaps, and the Road to Human‑Like Assistants
JD Cloud Developers
JD Cloud Developers
Sep 28, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Work?

This article explains Retrieval‑Augmented Generation (RAG), an AI framework that combines traditional information retrieval with large language models, covering its core workflow—from knowledge preparation, data cleaning, and metadata extraction to query preprocessing, vector retrieval, reranking, information integration, and final LLM generation, while also reviewing common embedding models and vector databases.

LLMRAGRetrieval Augmented Generation
0 likes · 13 min read
What Is Retrieval‑Augmented Generation (RAG) and How Does It Work?
Tech Freedom Circle
Tech Freedom Circle
Sep 25, 2025 · Artificial Intelligence

How RAGFlow’s Agent Engine Turns Retrieval into a Problem‑Solving AI

This article explains how RAGFlow upgrades a traditional RAG system from a passive question‑answer engine to an active problem‑solving agent by integrating the ReAct reasoning‑action‑observation loop, a visual canvas workflow, and a modular component‑tool ecosystem, with concrete Python implementations and code examples.

AI AgentsPythonRAGFlow
0 likes · 16 min read
How RAGFlow’s Agent Engine Turns Retrieval into a Problem‑Solving AI
Tech Freedom Circle
Tech Freedom Circle
Sep 25, 2025 · Artificial Intelligence

RAGFlow Primer Part 1: Introduction and Concept Deep Dive

This article provides a comprehensive technical overview of RAGFlow, an industrial‑grade Retrieval‑Augmented Generation platform, detailing its architecture, core components such as DeepDoc, intelligent chunking, embedding integration, multi‑stage retrieval, and agent workflow, while comparing it with traditional RAG shortcomings.

DeepDocIntelligent ChunkingKnowledge Base
0 likes · 32 min read
RAGFlow Primer Part 1: Introduction and Concept Deep Dive
Tech Freedom Circle
Tech Freedom Circle
Sep 25, 2025 · Artificial Intelligence

Inside RAGFlow: How Its Microservice Architecture Powers an Enterprise‑Grade Retrieval‑Augmented Generation Platform

This article provides a detailed technical walkthrough of RAGFlow's architecture, covering its microservice design, directory layout, layered structure, cloud‑native deployment, core modules such as DeepDoc, RAG engine, Agent system, and web UI, as well as multi‑tenant isolation, streaming responses, asynchronous task handling, concurrency controls, scalability strategies, and a complete request‑lifecycle example for document upload.

AI ArchitectureDeepDocDocker Compose
0 likes · 26 min read
Inside RAGFlow: How Its Microservice Architecture Powers an Enterprise‑Grade Retrieval‑Augmented Generation Platform
Huolala Tech
Huolala Tech
Sep 24, 2025 · Artificial Intelligence

How CID-GraphRAG Boosts Multi‑Turn AI Customer Service with Dual‑Layer Retrieval

The article introduces CID-GraphRAG, a novel framework that combines intent‑driven graphs with semantic similarity search to improve multi‑turn intelligent customer service, detailing its architecture, dual‑layer retrieval mechanism, evaluation against baseline models, and future research directions.

AIDialogue SystemsLLM
0 likes · 14 min read
How CID-GraphRAG Boosts Multi‑Turn AI Customer Service with Dual‑Layer Retrieval
Data Thinking Notes
Data Thinking Notes
Sep 21, 2025 · Artificial Intelligence

From RAG to DeepSearch & DeepResearch: How AI Is Mastering Knowledge Retrieval

Amid the rapid rise of generative AI, this article examines the limitations of large language models and explains how Retrieval‑Augmented Generation (RAG), followed by the advanced paradigms DeepSearch and DeepResearch, progressively enhance knowledge handling through dynamic retrieval, multi‑agent reasoning, and autonomous research capabilities.

AI Knowledge ManagementDeepResearchDeepSearch
0 likes · 16 min read
From RAG to DeepSearch & DeepResearch: How AI Is Mastering Knowledge Retrieval
DataFunTalk
DataFunTalk
Sep 19, 2025 · Artificial Intelligence

How Tencent’s Large Language Models Transform Business with RAG, GraphRAG, and Agents

This article examines Tencent's large language model deployments across diverse business scenarios, detailing how Retrieval‑Augmented Generation, GraphRAG, and autonomous agents boost model intelligence, improve user experience, and enable advanced content generation, understanding, and multi‑step reasoning.

Autonomous AgentsGraphRAGRetrieval Augmented Generation
0 likes · 4 min read
How Tencent’s Large Language Models Transform Business with RAG, GraphRAG, and Agents
Architecture & Thinking
Architecture & Thinking
Sep 12, 2025 · Artificial Intelligence

How Knowledge Graphs Turn Large Language Models into Trustworthy Experts

Integrating structured knowledge graphs with generative AI provides traceable, explainable, and high‑precision reasoning across domains such as medicine, finance, and law, through techniques like Retrieval‑Augmented Generation, graph neural networks, and adaptive planning, dramatically reducing hallucinations and boosting expert‑level performance.

AI hallucinationGraph Neural NetworkKnowledge Graph
0 likes · 12 min read
How Knowledge Graphs Turn Large Language Models into Trustworthy Experts
Architects Research Society
Architects Research Society
Sep 10, 2025 · Artificial Intelligence

From Vectors to Graphs to Hybrids: The Evolution of AI Knowledge Representation

This article explores the three stages of AI knowledge representation—vector embeddings, graph‑based structures, and the emerging hybrid approach that combines vectors, graphs, and large language models—to illustrate how modern Retrieval‑Augmented Generation systems achieve both semantic similarity and precise relational reasoning.

AIRetrieval Augmented Generationgraph databases
0 likes · 3 min read
From Vectors to Graphs to Hybrids: The Evolution of AI Knowledge Representation
Instant Consumer Technology Team
Instant Consumer Technology Team
Sep 5, 2025 · Artificial Intelligence

Why Context Engineering Is the Next Frontier for Large Language Models

This article surveys over 1,400 papers to define context engineering as a systematic discipline that structures retrieval, memory, tools, and multi‑agent coordination for LLMs, highlighting the critical asymmetry between understanding long contexts and generating equally complex outputs.

Context EngineeringLLM evaluationMemory Management
0 likes · 8 min read
Why Context Engineering Is the Next Frontier for Large Language Models
Alibaba Cloud Developer
Alibaba Cloud Developer
Sep 1, 2025 · Artificial Intelligence

Mastering RAG: From Chunking to Hybrid Search for Better AI Retrieval

This article delves into the implementation details and optimization strategies of Retrieval‑Augmented Generation (RAG), covering document chunking, index enhancement, embedding, hybrid search, and re‑ranking, and provides practical code examples to help developers move from quick deployment to deep performance tuning.

AIEmbeddingHybrid Search
0 likes · 19 min read
Mastering RAG: From Chunking to Hybrid Search for Better AI Retrieval
Tech Freedom Circle
Tech Freedom Circle
Aug 26, 2025 · Artificial Intelligence

How to Optimize RAG for Alibaba Interviews? 7 Golden Rules Explained

This article provides a step‑by‑step technical guide to optimizing Retrieval‑Augmented Generation (RAG) for interview scenarios, covering query rewriting, HyDE, fallback strategies, routing and prompt routing, multi‑representation indexing, hybrid retrieval, re‑ranking, self‑RAG, generation control, performance benchmarking, and a practical checklist with concrete code examples and metrics.

AI InterviewHybrid RetrievalIndex Optimization
0 likes · 30 min read
How to Optimize RAG for Alibaba Interviews? 7 Golden Rules Explained
DaTaobao Tech
DaTaobao Tech
Aug 25, 2025 · Artificial Intelligence

Mastering RAG: From Quick Start to Deep Optimization Strategies

This article dives into the practical implementation of Retrieval‑Augmented Generation (RAG), covering document chunking, semantic and reverse HyDE indexing, embedding, hybrid search, and re‑ranking techniques, and provides concrete code examples and optimization tips for building high‑performance AI applications.

EmbeddingHybrid SearchRAG
0 likes · 18 min read
Mastering RAG: From Quick Start to Deep Optimization Strategies
Volcano Engine Developer Services
Volcano Engine Developer Services
Aug 21, 2025 · Artificial Intelligence

Why Prompt Engineering Isn’t Enough: The Rise of Context Engineering and RAG

Since last year, the debate over “Prompt Engineering” has split between practitioners who favor “Context Engineering” for building scalable agent systems and scholars who treat Prompt Engineering as a broad umbrella term, highlighting the need to dynamically construct and manage context for reliable, extensible AI applications.

AI AgentsLLMPrompt Engineering
0 likes · 33 min read
Why Prompt Engineering Isn’t Enough: The Rise of Context Engineering and RAG
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Jul 30, 2025 · Artificial Intelligence

How MCP‑RAG Overcomes Prompt Inflation for Massive LLM Service Calls

This article analyzes the prompt‑inflation bottleneck that arises when large language models (LLMs) must handle thousands of Model Context Protocol (MCP) services, and introduces the MCP‑RAG architecture—a retrieval‑augmented generation solution that builds a metadata knowledge base and intelligent retrieval layer to enable precise, efficient MCP service discovery at scale.

AILLMMCP
0 likes · 21 min read
How MCP‑RAG Overcomes Prompt Inflation for Massive LLM Service Calls
JD Tech
JD Tech
Jul 29, 2025 · Artificial Intelligence

How Causal Inference Meets Large Language Models to Revolutionize E‑commerce Pricing

This article describes a QCon talk that combines causal inference with large language models to build a retrieval‑augmented generation pricing system for e‑commerce, detailing the three‑step algorithm, LLM‑driven modeling challenges, process‑reward tree search, reinforcement‑learning fine‑tuning, and experimental gains in accuracy and speed.

Retrieval Augmented Generationcausal inferencee‑commerce pricing
0 likes · 17 min read
How Causal Inference Meets Large Language Models to Revolutionize E‑commerce Pricing
AI2ML AI to Machine Learning
AI2ML AI to Machine Learning
Jul 24, 2025 · Artificial Intelligence

Exploring Recent Large‑Model Agent Papers: Insights and Analyses

This article reviews a series of recent research papers on large‑model agents, covering topics such as reinforcement‑learning‑driven ML agents, premise‑critique ability of LLMs, long‑term tool‑augmented LLM evaluation, agentic RAG, set‑based retrieval for multi‑hop QA, mobile VLM agents, and broader surveys of LLM applications, summarizing each work’s problem statement, prior approaches, novel contributions, experimental results, limitations, and future directions.

Agentic AIBenchmarkLLM evaluation
0 likes · 46 min read
Exploring Recent Large‑Model Agent Papers: Insights and Analyses
DataFunTalk
DataFunTalk
Jul 21, 2025 · Artificial Intelligence

From Prompt Engineering to Context Engineering: Transforming LLM Interactions

This article traces the evolution from prompt engineering to context engineering, detailing technical milestones, core concepts, practical strategies, and future trends that together reshape large language model applications and enable sophisticated AI agents across diverse domains.

Memory ManagementPrompt EngineeringRetrieval Augmented Generation
0 likes · 35 min read
From Prompt Engineering to Context Engineering: Transforming LLM Interactions
Instant Consumer Technology Team
Instant Consumer Technology Team
Jul 14, 2025 · Artificial Intelligence

9 Essential Technologies for Building Scalable AI Agents

An in‑depth guide outlines the nine core technologies—ranging from autonomous agent fundamentals and multi‑agent collaboration to workflow orchestration, retrieval‑augmented generation, fine‑tuning, function calling, model context protocols, agent‑to‑agent communication, and AI‑driven UI—required to design, deploy, and scale enterprise‑grade AI agents.

AI AgentsAgent ArchitectureFunction Calling
0 likes · 9 min read
9 Essential Technologies for Building Scalable AI Agents
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Jul 12, 2025 · Artificial Intelligence

Why GraphRAG Is the Future of Retrieval‑Augmented Generation

This article explains how GraphRAG combines knowledge graphs with retrieval‑augmented generation to overcome the limitations of vector‑only RAG, delivering higher accuracy, better explainability, easier development, and stronger governance for generative AI applications across various domains.

AIGraphRAGKnowledge Graph
0 likes · 23 min read
Why GraphRAG Is the Future of Retrieval‑Augmented Generation
AI Algorithm Path
AI Algorithm Path
Jul 3, 2025 · Artificial Intelligence

Exploring Advanced, Graph, and Agentic RAG: The Evolution of Retrieval‑Augmented Generation

This article examines how Retrieval‑Augmented Generation (RAG) has progressed from simple keyword‑based retrieval to advanced semantic methods, modular architectures, graph‑enhanced reasoning, and autonomous agentic systems, highlighting each approach's workflow, benefits, limitations, and the shift toward dynamic AI decision‑making.

AIAgentic RAGGraph RAG
0 likes · 7 min read
Exploring Advanced, Graph, and Agentic RAG: The Evolution of Retrieval‑Augmented Generation
AIWalker
AIWalker
Jun 30, 2025 · Artificial Intelligence

Chinese Team Builds First AI That Understands Film, Using 440K Shot Library for Director‑Level Camera Moves

FilMaster is a pioneering AI system that learns cinematic principles from a 440,000‑shot movie database, combines multimodal LLMs, RAG, and audience‑centric rhythm control to generate editable, high‑quality films, and outperforms prior methods by over 50% on the new FilmEval benchmark.

AI film generationFilmEval benchmarkRetrieval Augmented Generation
0 likes · 18 min read
Chinese Team Builds First AI That Understands Film, Using 440K Shot Library for Director‑Level Camera Moves
AI Algorithm Path
AI Algorithm Path
Jun 26, 2025 · Artificial Intelligence

The 10 Essential Components of a Retrieval‑Augmented Generation (RAG) System

This guide breaks down the ten core building blocks of a production‑ready RAG pipeline—from input handling and vector stores to prompt engineering, LLM inference, observability, and evaluation—showing why each piece matters, common pitfalls, and practical best‑practice recommendations.

LLMObservabilityPrompt Engineering
0 likes · 9 min read
The 10 Essential Components of a Retrieval‑Augmented Generation (RAG) System
AI2ML AI to Machine Learning
AI2ML AI to Machine Learning
Jun 6, 2025 · Artificial Intelligence

Tackling the Top Challenges of Retrieval‑Augmented Generation (RAG)

The article enumerates common pitfalls of Retrieval‑Augmented Generation—such as missing content, low‑rank document misses, context limits, format errors, incomplete answers, scalability bottlenecks, complex PDF extraction, data‑quality issues, domain adaptation gaps, hallucinations, and feedback‑loop deficiencies—and offers concrete mitigation strategies ranging from data cleaning and prompt design to hybrid search, hierarchical retrieval, document compression, and automated evaluation.

Data QualityHybrid SearchLLM
0 likes · 9 min read
Tackling the Top Challenges of Retrieval‑Augmented Generation (RAG)
IT Services Circle
IT Services Circle
Jun 6, 2025 · Artificial Intelligence

Master Retrieval‑Augmented Generation (RAG): From Basics to Advanced Practices

This article introduces Retrieval‑Augmented Generation (RAG), explains its core components—knowledge embedding, retriever, and generator—covers practical system construction, optimization techniques, evaluation metrics, and advanced paradigms such as GraphRAG and Multi‑Modal RAG, while highlighting a comprehensive guidebook for hands‑on implementation.

AIKnowledge RetrievalRAG
0 likes · 12 min read
Master Retrieval‑Augmented Generation (RAG): From Basics to Advanced Practices
Instant Consumer Technology Team
Instant Consumer Technology Team
Jun 5, 2025 · Artificial Intelligence

How DeepSeek AI Transforms Government Search with Smarter, Faster Answers

This article explains how DeepSeek's large‑model‑driven search system overcomes traditional keyword‑matching limits, improves long‑tail query coverage, and delivers personalized, accurate government service results through intent parsing, knowledge‑graph retrieval, and generative optimization.

Retrieval Augmented Generationartificial intelligencegovernment services
0 likes · 9 min read
How DeepSeek AI Transforms Government Search with Smarter, Faster Answers
Alibaba Cloud Developer
Alibaba Cloud Developer
Jun 5, 2025 · Artificial Intelligence

How Deep (Re)Search Transforms Code Search and AI-Powered Knowledge Retrieval

This article systematically explains the concepts of Deep Search and Deep Research, contrasts them with traditional Retrieval‑Augmented Generation, reviews leading commercial and open‑source solutions, details their architecture for code retrieval, and outlines future plans for specialized code‑search agents.

AI researchKnowledge RetrievalRetrieval Augmented Generation
0 likes · 13 min read
How Deep (Re)Search Transforms Code Search and AI-Powered Knowledge Retrieval
Architect
Architect
May 5, 2025 · Artificial Intelligence

How Agentic RAG‑R1 Turns Retrieval‑Augmented Generation into an Autonomous AI Agent

Agentic RAG‑R1, an open‑source project from Peking University, combines Retrieval‑Augmented Generation with an agentic AI loop, introduces the GRPO reinforcement‑learning optimizer, supports LoRA‑based fine‑tuning, quantization and multimodal tool calls, and demonstrates significant accuracy gains on the MedQA benchmark across both Chinese and English test sets.

Agentic AILLM Tool UseRetrieval Augmented Generation
0 likes · 8 min read
How Agentic RAG‑R1 Turns Retrieval‑Augmented Generation into an Autonomous AI Agent
DataFunSummit
DataFunSummit
Apr 21, 2025 · Artificial Intelligence

Deep Integration of Knowledge Graphs and Large Language Models: Methods, Applications, and Future Directions

This article explores how knowledge graphs can be tightly integrated with large language models through prompt engineering, fine‑tuning, retrieval‑augmented generation, reasoning collaboration, and knowledge agents, outlining technical pathways, practical implementations, and future research directions across AI domains.

AIKnowledge GraphRetrieval Augmented Generation
0 likes · 23 min read
Deep Integration of Knowledge Graphs and Large Language Models: Methods, Applications, and Future Directions
Sohu Tech Products
Sohu Tech Products
Apr 9, 2025 · Artificial Intelligence

Boost LLM Retrieval Accuracy with MCP: A Step‑by‑Step Guide

This tutorial explains how to overcome the limitations of Retrieval‑Augmented Generation by using the Model Context Protocol (MCP) together with a MongoDB database, providing detailed setup steps, configuration examples, and performance comparisons that demonstrate significantly higher query precision for large language models.

AI AgentsMCPMongoDB
0 likes · 24 min read
Boost LLM Retrieval Accuracy with MCP: A Step‑by‑Step Guide
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Apr 8, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Boost AI Accuracy?

This article explains Retrieval‑Augmented Generation (RAG), its three‑step workflow of retrieval, augmentation, and generation, its key advantages such as improved accuracy and explainability, and compares RAG with traditional pre‑trained models, fine‑tuned models, hybrid models, knowledge‑distillation methods, and RLHF, while also covering vector, full‑text, and hybrid retrieval modes and the role of rerank models.

AIKnowledge RetrievalRAG
0 likes · 18 min read
What Is Retrieval‑Augmented Generation (RAG) and How Does It Boost AI Accuracy?
Tencent Cloud Developer
Tencent Cloud Developer
Apr 2, 2025 · Artificial Intelligence

Understanding Retrieval‑Augmented Generation (RAG): Concepts, Types, and Development

Retrieval‑Augmented Generation (RAG) enhances large language models by fetching up‑to‑date external knowledge before generation, mitigating knowledge‑cutoff limits and hallucinations through a retrieval step (using text, vector, or graph methods) and a generation step, evolving from naive single‑method approaches to advanced, modular, graph‑based, and agentic systems that enable adaptive, multi‑hop reasoning and future intelligent, multimodal pipelines.

AIAgentic AIKnowledge Retrieval
0 likes · 9 min read
Understanding Retrieval‑Augmented Generation (RAG): Concepts, Types, and Development
Architect
Architect
Mar 30, 2025 · Artificial Intelligence

What Is Retrieval-Augmented Generation? A Deep Dive into RAG Techniques

This article provides a comprehensive survey of Retrieval‑Augmented Generation (RAG), covering its basic principles, key components, seven technical variants, challenges, evaluation methods, and future research directions across multimodal, graph‑based, and agentic extensions.

AI SurveyKnowledge RetrievalRAG
0 likes · 9 min read
What Is Retrieval-Augmented Generation? A Deep Dive into RAG Techniques
Alibaba Cloud Native
Alibaba Cloud Native
Mar 19, 2025 · Artificial Intelligence

Mastering Retrieval‑Augmented Generation with Spring AI: A Complete Guide

This article explains the Retrieval‑Augmented Generation (RAG) paradigm, walks through its four core steps, and provides a detailed Spring AI implementation—including configuration, vector storage, REST controller, multi‑query expansion, query rewriting, document joining, and error handling—plus best‑practice recommendations for production deployments.

AIJavaRAG
0 likes · 23 min read
Mastering Retrieval‑Augmented Generation with Spring AI: A Complete Guide