Tagged articles
894 articles
Page 5 of 9
BirdNest Tech Talk
BirdNest Tech Talk
Oct 21, 2025 · Artificial Intelligence

How Vector Stores Enable Lightning‑Fast Semantic Search in LangChain

This article explains what vector stores are, outlines their core workflow of adding, querying, and searching embeddings, compares popular back‑ends like FAISS, Chroma, and Pinecone, and walks through a complete Chinese‑language example using LangChain’s FAISS integration with detailed code and result analysis.

AIFAISSLangChain
0 likes · 10 min read
How Vector Stores Enable Lightning‑Fast Semantic Search in LangChain
BirdNest Tech Talk
BirdNest Tech Talk
Oct 16, 2025 · Artificial Intelligence

Mastering Text Splitting in LangChain: From Theory to Code

This guide explains why large documents must be broken into semantic chunks for LLMs, introduces core parameters like chunk_size and chunk_overlap, compares LangChain's various splitters, and walks through a complete Python example that loads a long text, configures a RecursiveCharacterTextSplitter, and inspects the resulting chunks.

EmbeddingLangChainRAG
0 likes · 9 min read
Mastering Text Splitting in LangChain: From Theory to Code
DataFunSummit
DataFunSummit
Oct 16, 2025 · Artificial Intelligence

How Chat BI Transforms Data Warehousing with AI: Unlock Real‑Time Insights

This presentation by iQIYI’s Technical Director Zhang Xiaoming details the evolution of BI systems, introduces the Chat BI framework, explains its three‑step implementation, outlines architectural design, data‑warehouse integration, performance optimizations, and user‑operation strategies, revealing how AI and RAG empower smarter data analytics.

AIBIChatBI
0 likes · 18 min read
How Chat BI Transforms Data Warehousing with AI: Unlock Real‑Time Insights
Alibaba Cloud Native
Alibaba Cloud Native
Oct 15, 2025 · Cloud Native

What’s New in Higress 2.0? 30 Updates Including RAG MCP Server and Performance Fixes

The Higress 2.0 release introduces 30 changes—13 new features such as a RAG MCP server and ECDS‑based configuration refactor, 7 bug fixes, 5 refactorings, documentation updates and a test improvement—providing developers with enhanced knowledge‑management capabilities, more stable routing, and clearer documentation for cloud‑native service‑mesh environments.

MCPRAGRelease Notes
0 likes · 20 min read
What’s New in Higress 2.0? 30 Updates Including RAG MCP Server and Performance Fixes
Xiaohe Frontend Team
Xiaohe Frontend Team
Oct 15, 2025 · Artificial Intelligence

REFRAG: Using Tiny Models to Compress RAG for Faster, Smarter AI

Meta’s new REFRAG framework lets a lightweight encoder compress retrieved text into semantic tags, enabling large language models to answer queries with far fewer tokens, lower latency, and higher throughput, while preserving core meaning and allowing flexible placement of compressed information within prompts.

LLM efficiencyRAGmodel compression
0 likes · 8 min read
REFRAG: Using Tiny Models to Compress RAG for Faster, Smarter AI
Practical DevOps Architecture
Practical DevOps Architecture
Oct 14, 2025 · Artificial Intelligence

Master AI Agents: From Basics to Advanced Multi-Model Development

This comprehensive AI agent development course covers 18 chapters, ranging from fundamental concepts and architecture to large‑model integration, tool and browser control, memory, RAG self‑learning, sandboxing, database manipulation, multi‑agent architectures, code assistance, and a real‑world frontend automation project, complete with source code and documentation.

AI agentsLangChainRAG
0 likes · 3 min read
Master AI Agents: From Basics to Advanced Multi-Model Development
DataFunTalk
DataFunTalk
Oct 13, 2025 · Artificial Intelligence

How Tencent Uses RAG, GraphRAG, and Agents to Power Large Language Model Applications

This article examines Tencent's large language model deployments across diverse business scenarios, detailing core use cases such as content generation, intelligent customer service, and role‑playing, while explaining the underlying technologies of Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and Agent systems.

AI applicationsRAGSupervised Fine‑Tuning
0 likes · 4 min read
How Tencent Uses RAG, GraphRAG, and Agents to Power Large Language Model Applications
DataFunTalk
DataFunTalk
Oct 11, 2025 · Artificial Intelligence

How Tencent’s LLM Powers Real‑World Apps with RAG, GraphRAG & Agents

This article explores Tencent’s large language model deployments across diverse business scenarios—content generation, intelligent customer service, and role‑playing—detailing the underlying RAG, GraphRAG, and Agent technologies, their principles, practical implementations, and the advantages they bring to enterprise AI solutions.

AILLMRAG
0 likes · 5 min read
How Tencent’s LLM Powers Real‑World Apps with RAG, GraphRAG & Agents
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Oct 10, 2025 · Artificial Intelligence

How Ontologies Boost Large Language Models: A Comprehensive Review

This review examines how formal knowledge representations (ontologies) can be integrated with large language models to enhance reasoning, reduce hallucinations, and improve factual reliability, outlining three roles—information provider, reasoner, validator—while analyzing recent frameworks, open‑source projects, and future research challenges.

AIOntologyRAG
0 likes · 29 min read
How Ontologies Boost Large Language Models: A Comprehensive Review
JD Tech
JD Tech
Oct 9, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Boost AI Accuracy?

This article explains Retrieval‑Augmented Generation (RAG), an AI framework that combines external knowledge retrieval with large language models, covering its motivations, data preparation, chunking strategies, vectorization, storage, query processing, retrieval, reranking, prompt engineering, and LLM generation, plus practical optimization tips.

LLMRAGchunking
0 likes · 14 min read
What Is Retrieval‑Augmented Generation (RAG) and How Does It Boost AI Accuracy?
DataFunSummit
DataFunSummit
Sep 28, 2025 · Artificial Intelligence

Unlocking Enterprise Knowledge: Building Multimodal AI Systems with LLMs

This article examines the challenges of processing massive multimodal data in enterprises and presents a knowledge‑augmentation framework that leverages Retrieval‑Augmented Generation, memory‑inspired architecture, and feedback loops to enable reliable, scalable AI‑driven decision making across diverse business scenarios.

Enterprise KnowledgeKnowledge GraphLLM
0 likes · 29 min read
Unlocking Enterprise Knowledge: Building Multimodal AI Systems with LLMs
Volcano Engine Developer Services
Volcano Engine Developer Services
Sep 28, 2025 · Artificial Intelligence

Demystifying AI Jargon: A Beginner’s Guide to Large Language Models

This guide breaks down the complex terminology of large language models—explaining tokens, transformers, self‑attention, RAG, scaling laws, dense vs. sparse architectures, and training stages—using clear analogies and step‑by‑step explanations so readers can confidently understand and work with modern AI systems.

AI fundamentalsModel TrainingRAG
0 likes · 35 min read
Demystifying AI Jargon: A Beginner’s Guide to Large Language Models
Data STUDIO
Data STUDIO
Sep 28, 2025 · Artificial Intelligence

Top Reranker Models for RAG in 2025: A Comparative Review

This article explains why initial retrieval in Retrieval‑Augmented Generation often yields noisy results, describes how rerankers act as quality filters to improve relevance, compares the leading 2025 reranker models—including Cohere, bge‑reranker, Voyage, Jina, FlashRank, and MixedBread—and provides code snippets, evaluation metrics, and guidance for selecting the right model for specific use cases.

AICross-EncoderLLM
0 likes · 31 min read
Top Reranker Models for RAG in 2025: A Comparative Review
JD Tech Talk
JD Tech Talk
Sep 28, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Power Modern AI?

This article explains Retrieval‑Augmented Generation (RAG), an AI framework that combines traditional information retrieval with large language models, detailing its core workflow—from knowledge preparation, chunking, and embedding to vector database storage and the question‑answering stage—while highlighting key challenges, tools, and optimization strategies.

AIEmbeddingLLM
0 likes · 15 min read
What Is Retrieval‑Augmented Generation (RAG) and How Does It Power Modern AI?
JD Cloud Developers
JD Cloud Developers
Sep 28, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Work?

This article explains Retrieval‑Augmented Generation (RAG), an AI framework that combines traditional information retrieval with large language models, covering its core workflow—from knowledge preparation, data cleaning, and metadata extraction to query preprocessing, vector retrieval, reranking, information integration, and final LLM generation, while also reviewing common embedding models and vector databases.

LLMRAGRetrieval Augmented Generation
0 likes · 13 min read
What Is Retrieval‑Augmented Generation (RAG) and How Does It Work?
Tencent Advertising Technology
Tencent Advertising Technology
Sep 27, 2025 · Artificial Intelligence

How AI‑Generated Test Cases Transformed Tencent Ads R&D Workflow

This article details how Tencent's advertising R&D team tackled lengthy, experience‑driven test case creation by deploying AIGC‑powered demand analysis, Prompt + RAG knowledge retrieval, and multi‑stage automated validation, ultimately boosting test case adoption from under 20% to nearly 60% while reducing manual effort and iteration time.

AI testingAIGCPrompt engineering
0 likes · 14 min read
How AI‑Generated Test Cases Transformed Tencent Ads R&D Workflow
Bilibili Tech
Bilibili Tech
Sep 26, 2025 · Artificial Intelligence

How RAG Transforms Natural Language Queries into Accurate SQL for Business Users

This article explains how Retrieval‑Augmented Generation (RAG) combines large language models with vector databases to let non‑technical staff query massive membership data using plain language, detailing the workflow, technical architecture, optimization challenges, and real‑world impact on data‑driven decision making.

AIData PlatformLLM
0 likes · 17 min read
How RAG Transforms Natural Language Queries into Accurate SQL for Business Users
BirdNest Tech Talk
BirdNest Tech Talk
Sep 25, 2025 · Artificial Intelligence

Mastering LangChain: A Hands‑On Guide to Building LLM Applications

This repository offers a comprehensive, step‑by‑step LangChain tutorial series that walks developers through installation, the LangChain Expression Language, streaming, parallel execution, callbacks, serialization, model customization, prompt templates, memory, multimodal support, and advanced tools like LangGraph and LangSmith, enabling the creation of sophisticated AI applications.

AI DevelopmentLLMLangChain
0 likes · 9 min read
Mastering LangChain: A Hands‑On Guide to Building LLM Applications
DataFunSummit
DataFunSummit
Sep 24, 2025 · Artificial Intelligence

Taming LLM Hallucinations: Strategies and Solutions from 360

This article explores the problem of large‑model hallucinations, explains its definitions and classifications, analyzes root causes in data, algorithms and inference, and presents detection methods and practical mitigation techniques such as RAG, decoding strategies, and model‑enhancement approaches, illustrated with real‑world 360 use cases and future research directions.

AI SafetyLLMModel Alignment
0 likes · 22 min read
Taming LLM Hallucinations: Strategies and Solutions from 360
Bighead's Algorithm Notes
Bighead's Algorithm Notes
Sep 21, 2025 · Artificial Intelligence

FinKario: Event‑Enhanced Financial Knowledge Graphs Boost A‑Share Sharpe Ratio to 4.9

This article reviews the FinKario paper, which introduces an event‑augmented financial knowledge graph and a two‑stage RAG retrieval strategy that together enable real‑time knowledge updates and efficient integration of long‑form research reports, yielding a Sharpe ratio of 4.9 and outperforming baseline LLMs and institutional strategies in back‑testing.

FinKarioLLMRAG
0 likes · 10 min read
FinKario: Event‑Enhanced Financial Knowledge Graphs Boost A‑Share Sharpe Ratio to 4.9
DataFunSummit
DataFunSummit
Sep 19, 2025 · Artificial Intelligence

How Tencent Leverages LLMs: RAG, GraphRAG, and Agents in Real‑World Apps

This article examines Tencent's large language model deployments across diverse business scenarios, detailing core use cases such as content generation, intelligent customer service, and role‑play, and explains the underlying technologies—Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and intelligent agents—that enable these applications.

AILLMRAG
0 likes · 4 min read
How Tencent Leverages LLMs: RAG, GraphRAG, and Agents in Real‑World Apps
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Sep 18, 2025 · Artificial Intelligence

How to Diagnose and Optimize RAG Systems When 30% Answers Miss the Mark

This guide explains why RAG systems often produce off‑topic answers, outlines how to measure hit‑rate, retrieval, reranking and generation metrics, provides step‑by‑step evaluation pipelines, code examples, real‑world case studies, and interview‑ready templates for diagnosing and optimizing each stage of the pipeline.

AIGenerationPipeline
0 likes · 18 min read
How to Diagnose and Optimize RAG Systems When 30% Answers Miss the Mark
Data STUDIO
Data STUDIO
Sep 18, 2025 · Artificial Intelligence

Build a RAG App from Scratch: Master Text Chunking, Vector Retrieval, and Coreference Resolution

This tutorial walks through building a Retrieval‑Augmented Generation (RAG) system from the ground up, covering document parsing, text chunking strategies, vector store creation with ChromaDB, semantic search, prompt engineering for LLMs, conversation memory, coreference handling, and practical optimization tips, all illustrated with complete Python code.

ChromaDBPythonRAG
0 likes · 19 min read
Build a RAG App from Scratch: Master Text Chunking, Vector Retrieval, and Coreference Resolution
Zhuanzhuan Tech
Zhuanzhuan Tech
Sep 17, 2025 · Artificial Intelligence

LLM‑Powered Intent Understanding, RAG QA, and Knowledge Base Maintenance for Recycling

This article details how Zhuanzhuan leverages large language models to enhance on‑site device inspection through a three‑stage pipeline—intent understanding, retrieval‑augmented generation QA, and automated knowledge‑base upkeep—highlighting technical innovations, workflow integration, and the resulting operational benefits.

AIIntent UnderstandingKnowledge Base
0 likes · 14 min read
LLM‑Powered Intent Understanding, RAG QA, and Knowledge Base Maintenance for Recycling
DataFunSummit
DataFunSummit
Sep 17, 2025 · Artificial Intelligence

How Tencent’s Large Language Model Powers Real-World AI Applications

This article explores Tencent’s large language model across diverse business scenarios—content generation, intelligent customer service, role‑playing, and more—detailing the principles and practical uses of Retrieval‑Augmented Generation (RAG), GraphRAG, and Agent technologies, and how they enhance model intelligence and user experience.

AIKnowledge GraphRAG
0 likes · 4 min read
How Tencent’s Large Language Model Powers Real-World AI Applications
Architecture & Thinking
Architecture & Thinking
Sep 17, 2025 · Artificial Intelligence

How the 32B ‘Zhiyu’ Model is Revolutionizing Intelligent Operations

The Zhiyu model, a 32‑billion‑parameter SRE‑focused LLM, combines extensive domain knowledge, enhanced professional skills, and deterministic RAG to deliver precise, actionable insights for intelligent operations, backed by a robust multi‑source training pipeline, staged training, and flexible deployment options.

AI OperationsModel TrainingRAG
0 likes · 7 min read
How the 32B ‘Zhiyu’ Model is Revolutionizing Intelligent Operations
DataFunTalk
DataFunTalk
Sep 15, 2025 · Artificial Intelligence

How AI+Data Agents Are Transforming the Automotive Industry’s Digital Leap

In an interview, Di Xingxing of Autohome details their AI+Data framework—unified lake‑warehouse, intelligent engine, and agent services—that breaks data silos, blends traditional models with LLMs, leverages causal inference and RAG knowledge bases, and uses continuous feedback to build explainable, evolving data agents for accurate sales forecasting, competitive analysis, and end‑to‑end business automation in the automotive industry.

AIRAGautomotive
0 likes · 10 min read
How AI+Data Agents Are Transforming the Automotive Industry’s Digital Leap
AI Cyberspace
AI Cyberspace
Sep 15, 2025 · Artificial Intelligence

What Is Agentic AI? From LLM Limits to Autonomous AI Agents

Agentic AI transforms static large language models into autonomous agents by adding perception, goal orientation, planning, action, interaction, and iterative loops, tracing its evolution from early chatbots through Prompt Engineering, ReAct, AutoGPT, OpenAI Function Calling, to modern multi‑agent frameworks, while addressing challenges like memory, hallucinations, and scalability.

Agentic AIMulti-AgentRAG
0 likes · 38 min read
What Is Agentic AI? From LLM Limits to Autonomous AI Agents
DataFunSummit
DataFunSummit
Sep 14, 2025 · Artificial Intelligence

How Tencent’s Large Language Models Boost Business with RAG, GraphRAG, and AI Agents

This article examines Tencent's large language model deployments across various business scenarios, detailing the use of Retrieval‑Augmented Generation, GraphRAG for role‑playing, and Agent technologies, while also outlining core application areas and the three main technical approaches—SFT, RAG, and Agents.

AI agentsAI applicationsGraphRAG
0 likes · 4 min read
How Tencent’s Large Language Models Boost Business with RAG, GraphRAG, and AI Agents
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Sep 13, 2025 · Artificial Intelligence

Choosing the Right Path for RAG Development: Low‑Code Platforms vs Open‑Source Frameworks

This article compares low‑code development platforms with open‑source large‑model frameworks such as LangChain and LlamaIndex, outlining their features, advantages, limitations, and suitability for building retrieval‑augmented generation (RAG) applications in various enterprise scenarios.

AI DevelopmentLangChainLlamaIndex
0 likes · 13 min read
Choosing the Right Path for RAG Development: Low‑Code Platforms vs Open‑Source Frameworks
Data Party THU
Data Party THU
Sep 11, 2025 · Artificial Intelligence

How ComRAG Revolutionizes Real‑Time Community QA with Dynamic Vector Stores

ComRAG tackles the static‑knowledge gaps, uneven QA quality, and storage explosion of community question‑answer platforms by integrating a static documentation vector store with dual dynamic CQA stores managed via a centroid‑based memory, delivering higher accuracy, lower latency, and scalable storage for industrial retrieval‑augmented generation.

Community QADynamic RetrievalLLM
0 likes · 7 min read
How ComRAG Revolutionizes Real‑Time Community QA with Dynamic Vector Stores
Instant Consumer Technology Team
Instant Consumer Technology Team
Sep 11, 2025 · Artificial Intelligence

How REFRAG Cuts LLM Decoding Time by 30×: A New Efficient RAG Framework

REFRAG (REpresentation For RAG) introduces a novel decoding framework that compresses, senses, and expands context using precomputed chunk embeddings, achieving up to 30.85× faster first-token generation and 16× larger context windows without sacrificing perplexity, as validated across diverse long‑context tasks.

LLMRAGchunk embeddings
0 likes · 18 min read
How REFRAG Cuts LLM Decoding Time by 30×: A New Efficient RAG Framework
Continuous Delivery 2.0
Continuous Delivery 2.0
Sep 11, 2025 · Artificial Intelligence

Building Scalable Enterprise RAG: Lessons, Pitfalls, and Proven Solutions

This article shares practical lessons from building a large‑scale enterprise RAG system, covering imperfect data, document quality scoring, hierarchical chunking, metadata design, semantic‑search failures, open‑source model choices, and table handling to achieve reliable AI‑driven search.

Enterprise AIOpen-source modelsRAG
0 likes · 13 min read
Building Scalable Enterprise RAG: Lessons, Pitfalls, and Proven Solutions
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Sep 11, 2025 · Artificial Intelligence

How AST Boosts LLM‑Powered Code Question Answering: Theory, Practice, and Future Directions

This article explores how abstract syntax trees (AST) can enrich large language model (LLM) based code question‑answering by providing precise structural context, detailing LLM strengths and limits, describing AST‑LLM collaboration, RAG integration, cutting‑edge models, practical tooling, challenges, standardisation efforts, and future research avenues.

ASTLLMRAG
0 likes · 30 min read
How AST Boosts LLM‑Powered Code Question Answering: Theory, Practice, and Future Directions
DaTaobao Tech
DaTaobao Tech
Sep 10, 2025 · Frontend Development

How AI Cut Front‑End Development Time by 60% in Alibaba’s Giraffe Search

This article details how the author transformed a constrained Weex/Muise front‑end project for the “giraffe” search page into an AI‑driven workflow, building a structured knowledge base, defining project‑level rules, and using RAG techniques to accelerate component, tracking, and payment integration, ultimately reducing development time by 60% and proposing a new “AI programming as context engineering” paradigm.

AIKnowledge BaseMuise
0 likes · 14 min read
How AI Cut Front‑End Development Time by 60% in Alibaba’s Giraffe Search
DataFunTalk
DataFunTalk
Sep 10, 2025 · Artificial Intelligence

Why RAG is Evolving: From Retrieval to Integrated Reasoning, Memory, and Multimodal AI

This article explores how Retrieval‑Augmented Generation (RAG) is transitioning from basic retrieve‑and‑generate pipelines to a unified architecture that incorporates reasoning chains, agent layers, knowledge graphs, Monte‑Carlo Tree Search, reinforcement learning, sophisticated memory management, and multimodal tensor‑based retrieval, while addressing engineering challenges such as storage expansion, re‑ranking, and index dimensionality.

AI reasoningRAGRetrieval-Augmented Generation
0 likes · 19 min read
Why RAG is Evolving: From Retrieval to Integrated Reasoning, Memory, and Multimodal AI
Architecture Breakthrough
Architecture Breakthrough
Sep 7, 2025 · Industry Insights

Why Arrogance Blocks You From Riding the AI Wave—and How to Overcome It

The article argues that arrogance, not lack of knowledge, hinders individuals from seizing AI opportunities, outlines four psychological barriers—unseen, undervalued, incomprehensible, and too late—and provides practical steps such as prompt engineering, RAG, fine‑tuning, and AI agents to actively engage with the AI wave.

AIPrompt engineeringRAG
0 likes · 11 min read
Why Arrogance Blocks You From Riding the AI Wave—and How to Overcome It
Data Party THU
Data Party THU
Sep 5, 2025 · Artificial Intelligence

What a PRISMA Review Uncovers About Retrieval‑Augmented Generation (RAG)

This systematic PRISMA review analyzes 128 highly‑cited RAG papers, covering five major databases, 343 datasets, a detailed technical roadmap, evaluation metrics from EM to LLM‑as‑Judge, and future research directions, showing that RAG has evolved into a complex, programmable, and auditable distributed system.

AIDatasetsEvaluation Metrics
0 likes · 5 min read
What a PRISMA Review Uncovers About Retrieval‑Augmented Generation (RAG)
Instant Consumer Technology Team
Instant Consumer Technology Team
Sep 5, 2025 · Artificial Intelligence

How Context Engineering Transforms Dify Agents: Boost Efficiency by 10×

This article explains how Context Engineering (CE) extends Prompt Engineering by integrating seven core elements—system prompts, user input, short‑term memory, long‑term memory, retrieval, tools, and structured output—using the open‑source Dify platform to build dynamic, multimodal agents that cut inference costs tenfold and raise complex‑task success rates by 40%.

AI Agent DevelopmentDifyLLM
0 likes · 16 min read
How Context Engineering Transforms Dify Agents: Boost Efficiency by 10×
DataFunSummit
DataFunSummit
Sep 4, 2025 · Artificial Intelligence

Unlocking Elasticsearch Vector Search: From Basics to RAG Implementation

This article explores the evolving search demands of the intelligent era, explains dense and sparse vector concepts, details Elasticsearch's vector search capabilities and recent performance breakthroughs, introduces hybrid and relevance‑tuning techniques, and demonstrates RAG principles and real‑world enterprise use cases.

AIElasticsearchHybrid Search
0 likes · 14 min read
Unlocking Elasticsearch Vector Search: From Basics to RAG Implementation
Data Party THU
Data Party THU
Sep 3, 2025 · Artificial Intelligence

Unlocking Large Model Secrets: Transformers, MoE, Fine‑Tuning, RAG & KV Caching

This article provides a comprehensive technical overview of today’s large‑model ecosystem, covering the Transformer architecture, Mixture‑of‑Experts extensions, five fine‑tuning methods, the evolution from traditional RAG to agentic RAG, classic agent design patterns, diverse text‑chunking strategies, and the KV‑cache optimization that accelerates inference.

Agentic AIFine‑tuningKV cache
0 likes · 13 min read
Unlocking Large Model Secrets: Transformers, MoE, Fine‑Tuning, RAG & KV Caching
Efficient Ops
Efficient Ops
Sep 2, 2025 · Artificial Intelligence

How AI Is Revolutionizing Knowledge‑Base Building for Smarter Operations

At the 27th GOPS Global Operations Conference in Shanghai (Oct 17‑18, 2025), Professor Wang Peng of Fudan University will reveal how large language models can extract and structure heterogeneous operational data into high‑quality knowledge bases, and how RAG‑driven Q&A enhances fault diagnosis, SOP generation, and automated decision‑making.

Intelligent OperationsKnowledge BaseRAG
0 likes · 3 min read
How AI Is Revolutionizing Knowledge‑Base Building for Smarter Operations
AI2ML AI to Machine Learning
AI2ML AI to Machine Learning
Sep 2, 2025 · Artificial Intelligence

Why Enterprise Large‑Model Digitalization Is So Hard: Key Challenges and Capabilities

The article analyzes why enterprise‑wide large‑model AI projects face steep hurdles, outlining required human capabilities, historical labor shifts, current hot technologies such as RAG, Agent, CoT and multimodal, their limits, a three‑stage implementation roadmap, typical case pitfalls, and the key success factors for sustainable digital transformation.

CoTDigital TransformationEnterprise AI
0 likes · 15 min read
Why Enterprise Large‑Model Digitalization Is So Hard: Key Challenges and Capabilities
Instant Consumer Technology Team
Instant Consumer Technology Team
Sep 2, 2025 · Artificial Intelligence

Why RAG Is Dead: Jeff Huber’s 5 Retrieval Secrets and Context Engineering

Jeff Huber, founder of Chroma, argues that traditional RAG is obsolete, introduces context engineering as the new paradigm, and shares five practical retrieval strategies, a complete pipeline, and insights on handling context rot, memory, and generative benchmarking to build production‑grade AI applications.

AIContext EngineeringGenerative Benchmarking
0 likes · 11 min read
Why RAG Is Dead: Jeff Huber’s 5 Retrieval Secrets and Context Engineering
Alibaba Cloud Developer
Alibaba Cloud Developer
Sep 1, 2025 · Artificial Intelligence

Mastering RAG: From Chunking to Hybrid Search for Better AI Retrieval

This article delves into the implementation details and optimization strategies of Retrieval‑Augmented Generation (RAG), covering document chunking, index enhancement, embedding, hybrid search, and re‑ranking, and provides practical code examples to help developers move from quick deployment to deep performance tuning.

AIEmbeddingHybrid Search
0 likes · 19 min read
Mastering RAG: From Chunking to Hybrid Search for Better AI Retrieval
Data Thinking Notes
Data Thinking Notes
Aug 31, 2025 · Artificial Intelligence

Embedding's Role in Retrieval‑Augmented Generation: Basics, Challenges & Future

This article explains how embedding technology converts unstructured data into vector representations, powers precise retrieval in Retrieval‑Augmented Generation (RAG), outlines the evolution of embedding models, discusses current challenges such as long‑text handling and domain adaptation, and highlights emerging solutions.

AIEmbeddingRAG
0 likes · 12 min read
Embedding's Role in Retrieval‑Augmented Generation: Basics, Challenges & Future
Xiaolei Talks DB
Xiaolei Talks DB
Aug 28, 2025 · Databases

How AI Is Transforming Databases: Highlights from China’s DTCC2025

At DTCC2025 in Beijing, industry leaders showcased AI-driven innovations, vector database advances, RAG techniques, and distributed database performance breakthroughs, illustrating how databases are evolving from passive data stores into intelligent, autonomous systems that boost efficiency, scalability, and business value across sectors.

AIDistributed SystemsRAG
0 likes · 10 min read
How AI Is Transforming Databases: Highlights from China’s DTCC2025
Data Thinking Notes
Data Thinking Notes
Aug 26, 2025 · Artificial Intelligence

From Prompt to Context: How AI Agents Evolve into Proactive Intelligence

This article explores the rapid growth of large language models and explains how AI agents transform passive, single‑turn responses into proactive, continuous intelligence by leveraging a core “Prompt→Context→Action” loop, detailing their architecture, key components, challenges, and future directions.

AI AgentLLM architecturePrompt engineering
0 likes · 20 min read
From Prompt to Context: How AI Agents Evolve into Proactive Intelligence
Tech Freedom Circle
Tech Freedom Circle
Aug 26, 2025 · Artificial Intelligence

How to Optimize RAG for Alibaba Interviews? 7 Golden Rules Explained

This article provides a step‑by‑step technical guide to optimizing Retrieval‑Augmented Generation (RAG) for interview scenarios, covering query rewriting, HyDE, fallback strategies, routing and prompt routing, multi‑representation indexing, hybrid retrieval, re‑ranking, self‑RAG, generation control, performance benchmarking, and a practical checklist with concrete code examples and metrics.

AI InterviewHybrid RetrievalIndex Optimization
0 likes · 30 min read
How to Optimize RAG for Alibaba Interviews? 7 Golden Rules Explained
Alibaba Cloud Native
Alibaba Cloud Native
Aug 26, 2025 · Artificial Intelligence

Boost Dify’s RAG Performance with Higress AI Gateway: Two Integration Strategies

This guide explains how to overcome Dify's built‑in RAG limitations by using Higress AI Gateway to connect external RAG services, detailing two integration patterns—RAG Retrieval Agent and Automatic Retrieval Injection—along with step‑by‑step configuration, validation, and the resulting benefits for enterprise AI applications.

DifyIntegrationKnowledge Retrieval
0 likes · 13 min read
Boost Dify’s RAG Performance with Higress AI Gateway: Two Integration Strategies
DataFunSummit
DataFunSummit
Aug 25, 2025 · Artificial Intelligence

Building Xiaomi’s Vertical Domain QA Agent: From RAG to Real‑World Deployment

This article explains how Xiaomi designed and deployed a vertical‑domain question‑answering assistant for product and car queries, covering business background, a four‑module RAG‑plus‑LLM architecture, knowledge‑base construction, custom chunking strategies, dynamic signal handling, and the challenges overcome to achieve reliable real‑time voice interactions.

Agent ArchitectureLLMRAG
0 likes · 22 min read
Building Xiaomi’s Vertical Domain QA Agent: From RAG to Real‑World Deployment
DaTaobao Tech
DaTaobao Tech
Aug 25, 2025 · Artificial Intelligence

Mastering RAG: From Quick Start to Deep Optimization Strategies

This article dives into the practical implementation of Retrieval‑Augmented Generation (RAG), covering document chunking, semantic and reverse HyDE indexing, embedding, hybrid search, and re‑ranking techniques, and provides concrete code examples and optimization tips for building high‑performance AI applications.

EmbeddingHybrid SearchRAG
0 likes · 18 min read
Mastering RAG: From Quick Start to Deep Optimization Strategies
Fun with Large Models
Fun with Large Models
Aug 22, 2025 · Artificial Intelligence

Step‑by‑Step Guide: Building a PDF‑Based RAG Knowledge Base with LangChain, Streamlit, DashScope & DeepSeek

This tutorial shows how to create a lightweight Retrieval‑Augmented Generation (RAG) system that indexes multiple PDF files, stores their embeddings in a FAISS vector database, and answers user queries through a LangChain agent powered by DashScope embeddings and the DeepSeek‑Chat model, all wrapped in a Streamlit UI.

DashscopeDeepSeekFAISS
0 likes · 13 min read
Step‑by‑Step Guide: Building a PDF‑Based RAG Knowledge Base with LangChain, Streamlit, DashScope & DeepSeek
Volcano Engine Developer Services
Volcano Engine Developer Services
Aug 21, 2025 · Artificial Intelligence

Why Prompt Engineering Isn’t Enough: The Rise of Context Engineering and RAG

Since last year, the debate over “Prompt Engineering” has split between practitioners who favor “Context Engineering” for building scalable agent systems and scholars who treat Prompt Engineering as a broad umbrella term, highlighting the need to dynamically construct and manage context for reliable, extensible AI applications.

AI agentsLLMPrompt engineering
0 likes · 33 min read
Why Prompt Engineering Isn’t Enough: The Rise of Context Engineering and RAG
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 21, 2025 · Artificial Intelligence

Why Your AI Defect Deduplication Returns Mixed Data and How to Fix It

This article details the challenges of building an AI‑powered defect deduplication system using Retrieval‑Augmented Generation, explains why LLMs produce composite (spliced) results, diagnoses the root cause as information loss in the RAG pipeline, and presents a step‑by‑step solution that restores atomicity of records for reliable duplicate detection.

AI debuggingKnowledge BaseLLM
0 likes · 14 min read
Why Your AI Defect Deduplication Returns Mixed Data and How to Fix It
JD Retail Technology
JD Retail Technology
Aug 20, 2025 · Artificial Intelligence

Launch Multi-Agent AI Systems with OxyGent in Just 20 Lines of Code

Learn how to quickly set up OxyGent, a flexible AI agent framework, by installing Python, Node.js, and the MCP tools, configuring environment variables, and using just 20 lines of code to build, debug, and deploy multi‑agent applications with features like RAG, tool integration, and distributed execution.

DeploymentMCPMulti-Agent
0 likes · 5 min read
Launch Multi-Agent AI Systems with OxyGent in Just 20 Lines of Code
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Aug 20, 2025 · Artificial Intelligence

How DeepSearch Elevates RAG: From RAG 1.0 to a Multi‑Agent AI Search Engine

This article explains how Alibaba Cloud OpenSearch LLM version evolved from RAG 1.0 to RAG 2.0, introducing the DeepSearch multi‑agent architecture that combines offline data processing, online query handling, planning, clarification, search, and summarization agents to deliver more accurate and complex AI‑driven answers.

AI searchDeepSearchLLM
0 likes · 10 min read
How DeepSearch Elevates RAG: From RAG 1.0 to a Multi‑Agent AI Search Engine
Instant Consumer Technology Team
Instant Consumer Technology Team
Aug 20, 2025 · Backend Development

How I Built a Production‑Ready RAG Service in 3 Weeks Using AI Coding Tools

In just three weeks, I single‑handedly created a production‑grade Retrieval‑Augmented Generation (RAG) API with FastAPI, leveraging Cursor and Claude Code to automate coding, testing, and deployment, and I share practical insights on AI‑assisted development, high cohesion‑low coupling design, TDD, git worktree parallelism, and agent orchestration.

AI CodingFastAPIRAG
0 likes · 19 min read
How I Built a Production‑Ready RAG Service in 3 Weeks Using AI Coding Tools
Instant Consumer Technology Team
Instant Consumer Technology Team
Aug 19, 2025 · Artificial Intelligence

Mastering Document Chunking for RAG: Strategies, Code & Best Practices

This article explores why proper document chunking is crucial for Retrieval‑Augmented Generation, explains core concepts like context windows and signal‑to‑noise, compares various chunking strategies—from simple fixed‑size splits to semantic and hybrid approaches—and provides practical Python code examples to help you build more effective RAG pipelines.

LLMRAGText Splitting
0 likes · 24 min read
Mastering Document Chunking for RAG: Strategies, Code & Best Practices
Data Thinking Notes
Data Thinking Notes
Aug 17, 2025 · Artificial Intelligence

Unlocking AI Agents: From Basics to Real-World Development

This article provides a comprehensive overview of AI Agents, covering their fundamental concepts, core features, technical evolution, work cycle, architectural modules, key technologies such as prompt engineering and RAG, practical development steps, a data‑analysis agent case study, and typical industry applications.

AI AgentAgent ArchitecturePrompt engineering
0 likes · 13 min read
Unlocking AI Agents: From Basics to Real-World Development
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 15, 2025 · Artificial Intelligence

Mastering AI Agents: Prompt Engineering, Workflows, and RAG Strategies

This article systematically explains how to build reliable, high‑performance AI agents by focusing on the core components—LLM, prompts, workflows, RAG, and tools—while covering prompt engineering techniques, DSL‑based workflow design, vector‑database knowledge bases, security against prompt injection, and practical project planning.

AI AgentLLMRAG
0 likes · 15 min read
Mastering AI Agents: Prompt Engineering, Workflows, and RAG Strategies
Tencent Technical Engineering
Tencent Technical Engineering
Aug 14, 2025 · Artificial Intelligence

Why Do Large Language Models Hallucinate? Causes, Risks, and Multi‑Dimensional Solutions

This article systematically examines the root causes of hallucinations in large language models, evaluates their pros and cons, and presents a comprehensive set of optimization techniques—including prompt engineering, RAG, sampling tweaks, supervised fine‑tuning, LoRA, RLHF, chain‑of‑thought reasoning, and agent/workflow designs—to build more reliable and trustworthy AI applications.

AILLMLoRA
0 likes · 29 min read
Why Do Large Language Models Hallucinate? Causes, Risks, and Multi‑Dimensional Solutions
DaTaobao Tech
DaTaobao Tech
Aug 13, 2025 · Artificial Intelligence

Unlocking AI Power: A Complete Guide to Prompt Engineering and Advanced Techniques

This article explores the emerging field of prompt engineering, detailing its fundamentals, advanced strategies such as chain‑of‑thought, ReAct, and structured frameworks, and demonstrates practical applications in AI agents for data retrieval, SQL generation, and market insight, offering actionable guidance for developers and business users alike.

AI agentsData RetrievalRAG
0 likes · 42 min read
Unlocking AI Power: A Complete Guide to Prompt Engineering and Advanced Techniques
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 11, 2025 · Artificial Intelligence

How Fine‑Tuning Large Models Solves Code Upgrade Challenges and Boosts Stable Module Matching

This article details an innovative approach that uses large‑model supervised fine‑tuning to overcome the instability of code RAG and code agents during open‑source repository upgrades, addressing domain‑specific terminology, code style differences, and improving recall, accuracy, and deployment efficiency.

AI agentsFine-tuningLLM
0 likes · 11 min read
How Fine‑Tuning Large Models Solves Code Upgrade Challenges and Boosts Stable Module Matching
AI Large Model Application Practice
AI Large Model Application Practice
Aug 11, 2025 · Artificial Intelligence

How to Build an LLM-Powered Smart Resume Screening System

This article presents a detailed design and implementation of an LLM‑based intelligent resume matching system that combines semantic vector retrieval, structured rule filtering, multi‑dimensional weighted scoring, and natural‑language interaction to create a fast, quantifiable, and explainable hiring pipeline.

AI RecruitmentLLMRAG
0 likes · 18 min read
How to Build an LLM-Powered Smart Resume Screening System
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Aug 8, 2025 · Artificial Intelligence

Can GitOps Power Low‑Cost LLM Agents? A Hands‑On Exploration

This article examines how the Manus sandbox and CodeAct mechanisms inspire a GitOps‑based approach to building LLM agents, detailing the design of planner and executor components, workflow steps, advantages such as RAG and observability, and the potential for low‑cost, scalable intelligent agent development.

AI agentsGitOpsIntelligent agents
0 likes · 12 min read
Can GitOps Power Low‑Cost LLM Agents? A Hands‑On Exploration
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Aug 8, 2025 · Artificial Intelligence

Unlocking Big Data Ops with Large Models: Opportunities, Challenges, Design

This article summarizes a Cloud Summit talk where Alibaba Cloud’s AI expert Zhang Yingying explains how large language models can enhance big‑data intelligent operations, covering opportunities, challenges, RAG‑based Q&A, multi‑agent diagnostics, and the engineering architecture needed for reliable, scalable deployment.

AI EngineeringBig Data OperationsRAG
0 likes · 20 min read
Unlocking Big Data Ops with Large Models: Opportunities, Challenges, Design
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Aug 8, 2025 · Artificial Intelligence

What Von Neumann’s Brain Theory Reveals About Prompt Engineering for LLMs

The article explores how Von Neumann’s insights on the brain‑computer analogy illuminate modern large‑language‑model prompt engineering, comparing logical reasoning chains, memory mechanisms, and DSL‑driven computation to improve accuracy, reduce hallucinations, and balance reasoning depth with precise calculation.

DSLPrompt engineeringRAG
0 likes · 14 min read
What Von Neumann’s Brain Theory Reveals About Prompt Engineering for LLMs
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Aug 8, 2025 · Industry Insights

How CodeRAG Reinvents Large‑Scale Code Repository Knowledge Extraction and Hierarchical Retrieval

CodeRAG leverages AST‑centric parsing and a hierarchical knowledge graph to overcome text‑only retrieval limits in large code repositories, offering multi‑language analysis, incremental parsing, hybrid indexing, and intelligent context selection for tasks such as code completion, Q&A, documentation generation, and impact analysis.

ASTCodeRAGLarge-Scale Repos
0 likes · 15 min read
How CodeRAG Reinvents Large‑Scale Code Repository Knowledge Extraction and Hierarchical Retrieval
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 5, 2025 · Databases

How PolarDB IMCI Unifies Vector Search and Embedding in One SQL Engine

This article explains how PolarDB IMCI integrates vector indexing and embedding directly into the database kernel, offering a unified, transactional, and real‑time vector lifecycle management service that lets developers build RAG knowledge bases and AI applications using only standard SQL, dramatically reducing development and operational complexity.

AIPolardbRAG
0 likes · 11 min read
How PolarDB IMCI Unifies Vector Search and Embedding in One SQL Engine
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 5, 2025 · Artificial Intelligence

Mastering Intent Detection & Slot Filling: Proven Strategies and Code Samples

This article shares reusable AI development techniques for intent detection and slot filling, comparing four solution tiers—from simple prompt engineering to advanced RAG‑enhanced architectures—complete with practical code snippets, performance trade‑offs, and guidance on selecting the optimal approach for reliable conversational agents.

Intent DetectionNLUPrompt engineering
0 likes · 27 min read
Mastering Intent Detection & Slot Filling: Proven Strategies and Code Samples
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Aug 4, 2025 · Artificial Intelligence

How RAG and Long‑Term Memory Turn AI into a Truly Remembering Assistant

This article explains how Retrieval‑Augmented Generation (RAG) and long‑term memory systems like MenoBase enable large language models to overcome short‑term memory limits, dynamically retrieve up‑to‑date knowledge, and personalize interactions, with practical Dify implementation steps and real‑world use cases across industries.

AIDifyKnowledge Base
0 likes · 18 min read
How RAG and Long‑Term Memory Turn AI into a Truly Remembering Assistant
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Jul 30, 2025 · Artificial Intelligence

How MCP‑RAG Overcomes Prompt Inflation for Massive LLM Service Calls

This article analyzes the prompt‑inflation bottleneck that arises when large language models (LLMs) must handle thousands of Model Context Protocol (MCP) services, and introduces the MCP‑RAG architecture—a retrieval‑augmented generation solution that builds a metadata knowledge base and intelligent retrieval layer to enable precise, efficient MCP service discovery at scale.

AILLMMCP
0 likes · 21 min read
How MCP‑RAG Overcomes Prompt Inflation for Massive LLM Service Calls
Ops Development Stories
Ops Development Stories
Jul 29, 2025 · Artificial Intelligence

Master AI Agents with LangGraph: Build Adaptive RAG, Translation, and ReAct Agents

This comprehensive guide explains what an AI Agent is, its core capabilities and design patterns, and walks through step‑by‑step implementations of RAG, Translation, and ReAct agents using LangGraph, complete with code samples, workflow diagrams, and practical tips for building personal ops knowledge‑base agents.

LLMLangGraphRAG
0 likes · 64 min read
Master AI Agents with LangGraph: Build Adaptive RAG, Translation, and ReAct Agents
SF Technology Team
SF Technology Team
Jul 29, 2025 · Artificial Intelligence

How SF Tech’s Proprietary Large Models Revolutionize Logistics and AI Operations

The DA Data Intelligence Conference in Shenzhen showcased SF Tech’s breakthroughs in large‑model AI, revealing how their proprietary multimodal models, RAG innovations, and agent platforms dramatically improve logistics decision‑making, resource scheduling, and customer service across multiple industries.

AI OperationsAgent PlatformRAG
0 likes · 11 min read
How SF Tech’s Proprietary Large Models Revolutionize Logistics and AI Operations
Architecture and Beyond
Architecture and Beyond
Jul 27, 2025 · Artificial Intelligence

What Makes an AI Agent Tick? From Expert Systems to Modern Architectures

This article traces the evolution of AI agents from early expert systems to today’s multimodal, memory‑rich agents, explains their perception, reasoning, memory and action modules, discusses model selection, prompt engineering, RAG techniques, and highlights current limitations such as hallucinations, reliability, cost, and security.

AI AgentFunction CallingMemory Architecture
0 likes · 28 min read
What Makes an AI Agent Tick? From Expert Systems to Modern Architectures
JD Tech Talk
JD Tech Talk
Jul 23, 2025 · Artificial Intelligence

Causal Inference + LLMs: Transforming E‑Commerce Pricing Strategies

This article describes how integrating causal inference with large language models and Retrieval‑Augmented Generation can automate and explain e‑commerce product pricing, detailing the three‑step workflow, reinforcement‑learning rewards, experimental results, and future directions for end‑to‑end RAG‑LLM training.

RAGcausal inferencee‑commerce pricing
0 likes · 15 min read
Causal Inference + LLMs: Transforming E‑Commerce Pricing Strategies
Zhuanzhuan Tech
Zhuanzhuan Tech
Jul 23, 2025 · Artificial Intelligence

Why AI‑Generated Code Often Misses the Mark and How a Code Knowledge Base Fixes It

AI‑generated code frequently fails to match project conventions due to lack of contextual memory, but building a dynamic code knowledge base combined with Retrieval‑Augmented Generation (RAG) enables precise, compliant code output, reduces errors, accelerates development, and transforms AI into a project‑specific assistant.

AICode GenerationKnowledge Base
0 likes · 13 min read
Why AI‑Generated Code Often Misses the Mark and How a Code Knowledge Base Fixes It
Tencent Cloud Developer
Tencent Cloud Developer
Jul 23, 2025 · Artificial Intelligence

Why Retrieval‑Augmented Generation Is Evolving Into Agentic AI Search

This article explains how the inherent knowledge limits of large language models drive the rise of Retrieval‑Augmented Generation (RAG), outlines its three evolutionary stages, introduces Agentic RAG and DeepSearch, and discusses the knowledge and ability boundaries that shape future AI search systems.

AI searchAgentic AIDeepSearch
0 likes · 19 min read
Why Retrieval‑Augmented Generation Is Evolving Into Agentic AI Search
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Jul 18, 2025 · Artificial Intelligence

Video: Building an Intelligent Knowledge‑Base Q&A System with Large Models and Elasticsearch (RAG)

The video walks through the differences between traditional keyword search and vector search, explains the core concept of Retrieval‑Augmented Generation, and demonstrates how to construct a knowledge‑base Q&A system using a large language model integrated with Elasticsearch.

ElasticsearchKnowledge BaseQ&A system
0 likes · 1 min read
Video: Building an Intelligent Knowledge‑Base Q&A System with Large Models and Elasticsearch (RAG)
DataFunSummit
DataFunSummit
Jul 16, 2025 · Artificial Intelligence

How Tencent Cloud ES Powers RAG with Hybrid Search and Massive Vector Optimizations

This article explores how Tencent Cloud Elasticsearch combines decades of text search expertise with cutting‑edge vector retrieval and large language models to deliver a one‑stop Retrieval‑Augmented Generation solution, detailing the underlying models, hybrid search architecture, performance tricks, and real‑world case studies.

ElasticsearchHybrid SearchLLM
0 likes · 24 min read
How Tencent Cloud ES Powers RAG with Hybrid Search and Massive Vector Optimizations
DataFunSummit
DataFunSummit
Jul 15, 2025 · Artificial Intelligence

Unlocking Semantic Search: Elasticsearch Vector Search & RAG Applications

This article explains why traditional keyword search falls short, introduces Elasticsearch's vector search and hybrid retrieval capabilities, and shows how combining it with large language models enables Retrieval‑Augmented Generation (RAG) for more accurate, context‑aware AI-driven search across text and multimedia data.

AIElasticsearchRAG
0 likes · 5 min read
Unlocking Semantic Search: Elasticsearch Vector Search & RAG Applications
Tencent Cloud Developer
Tencent Cloud Developer
Jul 15, 2025 · Artificial Intelligence

How RAG Evolved: From Naive to Agentic – A Complete Guide

This article systematically outlines the evolution of Retrieval‑Augmented Generation (RAG) from its naive three‑step pipeline to advanced, modular, and agentic architectures, highlighting each generation's motivations, core features, advantages, drawbacks, and practical implementation details for large language model applications.

Agentic RAGLLMModular RAG
0 likes · 20 min read
How RAG Evolved: From Naive to Agentic – A Complete Guide