Tagged articles
891 articles
Page 3 of 9
DeepHub IMBA
DeepHub IMBA
Mar 17, 2026 · Artificial Intelligence

Advanced RAG Techniques: Boosting Retrieval with Query Translation and Decomposition

The article examines how retrieval‑augmented generation suffers from poor query formulation and presents two advanced strategies—query translation, which generates multiple semantically similar variants, and query decomposition, which breaks complex questions into finer sub‑queries—detailing methods such as fan‑out retrieval, reciprocal rank fusion, HyDE, step‑back prompting, and chain‑of‑thought retrieval, and explains when to combine them.

Hybrid RetrievalLLMQuery Decomposition
0 likes · 9 min read
Advanced RAG Techniques: Boosting Retrieval with Query Translation and Decomposition
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Mar 17, 2026 · Artificial Intelligence

Mastering Chunk Splitting for RAG: From Fixed Length to Semantic Segmentation

Chunk splitting, a critical yet often overlooked step in RAG pipelines, dramatically impacts retrieval recall and LLM output quality; this guide walks through three evolution stages—from naive fixed‑length splits to sentence‑aware overlaps and finally semantic, structure‑driven segmentation—complete with code, experiments, and practical pitfalls.

LLMRAGchunking
0 likes · 15 min read
Mastering Chunk Splitting for RAG: From Fixed Length to Semantic Segmentation
DeepHub IMBA
DeepHub IMBA
Mar 15, 2026 · Artificial Intelligence

BookRAG: A Tree‑Graph Fusion RAG Framework for Hierarchical Documents

BookRAG introduces a tree‑graph fused Retrieval‑Augmented Generation framework that builds a native document index combining hierarchical layout trees with fine‑grained knowledge graphs, and employs an Information‑Foraging‑Theory‑inspired agent to dynamically navigate queries across complex, multi‑section documents.

Knowledge GraphRAGagent-based retrieval
0 likes · 11 min read
BookRAG: A Tree‑Graph Fusion RAG Framework for Hierarchical Documents
SpringMeng
SpringMeng
Mar 14, 2026 · Artificial Intelligence

How Do Skills, MCP, RAG, and Agents Relate in OpenClaw?

The article explains OpenClaw’s four‑layer architecture—Agent, Memory, RAG, and Skills—detailing how each component (including Function Call and MCP) works, how they differ from platforms like Dify, and provides practical security guidelines for running the open‑source AI agent framework.

AI AgentMCPMemory
0 likes · 15 min read
How Do Skills, MCP, RAG, and Agents Relate in OpenClaw?
DeepHub IMBA
DeepHub IMBA
Mar 13, 2026 · Artificial Intelligence

Why Bigger Context Windows Make RAG Essential, Not Redundant

Although expanding LLM context windows seems to eliminate the need for Retrieval‑Augmented Generation, in practice larger windows dilute attention and cause retrieval failures, so RAG remains crucial for filtering high‑signal content and maintaining answer quality.

AI ArchitectureAttention DilutionLLM
0 likes · 7 min read
Why Bigger Context Windows Make RAG Essential, Not Redundant
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Mar 13, 2026 · Artificial Intelligence

Why Every RAG System Needs Smart Query Understanding and Routing

The article explains how diverse user queries in a RAG‑based insurance system require intent classification, entity extraction, and multi‑path routing to choose between vector search, calculation, database lookup, or chit‑chat, and outlines practical rule‑ML‑LLM hybrid solutions with safety safeguards.

LLMQuery UnderstandingRAG
0 likes · 11 min read
Why Every RAG System Needs Smart Query Understanding and Routing
Su San Talks Tech
Su San Talks Tech
Mar 12, 2026 · Artificial Intelligence

Demystifying OpenClaw: How Agents, RAG, Memory, and Skills Power AI Automation

OpenClaw is an open‑source AI agent platform that integrates core concepts such as Agents, Retrieval‑Augmented Generation, Memory, Function Calling, and the proprietary Skills protocol, offering a four‑layer architecture, configurable workspaces, SQLite‑backed persistence, and practical deployment guidance while highlighting security best practices.

AI AgentFunction CallingMemory
0 likes · 14 min read
Demystifying OpenClaw: How Agents, RAG, Memory, and Skills Power AI Automation
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Mar 12, 2026 · Artificial Intelligence

How to Build Cross-Session Memory for RAG Chatbots: Short‑Term vs Long‑Term Strategies

This article explains the role of memory modules in Retrieval‑Augmented Generation systems, compares short‑term and long‑term memory techniques, outlines storage and retrieval methods, discusses management strategies like forgetting and deduplication, and compares LangChain and LlamaIndex implementations for practical deployment.

LLMLangChainLlamaIndex
0 likes · 11 min read
How to Build Cross-Session Memory for RAG Chatbots: Short‑Term vs Long‑Term Strategies
AI Engineering
AI Engineering
Mar 11, 2026 · Artificial Intelligence

Google Gemini Embedding 2: One Model for All Media Types

Google’s newly released Gemini Embedding 2 is the first truly native multimodal embedding model that processes text, images, video, audio, and PDFs within a single vector space, cutting latency by 70% and boosting recall by 20% compared to chained‑model pipelines.

Gemini Embedding 2Google AIRAG
0 likes · 4 min read
Google Gemini Embedding 2: One Model for All Media Types
AI Waka
AI Waka
Mar 11, 2026 · Artificial Intelligence

Why Context Engineering Is the Secret to Smarter AI Agents

The article explains how context engineering—designing the entire information environment for large language models—overcomes prompt engineering limits, mitigates context decay, and improves speed, accuracy, and cost by strategically selecting, compressing, ordering, isolating, and formatting context for production‑grade AI agents.

AI agentsAWS BedrockContext Engineering
0 likes · 24 min read
Why Context Engineering Is the Secret to Smarter AI Agents
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Mar 11, 2026 · Artificial Intelligence

Taming Hallucinations and Multi‑Turn Failures in RAG Systems

This article breaks down the final‑mile challenges of Retrieval‑Augmented Generation—hallucinations, broken multi‑turn dialogue, prompt design, citation, and feedback loops—and provides concrete, layered solutions ranging from hard‑coded prompts and few‑shot examples to query rewriting, history management, post‑processing filters, and self‑check mechanisms.

Prompt engineeringRAGcitation
0 likes · 15 min read
Taming Hallucinations and Multi‑Turn Failures in RAG Systems
AI Explorer
AI Explorer
Mar 11, 2026 · Artificial Intelligence

Gemini Embedding 2: Google’s First Native Multimodal Embedding Model

Google’s Gemini Embedding 2 introduces a native multimodal embedding model that maps text, images, video, audio, and documents into a single vector space, offers three configurable dimensions, achieves state‑of‑the‑art benchmarks across modalities, and enables cross‑modal search, RAG, and seamless integration with major vector databases.

AI modelsGemini EmbeddingMatryoshka representation
0 likes · 8 min read
Gemini Embedding 2: Google’s First Native Multimodal Embedding Model
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Mar 10, 2026 · Artificial Intelligence

RRF vs Weighted Sum in RAG: Boost Retrieval, Solve Timeliness & Interview Challenges

This article explains why Reciprocal Rank Fusion often outperforms weighted‑sum fusion in Retrieval‑Augmented Generation, presents a three‑layer approach to keep knowledge bases timely, discusses HyDE’s cost‑benefit trade‑offs, and offers concrete interview‑ready answers for common RAG follow‑up questions.

HyDEHybrid RetrievalInterview Tips
0 likes · 13 min read
RRF vs Weighted Sum in RAG: Boost Retrieval, Solve Timeliness & Interview Challenges
Shi's AI Notebook
Shi's AI Notebook
Mar 9, 2026 · Artificial Intelligence

Unpacking the Hype: A Clear Map of LLM, RAG, Agent and Agent Platforms

The article explains why the buzz around AI agents can mislead learners, breaks down overlapping concepts such as LLM, RAG, Tool Use, Agent, Code Agent, and Agent Platform into distinct layers, and outlines a step‑by‑step learning plan to build a solid conceptual map.

AI conceptsAgentAgent Platform
0 likes · 9 min read
Unpacking the Hype: A Clear Map of LLM, RAG, Agent and Agent Platforms
Data STUDIO
Data STUDIO
Mar 9, 2026 · Artificial Intelligence

Boost RAG Accuracy from 60% to 94% with 11 Proven Strategies

This article dissects why naive Retrieval‑Augmented Generation (RAG) often yields only 60% accuracy, then presents eleven concrete ingestion, query, and hybrid techniques—complete with code samples, performance trade‑offs, and real‑world case studies—that together can raise RAG accuracy to 94% while outlining practical implementation roadmaps and common pitfalls.

EmbeddingKnowledge GraphLLM
0 likes · 31 min read
Boost RAG Accuracy from 60% to 94% with 11 Proven Strategies
AI Explorer
AI Explorer
Mar 8, 2026 · Artificial Intelligence

Qwen-Agent: An Open-Source Agent Framework Empowering Complex AI Applications

Qwen-Agent, an open‑source agent development framework built on Qwen large models (≥3.0), integrates function calling, code interpreter, RAG, and MCP support, offering ready‑to‑run demos, GUI tools, and extensive documentation to help developers quickly build and customize sophisticated AI agents.

AI agentsCode InterpreterFunction Calling
0 likes · 7 min read
Qwen-Agent: An Open-Source Agent Framework Empowering Complex AI Applications
Data Party THU
Data Party THU
Mar 8, 2026 · Artificial Intelligence

6 Practical Context‑Engineering Techniques to Tame RAG Hallucinations

This article explains why retrieval‑augmented generation (RAG) models often hallucinate, introduces the concept of context engineering, and details six practical techniques—including selective retrieval, context compression, hierarchical layout, dynamic query rewriting, memory management, and tool‑aware context—along with their trade‑offs and real‑world impact.

AIContext EngineeringLLM
0 likes · 23 min read
6 Practical Context‑Engineering Techniques to Tame RAG Hallucinations
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Mar 7, 2026 · Artificial Intelligence

Mastering Offline Document Parsing for RAG: From PDFs to Multimodal Knowledge Bases

This article provides a comprehensive guide to offline document parsing for Retrieval‑Augmented Generation, covering multi‑format extraction, layout analysis, OCR pitfalls, chunking strategies, hierarchical metadata tagging, and how these steps directly affect retrieval accuracy and overall RAG performance.

Document ParsingRAGmetadata
0 likes · 14 min read
Mastering Offline Document Parsing for RAG: From PDFs to Multimodal Knowledge Bases
SpringMeng
SpringMeng
Mar 7, 2026 · Artificial Intelligence

LangChain4j vs Spring AI: Which Java AI Framework Is Right for Your Project?

The article compares LangChain4j and Spring AI across design philosophy, core features, ecosystem integration, community maturity, and learning curve, providing concrete code examples, a feature‑richness matrix, and practical selection guidelines to help Java developers choose the most suitable AI framework for their needs.

AI frameworksAgentComparison
0 likes · 15 min read
LangChain4j vs Spring AI: Which Java AI Framework Is Right for Your Project?
Java Web Project
Java Web Project
Mar 7, 2026 · Artificial Intelligence

Why AgentScope Java Beats Python for Multi‑Agent AI Development

AgentScope Java, Alibaba's open‑source multi‑agent framework, lets Java developers build autonomous assistants and collaborative agents with built‑in ReAct reasoning, RAG, memory, and enterprise‑grade integrations, offering a compelling alternative to Python‑centric AI stacks and Spring AI Alibaba.

AIAgentScopeFramework
0 likes · 10 min read
Why AgentScope Java Beats Python for Multi‑Agent AI Development
Architecture Digest
Architecture Digest
Mar 6, 2026 · Artificial Intelligence

AgentScope Java: Unlock Multi‑Agent AI Development Without Leaving Java

This article introduces AgentScope Java, a multi‑agent development framework that lets Java developers build intelligent assistants and collaborative agents with built‑in reasoning, tool use, memory, RAG, and Spring Boot integration, providing production‑grade performance and easy setup.

AI FrameworkAgentScopeMulti-Agent
0 likes · 9 min read
AgentScope Java: Unlock Multi‑Agent AI Development Without Leaving Java
Woodpecker Software Testing
Woodpecker Software Testing
Mar 6, 2026 · Artificial Intelligence

How RAG Testing Teams Can Successfully Transform in 2024

With RAG becoming the backbone of enterprise AI, traditional API‑UI testing misses critical semantic errors, leading to high hallucination rates; this article outlines why conventional methods fail and presents a three‑pillar transformation—skill rebuilding, process reengineering, and advanced tooling—backed by real‑world case studies.

AI testingLLMMLOps
0 likes · 9 min read
How RAG Testing Teams Can Successfully Transform in 2024
Tencent Cloud Developer
Tencent Cloud Developer
Mar 5, 2026 · Artificial Intelligence

20 Cutting‑Edge RAG Optimization Techniques: From Semantic Chunking to Self‑RAG

This article systematically presents twenty practical RAG (Retrieval‑Augmented Generation) optimization methods—covering semantic chunking, chunk‑size evaluation, context‑enhanced retrieval, query transformation, re‑ranking, feedback loops, multimodal and graph RAG, hierarchical retrieval, HyDE, Self‑RAG and reinforcement‑learning‑enhanced RAG—each with clear Python code examples, advantages, limitations and ideal use‑cases.

AILLMRAG
0 likes · 57 min read
20 Cutting‑Edge RAG Optimization Techniques: From Semantic Chunking to Self‑RAG
Woodpecker Software Testing
Woodpecker Software Testing
Mar 3, 2026 · Artificial Intelligence

2026 In‑Depth Comparison of RAG Testing Tools: Finding the Most Trustworthy Solution

RAG systems have reached a trustworthiness tipping point, and in 2026 a surge of testing challenges demands new evaluation metrics; this article benchmarks twelve leading retrieval‑augmented generation testing tools across retrieval quality, generation controllability, observability, security compliance, and CI/CD integration, revealing which solutions best address real‑world finance and government use cases.

AI testingObservabilityRAG
0 likes · 8 min read
2026 In‑Depth Comparison of RAG Testing Tools: Finding the Most Trustworthy Solution
DataFunTalk
DataFunTalk
Mar 1, 2026 · Artificial Intelligence

How to Build a Production‑Ready RAG System for Enterprise Knowledge Workflows

This article explains the challenges of applying large language models in real‑world office scenarios and presents a detailed, step‑by‑step RAG (Retrieval‑Augmented Generation) solution—including architecture, offline document processing, query rewriting, hybrid retrieval, multi‑stage ranking, knowledge filtering, and prompt‑driven generation—backed by practical lessons from a Chinese mobile operator.

Enterprise AIHybrid RetrievalPrompt engineering
0 likes · 22 min read
How to Build a Production‑Ready RAG System for Enterprise Knowledge Workflows
Woodpecker Software Testing
Woodpecker Software Testing
Mar 1, 2026 · Artificial Intelligence

Optimizing RAG System Performance: A Practical Testing Guide

The article presents a systematic framework for testing and optimizing Retrieval‑Augmented Generation (RAG) systems, detailing performance‑sensitive bottlenecks, a three‑dimensional test matrix, real‑world case studies, and test‑driven engineering practices to ensure stable, fast, and accurate AI services.

AIBenchmarkingObservability
0 likes · 9 min read
Optimizing RAG System Performance: A Practical Testing Guide
AI Explorer
AI Explorer
Feb 28, 2026 · Artificial Intelligence

Explore the Awesome LLM Apps Repository: Hands‑On RAG and AI Agent Examples

The article presents the “Awesome LLM Apps” GitHub repository—over 98 000 stars and hundreds of open‑source LLM projects that showcase Retrieval‑Augmented Generation, AI agents, and multi‑agent collaborations across diverse use‑cases, and offers step‑by‑step guidance on browsing, cloning, configuring, and running these examples for developers, product managers, students, and AI enthusiasts.

AI agentsGitHubLLM
0 likes · 6 min read
Explore the Awesome LLM Apps Repository: Hands‑On RAG and AI Agent Examples
PaperAgent
PaperAgent
Feb 27, 2026 · Artificial Intelligence

How HyperRAG Uses N‑ary Hypergraphs to Overcome Binary KG Limitations

HyperRAG introduces an n‑ary hypergraph retrieval framework that replaces binary knowledge‑graph triples with hyperedges, addressing semantic fragmentation and path‑explosion while delivering superior accuracy and efficiency across multiple closed‑ and open‑domain QA benchmarks.

HyperRAGHypergraphKnowledge Graph
0 likes · 6 min read
How HyperRAG Uses N‑ary Hypergraphs to Overcome Binary KG Limitations
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Feb 26, 2026 · Artificial Intelligence

How RAG Gives Large Language Models Their Own Knowledge Base – Illustrated with Easysearch

The article explains why Retrieval‑Augmented Generation (RAG) is needed to overcome large language models' knowledge cut‑off and hallucination issues, details the offline indexing and online retrieval‑generation workflow, compares RAG with fine‑tuning, and shows how Easysearch’s hybrid search makes an effective RAG backbone.

EasysearchFine-tuningHybrid Search
0 likes · 10 min read
How RAG Gives Large Language Models Their Own Knowledge Base – Illustrated with Easysearch
DataFunTalk
DataFunTalk
Feb 26, 2026 · Artificial Intelligence

How RAG Can Overcome Large‑Model Pitfalls in Enterprise Knowledge Work

This article explains the challenges large language models face in real‑world applications, introduces Retrieval‑Augmented Generation (RAG) as a solution, and details a modular RAG architecture, its components, and practical techniques for document parsing, query rewriting, hybrid retrieval, ranking, and answer generation in an enterprise setting.

Document ParsingLLM deploymentRAG
0 likes · 22 min read
How RAG Can Overcome Large‑Model Pitfalls in Enterprise Knowledge Work
DataFunSummit
DataFunSummit
Feb 25, 2026 · Artificial Intelligence

Why RAG Fails in Production and How to Fix It: Expert Insights

This article summarizes a DataFun‑hosted roundtable where leading AI experts dissect the gap between RAG’s promise and real‑world deployment, exposing low recall, hallucinations, and cost overruns, then present systematic diagnostics, evaluation metrics, hybrid search, and engineering best practices to reliably operationalize RAG in enterprise settings.

Enterprise AIHybrid SearchLLM
0 likes · 18 min read
Why RAG Fails in Production and How to Fix It: Expert Insights
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Feb 25, 2026 · Artificial Intelligence

How Hologres Powers Fast Vector & Full‑Text Search for AI‑Driven Customer Service

The Taobao‑Tmall customer operations team built an integrated vector‑plus‑full‑text retrieval solution on Hologres, achieving millisecond‑level recall for massive unstructured knowledge bases, boosting intelligent客服, rule comparison, and sentiment analysis across multiple business scenarios.

AI RetrievalFull‑Text SearchHologres
0 likes · 12 min read
How Hologres Powers Fast Vector & Full‑Text Search for AI‑Driven Customer Service
DataFunSummit
DataFunSummit
Feb 24, 2026 · Artificial Intelligence

How Large Language Models Are Redefining Search Ranking at Tencent

This article details Tencent Search's exploration of large‑model‑driven ranking, covering the evolution from traditional keyword retrieval to RAG‑based AI search, the multi‑stage AI ranking architecture (L0‑L5), model training pipelines, distillation, synthetic data generation, and future research directions.

LLMRAGranking architecture
0 likes · 21 min read
How Large Language Models Are Redefining Search Ranking at Tencent
AI Product Manager Community
AI Product Manager Community
Feb 24, 2026 · Artificial Intelligence

Mastering AI Agents: 100 Essential Questions Across 5 Stages

This comprehensive guide walks you through five development stages of AI agents—core concepts, advanced planning, memory management, tool integration, and enterprise deployment—answering 100 practical questions that reveal definitions, architectures, best‑practice patterns, safety measures, and performance‑optimisation techniques for production‑grade agents.

AI agentsAgent ArchitectureEnterprise Deployment
0 likes · 34 min read
Mastering AI Agents: 100 Essential Questions Across 5 Stages
AI Waka
AI Waka
Feb 24, 2026 · Artificial Intelligence

Stop Fragmenting Docs: How Tree‑Based PageIndex Improves RAG Accuracy and Efficiency

The article explains why breaking documents into countless semantic fragments harms retrieval‑augmented generation, introduces PageIndex’s tree‑structured, inference‑driven approach as a superior alternative, and provides detailed setup, usage, and integration instructions for both local and production environments.

AIDocument SearchLLM
0 likes · 9 min read
Stop Fragmenting Docs: How Tree‑Based PageIndex Improves RAG Accuracy and Efficiency
AI Engineering
AI Engineering
Feb 23, 2026 · Databases

Is Zvec the ‘SQLite Moment’ for Vector Databases?

Alibaba’s newly open‑sourced Zvec brings an in‑process vector database that claims millisecond searches over billions of vectors, supports dense and sparse embeddings, installs via a single pip command, and runs on anything from laptops to edge devices, though users warn of memory limits and unverified security concerns.

PythonRAGZvec
0 likes · 3 min read
Is Zvec the ‘SQLite Moment’ for Vector Databases?
ShiZhen AI
ShiZhen AI
Feb 23, 2026 · Artificial Intelligence

Is OpenViking’s File‑System‑Based Agent Memory a Real Innovation or Just a RAG Facelift?

OpenViking, an open‑source “Agent context database” from ByteDance’s Volcano Engine, replaces flat RAG retrieval with a hierarchical file‑system model, offering layered summaries, recursive directory search, and traceable sessions, but its core still relies on vector retrieval and some features remain placeholders, making it more suited to enterprise agents than hobby projects.

Agent MemoryContext managementEnterprise AI
0 likes · 11 min read
Is OpenViking’s File‑System‑Based Agent Memory a Real Innovation or Just a RAG Facelift?
AI Waka
AI Waka
Feb 23, 2026 · Artificial Intelligence

Why Strategy Must Be a First-Class Citizen in AI Agent Context Windows

Enterprises must treat policy and decision boundaries as primary components of the context window for large‑scale AI agents, because relying solely on retrieved “relevant” paragraphs leads to unpredictable behavior, higher costs, and operational risk as agent numbers grow into the millions.

AI agentsContext EngineeringEnterprise AI
0 likes · 15 min read
Why Strategy Must Be a First-Class Citizen in AI Agent Context Windows
Data STUDIO
Data STUDIO
Feb 22, 2026 · Artificial Intelligence

Building AI Agents with LangGraph: Implementing RAG and Long‑Term Memory

This tutorial walks through adding Retrieval‑Augmented Generation (RAG) and persistent long‑term memory to a LangGraph AI agent, covering concepts, step‑by‑step code for document loading, vector store creation, prompt engineering, memory management, and best‑practice pitfalls.

AI AgentEmbeddingLangChain
0 likes · 16 min read
Building AI Agents with LangGraph: Implementing RAG and Long‑Term Memory
Open Source Tech Hub
Open Source Tech Hub
Feb 19, 2026 · Artificial Intelligence

Build Retrieval‑Augmented Generation (RAG) Agents in PHP with Neuron AI

This guide explains the fundamentals of Retrieval‑Augmented Generation, how embeddings and vector databases enable contextual AI agents, and provides step‑by‑step instructions for installing Neuron AI, writing a PHP RAG class, loading knowledge, and monitoring the agent in production.

AI agentsNeuron AIPHP
0 likes · 13 min read
Build Retrieval‑Augmented Generation (RAG) Agents in PHP with Neuron AI
AI Tech Publishing
AI Tech Publishing
Feb 19, 2026 · Artificial Intelligence

Add Long-Term Memory to Your Agent with Lightweight RAG (Lesson 5)

This tutorial shows how to equip an AI agent with long‑term memory using Retrieval‑Augmented Generation (RAG), covering the concepts of vector embeddings, FAISS indexing, building and querying a knowledge base, and providing complete Python code examples.

AgentEmbeddingFAISS
0 likes · 13 min read
Add Long-Term Memory to Your Agent with Lightweight RAG (Lesson 5)
Qborfy AI
Qborfy AI
Feb 18, 2026 · Artificial Intelligence

How Retrieval‑Augmented Generation (RAG) Supercharges LLM Answers – Complete Guide & Code

This article explains Retrieval‑Augmented Generation (RAG), detailing its offline knowledge‑base construction and online retrieval‑enhanced generation workflow, comparing it with traditional and fine‑tuned models, and providing step‑by‑step LangChain implementations, advanced techniques, and practical use‑case demos.

Hybrid SearchLangChainPrompt engineering
0 likes · 16 min read
How Retrieval‑Augmented Generation (RAG) Supercharges LLM Answers – Complete Guide & Code
Data Party THU
Data Party THU
Feb 15, 2026 · Artificial Intelligence

Why Retrieval‑Augmented Generation Is Still Fragile: Boosting Generalization and Evidence‑Based Answers

Although modern information access is faster than ever, retrieval‑augmented generation systems remain vulnerable, especially when faced with distribution shifts, making it crucial to improve both retriever generalization across domains and languages and ensure generators produce evidence‑grounded responses or refuse when evidence is lacking.

AI RobustnessRAGevidence grounding
0 likes · 3 min read
Why Retrieval‑Augmented Generation Is Still Fragile: Boosting Generalization and Evidence‑Based Answers
Code Wrench
Code Wrench
Feb 15, 2026 · Backend Development

What OpenClaw’s Rise Reveals About Building Reliable Go Agents

The article examines OpenClaw’s rapid popularity, extracts three practical engineering lessons for Go‑based AI agents, warns against three common pitfalls, and outlines a phased roadmap for easy‑agent, emphasizing local‑first data, lightweight routing, secure plugin ecosystems, and robust observability.

Agent ArchitectureGoRAG
0 likes · 12 min read
What OpenClaw’s Rise Reveals About Building Reliable Go Agents
AI Engineering
AI Engineering
Feb 14, 2026 · Industry Insights

How Cloudflare’s Markdown for Agents Redefines AI Web Scraping

Cloudflare’s new Markdown for Agents feature lets AI systems request web pages as Markdown via content negotiation, cutting token usage by up to 80%, simplifying scraping pipelines, and signaling a broader shift in how AI consumes web content.

AI web scrapingCloudflareContent negotiation
0 likes · 6 min read
How Cloudflare’s Markdown for Agents Redefines AI Web Scraping
Yunqi AI+
Yunqi AI+
Feb 13, 2026 · Artificial Intelligence

AI Engineering: Methodology and Practice for Turning Generative AI into Production Systems

The article outlines a comprehensive AI engineering methodology—including the TPMR framework, an AI‑driven development lifecycle, talent transformation from co‑pilot to AI pilot, and a practical enterprise adoption roadmap—to move generative AI and large models from experimental prototypes to production‑grade systems.

AI EngineeringAI LifecycleLLMOps
0 likes · 5 min read
AI Engineering: Methodology and Practice for Turning Generative AI into Production Systems
PMTalk Product Manager Community
PMTalk Product Manager Community
Feb 13, 2026 · Artificial Intelligence

From Zero to One: Building a Deployable RAG System for Intelligent Customer Service

This article walks product managers through the end‑to‑end design of a Retrieval‑Augmented Generation (RAG) intelligent‑customer‑service system, covering business value, knowledge‑base preparation, hybrid retrieval, prompt‑driven generation, deployment choices, monitoring metrics, and common methodological pitfalls.

AI ArchitectureIntelligent Customer ServiceKnowledge Retrieval
0 likes · 11 min read
From Zero to One: Building a Deployable RAG System for Intelligent Customer Service
DataFunTalk
DataFunTalk
Feb 11, 2026 · Artificial Intelligence

Why Most RAG Deployments Fail and How to Build a Production‑Ready RAG System

This round‑table dissects the gap between RAG’s hype and real‑world production, exposing common pitfalls such as low recall, hallucinations and cost overruns, and then delivers a systematic diagnostic framework, hybrid search strategies, fine‑tuning rules, and practical best‑practice roadmaps for building reliable enterprise RAG solutions.

Agentic RAGFine-tuningHybrid Search
0 likes · 20 min read
Why Most RAG Deployments Fail and How to Build a Production‑Ready RAG System
Java Architecture Diary
Java Architecture Diary
Feb 10, 2026 · Artificial Intelligence

Boost RAG Accuracy with LangChain4j 1.11.0 Hybrid Search on PgVector

This guide explains why pure vector retrieval often fails for version‑specific queries, introduces hybrid search that combines semantic and keyword matching, and provides step‑by‑step code and SQL examples for enabling PgVector hybrid search in LangChain4j 1.11.0.

Full‑Text SearchHybrid SearchLangChain4j
0 likes · 11 min read
Boost RAG Accuracy with LangChain4j 1.11.0 Hybrid Search on PgVector
DaTaobao Tech
DaTaobao Tech
Feb 9, 2026 · Artificial Intelligence

Boosting Trustworthiness in Retrieval‑Augmented Generation: The Trustworthy Generation Design Pattern

This article presents the Trustworthy Generation design pattern for Retrieval‑Augmented Generation (RAG) systems, analyzes four root causes of low trustworthiness—retrieval errors, content reliability, pre‑retrieval reasoning mistakes, and model hallucinations—and proposes layered solutions, citation techniques, CRAG and Self‑RAG architectures, guardrails, and practical trade‑offs.

AI SafetyGenerationLLM
0 likes · 16 min read
Boosting Trustworthiness in Retrieval‑Augmented Generation: The Trustworthy Generation Design Pattern
Tech Musings
Tech Musings
Feb 7, 2026 · Fundamentals

How to Clean and Convert a Chinese Poetry Dataset for RAG Projects

This guide explains how to clean a Chinese poetry corpus—removing special characters, filtering short entries, and converting traditional characters to simplified Chinese—using Python validation functions, batch file processing, and WSL‑based OpenCC conversion, then persisting the results as JSON.

JSONRAGdata cleaning
0 likes · 12 min read
How to Clean and Convert a Chinese Poetry Dataset for RAG Projects
SpringMeng
SpringMeng
Feb 7, 2026 · Databases

Redis’s Multithreaded Query Engine Boosts RAG Performance

Redis introduces a multithreaded query engine that keeps average latency under 10 ms while delivering up to 16× higher throughput for vector‑search workloads, enabling faster retrieval‑augmented generation (RAG) applications and outperforming pure vector databases and managed Redis services in benchmark tests.

BenchmarkMultithreaded QueryRAG
0 likes · 6 min read
Redis’s Multithreaded Query Engine Boosts RAG Performance
AI Tech Publishing
AI Tech Publishing
Feb 6, 2026 · Artificial Intelligence

2026 Large Model Engineering Roadmap: From Foundations to Production

This roadmap outlines a step‑by‑step learning path for building, optimizing, and safely deploying large language model systems, covering fundamentals, vector stores, RAG, advanced techniques, fine‑tuning, inference speed, deployment, observability, agents, and production safeguards.

DeploymentFine-tuningInference
0 likes · 5 min read
2026 Large Model Engineering Roadmap: From Foundations to Production
PaperAgent
PaperAgent
Feb 6, 2026 · Artificial Intelligence

How xMemory Cuts Tokens by 30% While Boosting Agent QA Scores Over 10 Points

The paper introduces xMemory, a hierarchical "split‑aggregate‑retrieve" framework that reduces token usage by up to 30% and improves QA performance by more than 10 points in long‑range agent conversations, outperforming traditional RAG across multiple LLMs.

Agent MemoryHierarchical RetrievalLLM
0 likes · 8 min read
How xMemory Cuts Tokens by 30% While Boosting Agent QA Scores Over 10 Points
Alibaba Cloud Developer
Alibaba Cloud Developer
Feb 4, 2026 · Artificial Intelligence

Progressive Disclosure: Making Multi‑Skill LLM Agents Efficient and Scalable

This article examines the core challenge of giving large‑language‑model agents many abilities while keeping context size limited, compares three common loading strategies, introduces a progressive‑disclosure skill mechanism with three loading layers, and details its implementation, benefits, limitations, and suitable use cases in AgentScope‑Java.

AgentContext managementJava
0 likes · 17 min read
Progressive Disclosure: Making Multi‑Skill LLM Agents Efficient and Scalable
Wuming AI
Wuming AI
Feb 3, 2026 · Artificial Intelligence

How Short‑Term vs Long‑Term Memory Works in LLM‑Powered Autonomous Agents

This article demystifies short‑term and long‑term memory in LLM‑driven autonomous agents, explaining their mechanisms, limitations, and practical implementations such as sliding windows, summarization, and vector‑based retrieval, while illustrating each concept with concrete Cherry Studio examples and relevant research references.

Autonomous AgentsCherry StudioLLM
0 likes · 7 min read
How Short‑Term vs Long‑Term Memory Works in LLM‑Powered Autonomous Agents
Architecture and Beyond
Architecture and Beyond
Feb 1, 2026 · Artificial Intelligence

5 High‑ROI Strategies to Supercharge RAG Retrieval Performance

This article outlines five practical engineering strategies—multi‑vector retrieval, manual splitting and labeling, scalar enhancement, context augmentation, and dense‑sparse vector integration—that together address common RAG retrieval bottlenecks and dramatically improve recall stability and answer quality.

BM25EngineeringLLM
0 likes · 17 min read
5 High‑ROI Strategies to Supercharge RAG Retrieval Performance
SpringMeng
SpringMeng
Jan 30, 2026 · Artificial Intelligence

Hands‑On Guide: Build AI Agent Chatbots on Windows with RagFlow

Programmer Xiao Meng walks through a complete Windows setup for AI‑powered customer service agents using RagFlow, covering prerequisites, Docker and Ollama installation, model download, container deployment, configuration of knowledge bases, and testing, based on five real‑world projects.

AI chatbotDockerOllama
0 likes · 7 min read
Hands‑On Guide: Build AI Agent Chatbots on Windows with RagFlow
Alibaba Cloud Developer
Alibaba Cloud Developer
Jan 29, 2026 · Backend Development

How to Build a BFF Agent with LangGraph: A Step‑by‑Step Guide

This article walks through integrating an AI‑powered Agent into an internal BFF platform using LangGraph, detailing the architectural choices, state‑graph implementation, prompt engineering, knowledge‑base construction, tool integration, conversation handling, and context compression techniques to enable reliable script generation, execution, and validation.

AIAgentLangGraph
0 likes · 24 min read
How to Build a BFF Agent with LangGraph: A Step‑by‑Step Guide
Old Zhang's AI Learning
Old Zhang's AI Learning
Jan 28, 2026 · Artificial Intelligence

RAG-Anything: A Universal RAG Framework for PDFs, Office Docs, and Images

RAG-Anything is an open-source, end-to-end multimodal RAG framework that ingests PDFs, Office files, images, and scientific papers, parses them with high fidelity using MinerU, builds a multimodal knowledge graph, and enables hybrid retrieval, while noting resource and dependency considerations.

AIDocument ProcessingKnowledge Base
0 likes · 7 min read
RAG-Anything: A Universal RAG Framework for PDFs, Office Docs, and Images
PaperAgent
PaperAgent
Jan 27, 2026 · Artificial Intelligence

How Agentic‑R Boosts Multi‑Turn Retrieval for LLMs by 2–3 EM Points

This article analyzes the Agentic‑R framework, which upgrades traditional single‑hop Retrieval‑Augmented Generation by introducing dual‑perspective scoring and a bidirectional flywheel, resulting in 2–3 absolute EM improvements across seven QA datasets and a 10–15% reduction in search rounds.

LLMRAGagentic search
0 likes · 6 min read
How Agentic‑R Boosts Multi‑Turn Retrieval for LLMs by 2–3 EM Points
Data STUDIO
Data STUDIO
Jan 27, 2026 · Artificial Intelligence

How Python RAG Architectures Can Tame Large‑Model Hallucinations: A Complete Guide to 9 Designs

This article explains why large‑language‑model hallucinations are risky, introduces Retrieval‑Augmented Generation (RAG) as a remedy, and walks through nine Python‑based RAG architectures—standard, conversational, corrective, adaptive, fusion, HyDE, self‑RAG, agentic, and graph RAG—detailing their workflows, code examples, strengths, weaknesses, and a decision‑making map for selecting the right design.

AI hallucinationLangChainPython
0 likes · 29 min read
How Python RAG Architectures Can Tame Large‑Model Hallucinations: A Complete Guide to 9 Designs
Efficient Ops
Efficient Ops
Jan 26, 2026 · Artificial Intelligence

Why AI Skills Will Redefine Agents Beyond MCP

This article explains how AI Skills serve as structured knowledge bases that complement, rather than replace, Model Context Protocols, enhance Retrieval‑Augmented Generation, and drive three major trends—standardized agent stacks, low‑code knowledge engineering, and the emergence of personal AI agents.

AI agentsAI ecosystemKnowledge Engineering
0 likes · 8 min read
Why AI Skills Will Redefine Agents Beyond MCP
Old Meng AI Explorer
Old Meng AI Explorer
Jan 24, 2026 · Artificial Intelligence

How UltraRAG Turns Complex RAG Deployment into a One‑Click, No‑Code Process

UltraRAG, an open‑source RAG framework from THUNLP and NEUIR, consolidates data construction, model fine‑tuning, and evaluation into a zero‑code WebUI, offering one‑click multimodal knowledge‑base creation, modular deployment, and multi‑dimensional metrics that boost retrieval accuracy by up to 30% while halving development time.

AIOpen‑SourceRAG
0 likes · 9 min read
How UltraRAG Turns Complex RAG Deployment into a One‑Click, No‑Code Process
AI Waka
AI Waka
Jan 24, 2026 · Artificial Intelligence

Building Production‑Ready AI Agents with NVIDIA’s Nemotron Stack

The article explains how NVIDIA’s Nemotron Stack combines ultra‑fast speech recognition, multimodal retrieval, and advanced safety models into a unified, low‑latency pipeline, offering practical integration code, performance insights, and deployment options for turning experimental AI agents into production‑grade services.

AI agentsContent SafetyDeployment
0 likes · 9 min read
Building Production‑Ready AI Agents with NVIDIA’s Nemotron Stack
AI Waka
AI Waka
Jan 24, 2026 · Artificial Intelligence

2026 Agentic AI Roadmap: How to Build Autonomous AI Agents

This comprehensive 2026 roadmap outlines the essential programming foundations, core agent architectures, LLM and API integrations, tool usage, memory management, RAG systems, deployment strategies, monitoring, and security practices needed to design, develop, and operate autonomous AI agents.

AI roadmapAgentic AIAutonomous Agents
0 likes · 10 min read
2026 Agentic AI Roadmap: How to Build Autonomous AI Agents
Tech Verticals & Horizontals
Tech Verticals & Horizontals
Jan 23, 2026 · Artificial Intelligence

Comparing 9 Major Agent Development Frameworks: Choosing the Best Fit

This article provides an in‑depth comparison of nine mainstream AI agent development frameworks—Pydantic AI, SmolAgents, DeepAgents, LlamaIndex, CAMEL, AutoGen, CrewAI, LangGraph, and OpenAI Agents SDK—detailing their design principles, strengths, weaknesses, typical scenarios, and guidance for selecting or mixing them in production.

Agent FrameworksComparisonLLM
0 likes · 30 min read
Comparing 9 Major Agent Development Frameworks: Choosing the Best Fit
Java Architecture Diary
Java Architecture Diary
Jan 22, 2026 · Artificial Intelligence

Unlock Java Power with Claude Agent SDK: From One‑Shot to Reactive APIs

This article explains how Claude Code, a super‑intelligent AI agent, differs from traditional code‑completion tools, introduces its official SDK limitations, and provides a comprehensive guide to the community‑driven Claude Agent SDK for Java—including one‑shot, blocking, and reactive APIs and a practical RAG‑based Q&A example.

AI AgentClaude CodeJava SDK
0 likes · 10 min read
Unlock Java Power with Claude Agent SDK: From One‑Shot to Reactive APIs
JakartaEE China Community
JakartaEE China Community
Jan 20, 2026 · Backend Development

How to Build AI‑Powered Java Apps with Helidon and LangChain4j

This article explains how Helidon 4.2 integrates the LangChain4j framework to simplify adding large‑language‑model capabilities, covering core features, Maven setup, configuration, component creation, dependency injection, annotations, custom tools, and sample applications such as a coffee‑shop assistant.

AI integrationHelidonJava
0 likes · 14 min read
How to Build AI‑Powered Java Apps with Helidon and LangChain4j
Tencent Cloud Developer
Tencent Cloud Developer
Jan 20, 2026 · Artificial Intelligence

From Transformers to Agents: A Complete Timeline of Large Language Model Evolution

This article traces the evolution of large language models from the 2017 Transformer breakthrough through successive milestones such as BERT, GPT‑3, RL‑HF alignment, multimodal extensions, open‑source alternatives, and the rise of retrieval‑augmented generation, AI agents, and emerging protocols that shape modern AI applications.

Open-source modelsPrompt engineeringRAG
0 likes · 44 min read
From Transformers to Agents: A Complete Timeline of Large Language Model Evolution
macrozheng
macrozheng
Jan 16, 2026 · Artificial Intelligence

Unlock Seamless Document Search with WeKnora: An Open‑Source LLM Retrieval Framework

WeKnora is an open‑source Tencent framework that combines large language models with retrieval‑augmented generation to enable fast, accurate semantic search and question answering across heterogeneous documents such as PDFs, Word files, and images, offering a modular, extensible architecture and easy Docker‑based deployment.

AILLMRAG
0 likes · 7 min read
Unlock Seamless Document Search with WeKnora: An Open‑Source LLM Retrieval Framework
PaperAgent
PaperAgent
Jan 15, 2026 · Artificial Intelligence

How GAG Enables Zero‑Retrieval, Single‑Token Private Knowledge Injection in LLMs

The article presents GAG, a third‑generation framework that injects proprietary domain knowledge into frozen large language models using a single token, eliminating retrieval, avoiding base model updates, and maintaining constant inference budget while delivering strong performance on private QA and public benchmarks.

AI AlignmentGAGLLM
0 likes · 8 min read
How GAG Enables Zero‑Retrieval, Single‑Token Private Knowledge Injection in LLMs
Sohu Tech Products
Sohu Tech Products
Jan 14, 2026 · Artificial Intelligence

Build a Zero‑Cost Open‑Source RAG Smart Document Q&A System from Scratch

This guide walks through building an open‑source Retrieval‑Augmented Generation (RAG) system that indexes local files with Everything, uses hybrid BM25‑vector search via Elasticsearch, and answers questions with a local LLM, covering architecture, core techniques, deployment steps, performance tweaks, and common pitfalls.

ElasticsearchLLMPython
0 likes · 11 min read
Build a Zero‑Cost Open‑Source RAG Smart Document Q&A System from Scratch
Tech Verticals & Horizontals
Tech Verticals & Horizontals
Jan 14, 2026 · Artificial Intelligence

Why Parallelism Matters: Designing Multi‑Agent Architectures for Scalable AI Systems

The article explains why parallelism is crucial for large‑scale AI systems—addressing I/O latency and reliability—by detailing core agent patterns, multi‑agent architectures, reliability strategies, and advanced retrieval‑augmented generation techniques, each illustrated with concrete Jupyter notebooks.

AI GovernanceParallelismRAG
0 likes · 6 min read
Why Parallelism Matters: Designing Multi‑Agent Architectures for Scalable AI Systems
Instant Consumer Technology Team
Instant Consumer Technology Team
Jan 13, 2026 · Artificial Intelligence

Scalable Enterprise AI Assistant: Intent Planning, Context Engineering, Data Iteration

This article details the end‑to‑end design of an enterprise AI office assistant, covering the three‑layer framework of intent planning, context engineering, and data self‑iteration, the key pain points of intent understanding, knowledge integration, and quality control, and practical architectural and implementation solutions for scalable deployment.

AI AssistantAgent CollaborationContext Engineering
0 likes · 25 min read
Scalable Enterprise AI Assistant: Intent Planning, Context Engineering, Data Iteration
Fun with Large Models
Fun with Large Models
Jan 12, 2026 · Artificial Intelligence

Why You Should Master Large‑Model Training: A Full‑Process Practical Guide

The article explains why mastering large‑model training is crucial for professionals, researchers, and enterprises, outlines the end‑to‑end pipeline—from data preparation and pre‑training to instruction fine‑tuning and RLHF alignment—compares training with RAG, and presents a structured learning roadmap.

AI agentsPyTorchRAG
0 likes · 14 min read
Why You Should Master Large‑Model Training: A Full‑Process Practical Guide
AI Algorithm Path
AI Algorithm Path
Jan 11, 2026 · Artificial Intelligence

How Vector Embeddings Enable AI to Understand Anything

This article explains the principle of vector embeddings, shows how they turn words, images, audio and other data into dense numeric vectors, compares them with one‑hot encoding, describes static and contextual models, training methods, similarity metrics, and a wide range of real‑world AI applications.

AI fundamentalsRAGembedding models
0 likes · 15 min read
How Vector Embeddings Enable AI to Understand Anything
PaperAgent
PaperAgent
Jan 9, 2026 · Artificial Intelligence

Why Traditional RAG Breaks the Chain and How SentGraph Fixes It

The article explains why traditional retrieval‑augmented generation fails in multi‑hop scenarios due to overly large chunks, introduces SentGraph’s sentence‑level graph that trims retrieval units and encodes logical relations, details offline construction and online inference steps, and shows experimental gains and remaining limitations.

LLMMulti-hop QARAG
0 likes · 7 min read
Why Traditional RAG Breaks the Chain and How SentGraph Fixes It
Old Meng AI Explorer
Old Meng AI Explorer
Jan 9, 2026 · Artificial Intelligence

How UltraRAG Turns RAG Deployment into a Zero‑Code, Multi‑Modal Powerhouse

UltraRAG, an open‑source RAG framework from THUNLP and NEUIR, eliminates data‑cooking, retrieval tuning, and fine‑tuning hurdles by offering a zero‑code Web UI, one‑click data synthesis, multimodal support, modular design, and comprehensive evaluation, enabling enterprises, developers, and researchers to launch domain‑specific RAG systems up to twice as fast with up to 30% higher accuracy.

DockerOpen-sourceRAG
0 likes · 10 min read
How UltraRAG Turns RAG Deployment into a Zero‑Code, Multi‑Modal Powerhouse
Old Meng AI Explorer
Old Meng AI Explorer
Jan 8, 2026 · Artificial Intelligence

How UltraRAG Turns RAG Development into a Zero‑Code, One‑Click Process

UltraRAG, an open‑source RAG framework from THUNLP and NEUIR, offers a zero‑code WebUI that streamlines data construction, model fine‑tuning, and multi‑dimensional evaluation, boosting retrieval accuracy by up to 30% and cutting deployment time in half for enterprise, AI developers, and researchers.

AIOpen-sourceRAG
0 likes · 10 min read
How UltraRAG Turns RAG Development into a Zero‑Code, One‑Click Process
Sohu Tech Products
Sohu Tech Products
Jan 7, 2026 · Artificial Intelligence

Master Retrieval-Augmented Generation (RAG): Concepts, Benefits, Implementation

This article explains Retrieval‑Augmented Generation (RAG), its dual‑stage architecture that combines parametric LLM knowledge with external non‑parametric data, outlines its technical evolution, discusses why it outperforms pure LLMs, and provides a step‑by‑step guide with toolchain choices, evaluation metrics, and future challenges.

AIKnowledge BaseLLM
0 likes · 14 min read
Master Retrieval-Augmented Generation (RAG): Concepts, Benefits, Implementation
Youzan Coder
Youzan Coder
Jan 6, 2026 · Artificial Intelligence

How to Build Efficient Code Search with Vector Embeddings and AST Indexing

This article explains the motivations, techniques, and practical implementations of code indexing—covering semantic vector‑based RAG pipelines and AST‑based structural analysis—to improve code navigation, AI‑assisted queries, security scanning, and development efficiency.

AI DevelopmentASTRAG
0 likes · 17 min read
How to Build Efficient Code Search with Vector Embeddings and AST Indexing
Advanced AI Application Practice
Advanced AI Application Practice
Jan 6, 2026 · Artificial Intelligence

Enterprise-Grade AI + Knowledge Graph for Automating Complex API Test Scenarios

The article details how an AI‑driven test platform combines large language models with a corporate‑level knowledge graph to automatically generate end‑to‑end API test scripts for complex business flows, overcoming context loss, dependency gaps, and scalability limits of single‑interface generation.

AIAPI testingKnowledge Graph
0 likes · 12 min read
Enterprise-Grade AI + Knowledge Graph for Automating Complex API Test Scenarios
Tech Freedom Circle
Tech Freedom Circle
Jan 5, 2026 · Artificial Intelligence

A Three‑Step Guide to Mastering RAG Semantic‑Loss Interview Questions

RAG (Retrieval‑Augmented Generation) is a hot interview topic, and many candidates stumble on semantic‑loss issues; this article dissects a real JD interview case, identifies three core shortcomings, and presents a three‑step technical solution—structure restoration, semantic splitting, and hybrid retrieval—plus a ready‑to‑use answer template.

AI InterviewDocument ParsingHybrid Search
0 likes · 25 min read
A Three‑Step Guide to Mastering RAG Semantic‑Loss Interview Questions
DataFunTalk
DataFunTalk
Jan 4, 2026 · Artificial Intelligence

How Agentic RAG and Generative Ranking Are Redefining AI Search and Recommendation

This article summarizes three cutting‑edge AI techniques—Alibaba Cloud's Agentic RAG architecture for multimodal search, Huawei Noah's large‑model‑driven recommendation system evolution, and Baidu's generative ranking (GRAB) model for ads—detailing their challenges, designs, performance gains, and practical deployment insights.

AI searchGenerative RankingMulti-Agent Architecture
0 likes · 7 min read
How Agentic RAG and Generative Ranking Are Redefining AI Search and Recommendation