Tagged articles

891 articles

Page 4 of 9

Jan 3, 2026 · Artificial Intelligence

Build Your Own AI Coding Assistant in 5 Minutes: A Hands‑On Guide

The article analyzes common pain points of traditional AI coding chats—repetitive context input, lengthy prompts, and generic answers—and demonstrates how to create a persistent, expert‑level AI coding assistant using Coco AI, with step‑by‑step configuration, example prompts, and future RAG enhancements.

AI AgentCoco AIDeepSeek

0 likes · 9 min read

Build Your Own AI Coding Assistant in 5 Minutes: A Hands‑On Guide

dbaplus Community

Jan 1, 2026 · Artificial Intelligence

Boost LLM Retrieval Accuracy with MCP – A Superior Alternative to RAG

This guide explains why traditional Retrieval‑Augmented Generation (RAG) struggles with precision, introduces the Model Context Protocol (MCP) as a standardized way for large language models to interact with external data sources, and provides step‑by‑step instructions for integrating MCP with MongoDB using Cherry Studio and VSCode +Cline.

FunctionCallMCPMongoDB

0 likes · 25 min read

Boost LLM Retrieval Accuracy with MCP – A Superior Alternative to RAG

Old Meng AI Explorer

Dec 30, 2025 · Artificial Intelligence

How UltraRAG Delivers One‑Click, No‑Code RAG Deployment and Boosts Retrieval Accuracy

UltraRAG, an open‑source RAG framework from THUNLP and NEUIR, consolidates data construction, model fine‑tuning, and evaluation into a zero‑code WebUI, offering multimodal knowledge‑base creation, one‑click optimization, robust multi‑dimensional evaluation, and micro‑service deployment that can raise retrieval accuracy by up to 30% and halve development time.

AIOpen-sourceRAG

0 likes · 10 min read

How UltraRAG Delivers One‑Click, No‑Code RAG Deployment and Boosts Retrieval Accuracy

Alibaba Cloud Big Data AI Platform

Dec 29, 2025 · Cloud Native

How a Visual Platform Cut Search Costs by 60% with All‑in‑Elasticsearch

This case study details how a major internet visual platform consolidated its log, keyword, and vector search workloads onto Alibaba Cloud Elasticsearch, eliminating three separate pipelines, reducing write‑costs by 60%, cutting storage expenses over 60%, and achieving multi‑fold performance gains through serverless scaling, FalconSeek engine optimizations, and unified monitoring.

Cost OptimizationElasticsearchRAG

0 likes · 10 min read

How a Visual Platform Cut Search Costs by 60% with All‑in‑Elasticsearch

Mingyi World Elasticsearch

Dec 28, 2025 · Artificial Intelligence

Building an Elasticsearch‑Powered RAG Q&A System: Theory and Full Code Walkthrough

This article walks through the principles of Retrieval‑Augmented Generation (RAG) and provides a complete Python implementation using Elasticsearch, covering document chunking, semantic embedding, bulk indexing, hybrid BM25‑vector search, RRF result fusion, prompt design, LLM invocation, and a practical demo.

ElasticsearchHybrid SearchPrompt engineering

0 likes · 9 min read

Building an Elasticsearch‑Powered RAG Q&A System: Theory and Full Code Walkthrough

Architecture and Beyond

Dec 27, 2025 · Artificial Intelligence

Turning Claude Skill Folders into Scalable Industry Workflows

This article explains how Anthropic's Claude Skill folders let you package domain expertise, scripts, and resources into reusable modules, differentiate Skills from prompts, combine them with MCP tools and workflows, and build a robust mixed Agent‑Workflow architecture for reliable enterprise automation.

AI agentsClaudeMCP

0 likes · 18 min read

Turning Claude Skill Folders into Scalable Industry Workflows

IT Services Circle

Dec 27, 2025 · Artificial Intelligence

From Ancient Brains to Modern AI: A Journey Through AI’s Evolution and Future

This comprehensive guide traces AI from the origins of human intelligence and the first computers, through the birth of artificial intelligence, the rise of machine learning and large language models, to the emergence of agents, multimodal systems, and the challenges that lie ahead.

AI historyRAGTransformer

0 likes · 39 min read

From Ancient Brains to Modern AI: A Journey Through AI’s Evolution and Future

AI Architecture Hub

Dec 27, 2025 · Artificial Intelligence

How GraphRAG Turns Knowledge Graphs into Smarter Retrieval for LLMs

GraphRAG extends traditional Retrieval‑Augmented Generation by building a knowledge graph from documents, extracting entities and relationships, performing community detection, and supporting both local and global searches, offering detailed step‑by‑step guidance, code examples, configuration tips, and a comparison with classic RAG approaches.

GraphRAGKnowledge GraphLLM

0 likes · 28 min read

How GraphRAG Turns Knowledge Graphs into Smarter Retrieval for LLMs

360 Tech Engineering

Dec 26, 2025 · Artificial Intelligence

15 Chunking Strategies to Supercharge Retrieval‑Augmented Generation

This article presents fifteen practical chunking techniques—ranging from line‑by‑line and fixed‑size chunking to semantic and hierarchical methods—explaining their principles, ideal use‑cases, concrete input examples, chunk outputs, and key advantages or cautions for improving Retrieval‑Augmented Generation with large language models.

AIData RetrievalLLM

0 likes · 28 min read

15 Chunking Strategies to Supercharge Retrieval‑Augmented Generation

Alibaba Cloud Developer

Dec 26, 2025 · Artificial Intelligence

How to Build a Fully Automated Knowledge‑Extraction Pipeline for AI Agents with Python

This article presents a complete end‑to‑end pipeline that automatically extracts, generalizes, incrementally updates, and vector‑syncs knowledge from diverse sources such as tickets, documents, and SQL code, turning the traditionally labor‑intensive knowledge‑base construction for agents into a low‑effort, continuously maintainable Python‑driven solution.

LLMPythonRAG

0 likes · 15 min read

How to Build a Fully Automated Knowledge‑Extraction Pipeline for AI Agents with Python

Architect

Dec 25, 2025 · Artificial Intelligence

How GraphRAG Boosts Retrieval Accuracy with Knowledge Graphs – A Complete Guide

This article explains why traditional RAG suffers from hallucinations, introduces GraphRAG’s knowledge‑graph‑based approach, walks through its indexing and query pipelines—including text splitting, entity‑relation extraction, graph construction, community detection, and local vs. global retrieval—provides practical setup commands, Neo4j visualization steps, and compares its performance with classic RAG.

EmbeddingGraphRAGKnowledge Graph

0 likes · 27 min read

How GraphRAG Boosts Retrieval Accuracy with Knowledge Graphs – A Complete Guide

360 Tech Engineering

Dec 25, 2025 · Artificial Intelligence

Choosing Between Vector Knowledge Bases and Knowledge Graphs for RAG

This article explains the definitions, differences, and integration trends of Knowledge Bases and Knowledge Graphs within Retrieval‑Augmented Generation, helping developers decide which technology best fits their AI system requirements.

AI RetrievalGraphRAGHybrid Search

0 likes · 9 min read

Choosing Between Vector Knowledge Bases and Knowledge Graphs for RAG

Open Source Tech Hub

Dec 25, 2025 · Artificial Intelligence

Explore Symfony AI: Bringing Native AI Capabilities to PHP

Symfony AI v0.1.0 launches with a suite of PHP components that let developers integrate OpenAI‑style models, vector stores, autonomous agents, and chat persistence directly into Symfony apps, offering easy installation, rich demos, and a dedicated website for hands‑on experimentation.

AIOpenAIPHP

0 likes · 6 min read

Explore Symfony AI: Bringing Native AI Capabilities to PHP

AI Architecture Hub

Dec 24, 2025 · Artificial Intelligence

From LLMs to Autonomous Agents: The Three Evolution Stages of AI

This article explains the three evolutionary stages of AI—from large language models that generate text, through workflow‑enhanced systems using retrieval‑augmented generation, to fully autonomous agents capable of self‑directed decision‑making—while detailing the four core technologies that power each stage.

AI evolutionAgentEmbedding

0 likes · 9 min read

From LLMs to Autonomous Agents: The Three Evolution Stages of AI

PMTalk Product Manager Community

Dec 24, 2025 · Artificial Intelligence

Why AI Hallucinates and How Product Managers Can Tame It

The article explains the internal and external causes of AI hallucinations, examines how pre‑training data flaws and fine‑tuning choices amplify them, and presents a five‑pronged technical toolbox—including RAG, prompt engineering, chain‑of‑thought, self‑verification, and safety APIs—plus risk‑based product strategies for different industries.

AI hallucinationPrompt engineeringRAG

0 likes · 12 min read

Why AI Hallucinates and How Product Managers Can Tame It

Tencent Cloud Developer

Dec 24, 2025 · Backend Development

How IMA Scaled Its AI Knowledge Base from Monolith to Micro‑services

This article walks through the end‑to‑end design of IMA's AI‑driven knowledge base, covering its definition, core business flow, architecture evolution, data ingestion pipelines, management challenges, asynchronous processing, permission modeling, and the business value demonstrated by the prototype.

AI ArchitectureData ConsistencyKnowledge Base

0 likes · 14 min read

How IMA Scaled Its AI Knowledge Base from Monolith to Micro‑services

DataFunTalk

Dec 23, 2025 · Artificial Intelligence

Unlocking AI Search: Agentic RAG, LLM‑Powered Recommendations, and Generative Ranking Explained

This article summarizes three cutting‑edge AI search and recommendation techniques—Alibaba Cloud's Agentic RAG architecture, Huawei's LLM‑enhanced recommendation system evolution, and Baidu's generative ranking model GRAB—detailing their challenges, design choices, performance gains, and practical deployment insights.

AIGenerative RankingMulti‑Agent

0 likes · 7 min read

Unlocking AI Search: Agentic RAG, LLM‑Powered Recommendations, and Generative Ranking Explained

Architect's Alchemy Furnace

Dec 21, 2025 · Artificial Intelligence

Deploy and Explore Open WebUI: A Feature‑Rich Self‑Hosted AI Platform

Open WebUI is a self‑hosted, extensible AI platform that runs fully offline, supports multiple LLM back‑ends such as Ollama and OpenAI‑compatible APIs, offers built‑in RAG, role‑based access, multi‑model chat, markdown/LaTeX, image generation, and provides detailed Docker, pip, and Kubernetes installation guides with ready‑to‑run commands.

AI PlatformDockerLLM

0 likes · 11 min read

Deploy and Explore Open WebUI: A Feature‑Rich Self‑Hosted AI Platform

Architecture and Beyond

Dec 21, 2025 · Artificial Intelligence

Designing RAG for Industry‑Specific AI Agents: From Data to Safe Execution

This article explains how to build Retrieval‑Augmented Generation (RAG) for industry‑specific AI agents, covering required capabilities, metrics, data sources, indexing, hybrid retrieval, decision‑point integration, layered output, permission controls, rollout strategies, and common pitfalls to ensure reliable and secure automation.

Agent DesignKnowledge RetrievalRAG

0 likes · 17 min read

Designing RAG for Industry‑Specific AI Agents: From Data to Safe Execution

Mingyi World Elasticsearch

Dec 20, 2025 · Artificial Intelligence

How to Build an Enterprise‑Grade Intelligent Document QA System with Everything plus RAG

This article walks through the need for fast, accurate answers from massive document collections, compares plain keyword search and pure LLM chat, and presents a hybrid Retrieval‑Augmented Generation solution built with open‑source components, detailing architecture, hybrid retrieval, prompt engineering, deployment, performance tuning, and common pitfalls.

ElasticsearchHybrid RetrievalPrompt engineering

0 likes · 12 min read

How to Build an Enterprise‑Grade Intelligent Document QA System with Everything plus RAG

Architect's Journey

Dec 19, 2025 · Artificial Intelligence

Why Context Engineering Is the Hottest AI Skill in 2025

The article explains how context engineering—building a dynamic system that supplies AI with user intent, dialogue history, long‑term memory, external knowledge and tool definitions—outperforms traditional prompt engineering, eliminates hallucinations, and enables AI to complete complex, end‑to‑end tasks.

AIAI agentsContext Engineering

0 likes · 8 min read

Why Context Engineering Is the Hottest AI Skill in 2025

PaperAgent

Dec 18, 2025 · Artificial Intelligence

Can Ontology‑Aware KG‑RAG Double Table QA Performance on Industrial Standards?

This article presents an ontology‑aware knowledge‑graph RAG framework that transforms complex, hierarchical industrial standard documents into a graph of sections, atomic propositions, and refined triples, achieving nearly double F1 scores on table‑based QA tasks and robust performance on long documents.

Knowledge GraphLLMOntology

0 likes · 6 min read

Can Ontology‑Aware KG‑RAG Double Table QA Performance on Industrial Standards?

Architects' Tech Alliance

Dec 17, 2025 · Artificial Intelligence

Mastering Retrieval‑Augmented Generation: From Theory to Scalable Deployment

This guide explains how Retrieval‑Augmented Generation (RAG) overcomes LLM knowledge staleness, hallucination, and domain‑adaptation challenges by combining external knowledge bases with real‑time retrieval, and provides detailed architecture, optimization techniques, engineering practices, monitoring, cost‑control, and future trends for building production‑grade RAG systems.

AICloudflareLLM

0 likes · 15 min read

Mastering Retrieval‑Augmented Generation: From Theory to Scalable Deployment

Aikesheng Open Source Community

Dec 16, 2025 · Databases

How to Build Predictive and Generative AI Apps with MySQL AI

MySQL AI adds built‑in LLMs, embeddings, vector storage, AutoML and a graphical console to on‑premise MySQL, enabling developers to create predictive and generative AI applications—including fraud detection, semantic search, RAG and NL2SQL—without external vector databases or GPUs.

AutoMLMySQL AIPredictive AI

0 likes · 15 min read

How to Build Predictive and Generative AI Apps with MySQL AI

JakartaEE China Community

Dec 16, 2025 · Artificial Intelligence

Build a Retrieval‑Augmented Generation (RAG) System with Langchain4j and Ollama 3

This guide walks through the importance of Retrieval‑Augmented Generation, outlines the core Langchain4j and Ollama 3 components, and provides a complete Java example—including Maven setup, document ingestion, embedding creation, similarity search, prompt construction, and response generation—to demonstrate a functional RAG pipeline.

EmbeddingJavaLLM

0 likes · 9 min read

Build a Retrieval‑Augmented Generation (RAG) System with Langchain4j and Ollama 3

Architect

Dec 14, 2025 · Artificial Intelligence

Mastering RAG with Spring AI: Build a Retrieval‑Augmented Generation System from Scratch

This article explains the background, principles, and step‑by‑step implementation of Retrieval‑Augmented Generation (RAG) using Spring AI, covering embedding models, vector databases, chunking strategies, indexing algorithms, similarity metrics, re‑ranking, prompt templates, and a complete Java code example.

EmbeddingJavaRAG

0 likes · 32 min read

Mastering RAG with Spring AI: Build a Retrieval‑Augmented Generation System from Scratch

PaperAgent

Dec 14, 2025 · Artificial Intelligence

GPT‑5.2 vs Gemini 3 Pro: Coding Tests, NeurIPS 2025 Paper Insights, and RAG Refactor

The article evaluates GPT‑5.2 and Gemini 3 Pro on real‑world coding tasks, analyzes trends from the 6000 papers presented at NeurIPS 2025, and demonstrates how to extract and refactor the tree‑building component of the open‑source RAPTOR RAG system into an independent module.

AI model evaluationCode RefactoringGPT-5.2

0 likes · 5 min read

GPT‑5.2 vs Gemini 3 Pro: Coding Tests, NeurIPS 2025 Paper Insights, and RAG Refactor

Architect's Alchemy Furnace

Dec 13, 2025 · Artificial Intelligence

Explore 100+ Open‑Source LLM Apps and How to Run Them Locally

This guide presents a curated collection of over a hundred open‑source large language model applications—including AI agents, RAG pipelines, and domain‑specific tools—explains their categories, showcases example projects, and provides step‑by‑step instructions to clone and run them on your own machine.

AI agentsGitHubLLM

0 likes · 8 min read

Explore 100+ Open‑Source LLM Apps and How to Run Them Locally

PMTalk Product Manager Community

Dec 11, 2025 · Artificial Intelligence

AI for Product Managers: Master RAG, Fine‑Tuning, and Agents in One Guide

This article explains how product managers can demystify and apply the three core AI techniques—Retrieval‑Augmented Generation, fine‑tuning, and agents—using the Dify platform, showing step‑by‑step setups, practical benefits, cost considerations, and when to choose each approach.

AIDifyFine-tuning

0 likes · 12 min read

AI for Product Managers: Master RAG, Fine‑Tuning, and Agents in One Guide

Fun with Large Models

Dec 7, 2025 · Frontend Development

Building a Multimodal RAG Front‑End with Trae Solo: A Vibe‑Coding Guide

This article walks through a three‑step Vibe‑Coding workflow—structured prompt creation, prompt optimization with DeepSeek, and precise bug‑fix guidance—to automatically generate, refine, and extend a React + TypeScript front‑end for a multimodal RAG system using Trae Solo, covering architecture, streaming chat, and PDF citation features.

AI programmingLangChainRAG

0 likes · 22 min read

Building a Multimodal RAG Front‑End with Trae Solo: A Vibe‑Coding Guide

dbaplus Community

Dec 7, 2025 · Artificial Intelligence

How AI Agents Can Revolutionize Data Governance: A Step‑by‑Step Blueprint

This article explains how AI agents transform traditional data governance by introducing a four‑layer perception‑decision‑execution‑learning architecture, detailing the required technologies, tool integrations, code examples, deployment steps, team roles, security safeguards, and practical rollout strategies for enterprises seeking automated, intelligent data management.

AI AgentData GovernanceData Quality

0 likes · 10 min read

How AI Agents Can Revolutionize Data Governance: A Step‑by‑Step Blueprint

Old Meng AI Explorer

Dec 5, 2025 · Industry Insights

How Bisheng Turns Enterprise AI Deployment into a Zero‑Code, One‑Stop Process

Bisheng, an open‑source LLM DevOps platform, solves the fragmented, high‑threshold, and compliance‑heavy challenges of enterprise AI by offering a zero‑code visual workflow, all‑in‑one RAG/Agent capabilities, strict security controls, and high‑precision document parsing, enabling rapid, secure AI application rollout.

AI PlatformLLM DevOpsRAG

0 likes · 11 min read

How Bisheng Turns Enterprise AI Deployment into a Zero‑Code, One‑Stop Process

Instant Consumer Technology Team

Dec 4, 2025 · Artificial Intelligence

How to Build an AI‑Powered Jira Assistant with LangGraph, RAG, and MCP

This article walks through the design and implementation of an AI‑driven Jira assistant that uses LangGraph as the agent brain, Retrieval‑Augmented Generation for knowledge access, and a Model Context Protocol (MCP) server to execute Jira operations, complete with architecture diagrams, code snippets, and practical use cases.

AI AgentJira AutomationLangGraph

0 likes · 12 min read

How to Build an AI‑Powered Jira Assistant with LangGraph, RAG, and MCP

DataFunTalk

Dec 4, 2025 · Artificial Intelligence

Agentic RAG, LLM‑Powered Recommendation, and Generative Ranking: Cutting‑Edge AI Search Techniques

This article reviews three advanced AI search solutions—Alibaba Cloud's Agentic RAG architecture for multi‑modal retrieval, Huawei's LLM‑enhanced recommendation system with factorized prompting, and Baidu's generative ranking model GRAB—detailing their technical challenges, design choices, performance gains, and deployment insights.

AI searchBaiduGenerative Ranking

0 likes · 8 min read

Agentic RAG, LLM‑Powered Recommendation, and Generative Ranking: Cutting‑Edge AI Search Techniques

Baidu MEUX

Dec 3, 2025 · User Experience Design

Boost User Research with AI: Automating Short Feedback Classification & Long‑Form Insight Extraction

This article explains how AI large‑language models can automate short user‑feedback classification and extract insights from long interview texts, offering practical prompting tips, fine‑tuning strategies, and Retrieval‑Augmented Generation workflows to make user research faster, more accurate, and less labor‑intensive.

AIFeedback ClassificationPrompt engineering

0 likes · 11 min read

Boost User Research with AI: Automating Short Feedback Classification & Long‑Form Insight Extraction

macrozheng

Dec 3, 2025 · Databases

How Redis’s New Multithreaded Query Engine Boosts Vector Search Performance

Redis has introduced a multithreaded query engine that dramatically reduces latency and increases throughput—up to 16×—for vector similarity searches, enabling vertical scaling and better support for real‑time RAG applications compared to traditional single‑threaded architectures and competing vector databases.

RAGdatabase scalingmultithreading

0 likes · 6 min read

How Redis’s New Multithreaded Query Engine Boosts Vector Search Performance

Yiche Technology

Dec 3, 2025 · Artificial Intelligence

How Milvus Powered a Scalable AI Assistant for Car Queries with Vector Search

This article details how an automotive AI assistant migrated from keyword matching to a Milvus‑based vector retrieval system, overcoming semantic gaps, scaling to millions of daily queries, optimizing indexing, introducing multi‑vector and sparse‑vector search, and building a real‑time RAG pipeline with Flink.

AI AssistantMilvusRAG

0 likes · 12 min read

How Milvus Powered a Scalable AI Assistant for Car Queries with Vector Search

Data STUDIO

Dec 3, 2025 · Artificial Intelligence

Pixeltable: One Table to Power Multimodal AI with Declarative Python

Pixeltable introduces a unified table abstraction that treats images, text, embeddings and model outputs as columns, enabling declarative multimodal AI pipelines, eliminating glue code, supporting built‑in vector indexing, versioned experiments, extensible custom functions, and a concise 30‑line RAG implementation.

Multimodal AIPixeltablePython

0 likes · 15 min read

Pixeltable: One Table to Power Multimodal AI with Declarative Python

DataFunTalk

Dec 2, 2025 · Artificial Intelligence

How Agentic RAG, LLM‑Powered Recommendation, and Generative Ranking Are Redefining AI Search

This article reviews three cutting‑edge AI search and recommendation techniques—Alibaba Cloud's Agentic RAG architecture, Huawei Noah's LLM‑enhanced recommendation pipeline, and Baidu's GRAB generative ranking model—detailing their design challenges, multi‑modal retrieval strategies, performance gains, and real‑world deployment results.

AI agentsAI searchGenerative Ranking

0 likes · 8 min read

How Agentic RAG, LLM‑Powered Recommendation, and Generative Ranking Are Redefining AI Search

AsiaInfo Technology: New Tech Exploration

Dec 2, 2025 · Artificial Intelligence

How LLMs Can Revolutionize Test Case Generation: Methods, Benefits, and Challenges

This article examines the shortcomings of manual test case creation, explains how large language models (LLMs) can dramatically improve efficiency, coverage, consistency, and knowledge sharing in software testing, outlines the key capabilities required, and presents a detailed end‑to‑end solution with practical steps, evaluation metrics, and future outlook.

AI automationKnowledge BaseLLM

0 likes · 20 min read

How LLMs Can Revolutionize Test Case Generation: Methods, Benefits, and Challenges

Instant Consumer Technology Team

Dec 1, 2025 · Artificial Intelligence

Understanding AIGC, RAG, Function Calling, and the MCP Protocol: A Practical AI Guide

This article explains the fundamentals of AI‑generated content (AIGC), the Retrieval‑Augmented Generation (RAG) technique, Function Calling, autonomous agents, and the Model Context Protocol (MCP), highlighting their evolution, technical workflows, limitations, and real‑world examples for developers.

AIAIGCFunction Calling

0 likes · 19 min read

Understanding AIGC, RAG, Function Calling, and the MCP Protocol: A Practical AI Guide

Fun with Large Models

Nov 30, 2025 · Artificial Intelligence

Multimodal RAG with LangChain: PDF Parsing, Chunking, and Citation Guide

This article walks through building a LangChain‑based multimodal RAG system that parses PDFs (both native and scanned), splits them into semantic chunks, stores embeddings in a vector database, and generates answers with precise source citations, complete with code samples and API integration.

FastAPILangChainMultimodal AI

0 likes · 20 min read

Multimodal RAG with LangChain: PDF Parsing, Chunking, and Citation Guide

Fun with Large Models

Nov 27, 2025 · Artificial Intelligence

Mastering Coze Knowledge Base: A Step‑by‑Step Low‑Code Agent Guide

This article provides a comprehensive, hands‑on guide to Coze's knowledge base, covering its core concepts, key features, practical use‑case scenarios, detailed creation steps, configuration options, prompt design, testing methods, and a comparison with variables, memory, and databases.

CozeKnowledge BasePrompt engineering

0 likes · 15 min read

Mastering Coze Knowledge Base: A Step‑by‑Step Low‑Code Agent Guide

Old Meng AI Explorer

Nov 27, 2025 · Artificial Intelligence

How UltraRAG Turns RAG Deployment into a Zero‑Code, One‑Click Process

UltraRAG, an open‑source RAG framework co‑developed by Tsinghua and NEUIR, offers a zero‑code WebUI that streamlines data construction, model fine‑tuning, and multi‑dimensional evaluation, boosting retrieval accuracy by up to 30% and cutting deployment time by half across legal, medical, and research use cases.

AIOpen-sourceRAG

0 likes · 11 min read

How UltraRAG Turns RAG Deployment into a Zero‑Code, One‑Click Process

Java Tech Enthusiast

Nov 26, 2025 · Artificial Intelligence

How LLM, RAG, and AI Agents Work Together

The article clarifies how large language models (LLM), retrieval‑augmented generation (RAG), and AI agents complement each other, describing the brain‑like reasoning of LLMs, the dynamic knowledge access provided by RAG, and the autonomous action capabilities of AI agents, plus practical usage scenarios.

AI AgentLLMRAG

0 likes · 7 min read

How LLM, RAG, and AI Agents Work Together

JavaGuide

Nov 26, 2025 · Interview Experience

Why Campus Recruitment Often Leads to Small‑Mid‑Size Companies: A Student’s Interview Journey

A third‑year student recounts four distinct campus interviews—from a state‑owned big‑data firm to an AI startup—detailing test formats, technical questions, and personal growth, and concludes that most graduates ultimately join the countless small‑mid‑size companies that keep society running.

AIBackendJava

0 likes · 9 min read

Why Campus Recruitment Often Leads to Small‑Mid‑Size Companies: A Student’s Interview Journey

Alibaba Cloud Developer

Nov 26, 2025 · Artificial Intelligence

Unlocking AI-Powered Customer Service: From RAG to Deep Evaluation and Optimization

This article explores how the rapid growth of large language models reshapes intelligent customer service, detailing the evolution from rule‑based NLP bots to Retrieval‑Augmented Generation (RAG) and AI‑native agents, and presents a comprehensive framework for evaluating, diagnosing, and continuously improving chatbot performance using LLM‑driven metrics and context engineering.

AIContext EngineeringLLM evaluation

0 likes · 46 min read

Unlocking AI-Powered Customer Service: From RAG to Deep Evaluation and Optimization

PMTalk Product Manager Community

Nov 25, 2025 · Product Management

Avoid the 3 Common AI Product Management Pitfalls: Prompt Engineering, RAG, and Fine‑Tuning

The article examines why AI product managers repeatedly fall into three traps—over‑relying on prompt engineering, blindly adopting Retrieval‑Augmented Generation, or costly fine‑tuning—by presenting real‑world failures, debunking myths, and offering a five‑layer decision framework with cost, data, resource, and risk analysis to choose the right solution.

AI product managementPrompt engineeringRAG

0 likes · 24 min read

Avoid the 3 Common AI Product Management Pitfalls: Prompt Engineering, RAG, and Fine‑Tuning

Architecture and Beyond

Nov 22, 2025 · Artificial Intelligence

Why Pure AI Agents Fail in Enterprise and How Workflow‑Agent Hybrids Fix It

The article explains that relying solely on autonomous AI agents in industry is impractical, outlines the three major pitfalls of pure‑agent approaches, and details how combining agents with structured workflows, RAG, and multi‑level architectures delivers reliable, cost‑effective enterprise solutions.

AIAgentAutomation

0 likes · 15 min read

Why Pure AI Agents Fail in Enterprise and How Workflow‑Agent Hybrids Fix It

Wu Shixiong's Large Model Academy

Nov 22, 2025 · Artificial Intelligence

Why Your RAG System Slows Down Over Time and How to Fix It

The article explains why a production Retrieval‑Augmented Generation (RAG) system becomes slower as it runs—due to growing embedding costs, expanding vector databases, heavier re‑ranking, and larger prompts—and provides concrete engineering optimizations such as batching, async concurrency, caching, partitioned retrieval, HNSW tuning, replica scaling, answer caching, and prompt sparsification to keep performance stable.

AI EngineeringPerformance OptimizationRAG

0 likes · 10 min read

Why Your RAG System Slows Down Over Time and How to Fix It

JD Tech Talk

Nov 21, 2025 · Artificial Intelligence

Mastering Chunking Strategies for Retrieval‑Augmented Generation

This article explains why effective chunking is crucial for RAG performance, compares seven major chunking strategies—including fixed‑size, semantic, recursive, document‑structure, agent‑driven, sentence, and paragraph methods—and offers practical guidance on selecting and optimizing chunks for real‑world AI applications.

AIRAGRetrieval Augmented Generation

0 likes · 10 min read

Mastering Chunking Strategies for Retrieval‑Augmented Generation

JD Cloud Developers

Nov 21, 2025 · Artificial Intelligence

Why Chunking Strategy Makes or Breaks RAG Performance

This article explains how different chunking methods—fixed size, semantic, recursive, document‑based, agent‑driven, sentence‑level, and paragraph‑level—affect Retrieval‑Augmented Generation, offering practical guidelines, metrics, and optimization tips for real‑world deployments.

AIRAGchunking

0 likes · 9 min read

Why Chunking Strategy Makes or Breaks RAG Performance

Wu Shixiong's Large Model Academy

Nov 21, 2025 · Artificial Intelligence

How to Build a Multi‑Layer Cache for Dynamic RAG Systems

This article explains why dynamic Retrieval‑Augmented Generation (RAG) requires a layered caching strategy rather than simple result caching, details a four‑level cache architecture—including embedding, search, answer, and pipeline caches—provides practical key‑generation and TTL guidelines, and outlines dirty‑data defenses to keep caches consistent and performant.

AI EngineeringLLMRAG

0 likes · 10 min read

How to Build a Multi‑Layer Cache for Dynamic RAG Systems

Baidu Maps Tech Team

Nov 19, 2025 · Artificial Intelligence

Boosting Socio‑Economic Q&A: The ARAG Framework Merges Structured Data Analysis with RAG

ARAG introduces a novel Retrieval‑Augmented Generation framework that tightly integrates LLM‑driven structured data analysis with unstructured information retrieval, addressing the “structured + unstructured” reasoning gap in socio‑economic queries, and demonstrates superior accuracy, robustness, and hallucination resistance through extensive evaluations.

LLMRAGSocio-economic AI

0 likes · 12 min read

Boosting Socio‑Economic Q&A: The ARAG Framework Merges Structured Data Analysis with RAG

Wu Shixiong's Large Model Academy

Nov 19, 2025 · Artificial Intelligence

How to Build a Reliable Dynamic Incremental RAG Pipeline for Real‑Time Data

This article explains why dynamic incremental RAG is harder than static RAG, identifies the three main points where recall accuracy breaks, and presents a three‑stage engineering pipeline—including a quality‑control layer, two‑stage retrieval, and reference‑injection generation—to keep real‑time data retrieval both accurate and robust.

AIDynamic DataRAG

0 likes · 13 min read

How to Build a Reliable Dynamic Incremental RAG Pipeline for Real‑Time Data

Alibaba Cloud Developer

Nov 19, 2025 · Artificial Intelligence

Building an AI-Powered Proofreading Agent for Media: Architecture, Prompt Engineering, and Evaluation

This article details a practical case study of designing, implementing, and evaluating an AI-driven proofreading agent for a media client, covering background challenges, a three‑layer architecture, prompt engineering techniques, RAG knowledge‑base construction, model selection, fine‑tuning, automated metrics, and lessons learned.

AIModel EvaluationProofreading

0 likes · 26 min read

Building an AI-Powered Proofreading Agent for Media: Architecture, Prompt Engineering, and Evaluation

JakartaEE China Community

Nov 18, 2025 · Artificial Intelligence

How to Build a Retrieval‑Augmented Generation (RAG) System with Langchain4j and Ollama 3

This article explains why Retrieval‑Augmented Generation improves LLM accuracy, outlines the key Langchain4j and Ollama3 components, and provides a step‑by‑step Java example—including Maven setup, document ingestion, embedding, similarity search, prompt creation, and response generation—to demonstrate a functional RAG pipeline.

EmbeddingJavaLLM

0 likes · 8 min read

How to Build a Retrieval‑Augmented Generation (RAG) System with Langchain4j and Ollama 3

Alipay Experience Technology

Nov 18, 2025 · Mobile Development

Boosting KMP Native Cross‑Platform Development with AI Agents: Real‑World Practices

This article details how Alipay's engineering team built an AI‑Agent‑powered coding assistant for Kotlin Multiplatform (KMP) native cross‑platform development, covering architecture, UI generation from designs and images, RAG‑based knowledge retrieval, crash analysis, and future directions for AI‑driven software engineering.

AI AgentCompose MultiplatformKMP

0 likes · 20 min read

Boosting KMP Native Cross‑Platform Development with AI Agents: Real‑World Practices

Wu Shixiong's Large Model Academy

Nov 16, 2025 · Artificial Intelligence

How to Slash RAG First‑Token Latency: Practical Engineering Strategies

This guide breaks down the three layers of a RAG pipeline—embedding, vector retrieval, and system architecture—and provides concrete engineering tactics such as batch embedding, async concurrency, caching, ANN indexing, partitioning, connection pooling, and async pipelines to dramatically reduce Time‑to‑First‑Token latency.

Async PipelineEmbeddingRAG

0 likes · 10 min read

How to Slash RAG First‑Token Latency: Practical Engineering Strategies

Wu Shixiong's Large Model Academy

Nov 14, 2025 · Artificial Intelligence

How to Engineer Reliable Function Calls for LLM Agents: An End‑to‑End Framework

This article explains why function‑call accuracy is critical for LLM agents, identifies four common failure causes, and presents a systematic, five‑step engineering framework—including dynamic routing, chain‑of‑thought planning, result validation, memory injection, and log‑driven optimization—backed by concrete examples and quantitative improvements.

Function CallingInterview PreparationLLM

0 likes · 10 min read

How to Engineer Reliable Function Calls for LLM Agents: An End‑to‑End Framework

DataFunTalk

Nov 11, 2025 · Artificial Intelligence

How Alibaba Cloud’s AI Search Redefines Vector Retrieval and RAG

This article outlines Alibaba Cloud AI Search’s evolution, detailing its dual product lines—enhanced Elasticsearch and self‑developed OpenSearch—key Agentic RAG technologies, serverless architecture, vector and LLM‑driven search capabilities, and future directions in AI‑powered search.

AI searchAlibaba CloudElasticsearch

0 likes · 4 min read

How Alibaba Cloud’s AI Search Redefines Vector Retrieval and RAG

Rare Earth Juejin Tech Community

Nov 11, 2025 · Artificial Intelligence

How Spring AI Turns Java Backend into a Powerful AI Service Layer

This article reviews Spring AI's evolution, architecture, and key features—including unified APIs, prompt engineering, streaming, RAG, multi‑modal support, and agent workflows—showing how Java back‑ends can efficiently expose AI capabilities without handling low‑level model logic.

AI integrationAgentChatClient

0 likes · 12 min read

How Spring AI Turns Java Backend into a Powerful AI Service Layer

DaTaobao Tech

Nov 10, 2025 · Artificial Intelligence

How Tmall’s AI Transforms Test Case Generation for Faster, Smarter QA

This article details Tmall's technology team's deep AI‑driven testing practice, outlining industry challenges, the need for intelligent test case generation, and a comprehensive strategy that combines prompt engineering, RAG‑based knowledge bases, and platform integration to boost coverage, reduce manual effort, and accelerate release cycles.

AI testingKnowledge BasePrompt engineering

0 likes · 10 min read

How Tmall’s AI Transforms Test Case Generation for Faster, Smarter QA

JavaEdge

Nov 10, 2025 · Artificial Intelligence

What AI Trends Will Dominate the Next Year? A Deep Dive into Physical AI, Agents, and RAG

This report outlines emerging AI trends for the coming year, highlighting Physical AI, AI agents, multimodal models, the MCP standard, and the growing role of AI‑driven DevOps, while also discussing security implications and adoption stages across the technology lifecycle.

AI DevOpsAI agentsAI trends

0 likes · 12 min read

What AI Trends Will Dominate the Next Year? A Deep Dive into Physical AI, Agents, and RAG

Tech Stroll Journey

Nov 9, 2025 · Backend Development

How to Deploy AnythingLLM Locally with Docker for Enterprise Document RAG

This guide walks through setting up a Ubuntu VM, installing Docker, pulling the AnythingLLM image, configuring storage, launching the container, and using it to ingest and query local documents with a DeepSeek‑R1 model.

AI deploymentAnythingLLMDeepSeek

0 likes · 6 min read

How to Deploy AnythingLLM Locally with Docker for Enterprise Document RAG

Data Party THU

Nov 9, 2025 · Artificial Intelligence

Mastering Chunking Strategies for Effective RAG: Fixed, Recursive, Semantic, Structured, and Delayed

This article walks through the core RAG pipeline, explains why chunking is the linchpin of retrieval quality, and provides detailed definitions, trade‑offs, and implementation examples for five chunking techniques—fixed, recursive, semantic, structure‑aware, and delayed—so you can choose the right approach for any document‑heavy AI application.

AILLMRAG

0 likes · 10 min read

Mastering Chunking Strategies for Effective RAG: Fixed, Recursive, Semantic, Structured, and Delayed

DataFunSummit

Nov 8, 2025 · Artificial Intelligence

How Tencent’s LLM Powers Real‑World AI Solutions with RAG and Agents

This article examines Tencent's large language model deployments across diverse business scenarios, detailing core use cases such as content generation, intelligent customer service, and role‑playing, while deep‑diving into the RAG, GraphRAG, and Agent technologies that enable smarter, more reliable AI applications.

AIAgentLLM

0 likes · 4 min read

How Tencent’s LLM Powers Real‑World AI Solutions with RAG and Agents

DataFunSummit

Nov 7, 2025 · Artificial Intelligence

How Tencent’s LLM Powers Content Creation, Smart Service, and Game NPCs

This article examines Tencent’s large language model deployments across content generation, intelligent customer service, and game role‑playing, and explains the underlying technologies—Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and Agent systems—highlighting how they enhance performance, explainability, and multi‑step reasoning in real‑world business scenarios.

AIAgentLLM

0 likes · 4 min read

How Tencent’s LLM Powers Content Creation, Smart Service, and Game NPCs

Wu Shixiong's Large Model Academy

Nov 6, 2025 · Artificial Intelligence

How to Optimize RAG Knowledge Base Construction: Parsing, Chunking, and Retrieval

This article explains why building a high‑quality RAG knowledge base is critical, outlines offline parsing techniques for multi‑format documents, presents semantic chunking strategies that preserve structure and context, and shows how to answer interview questions with a robust, production‑ready pipeline.

AI InterviewKnowledge BaseRAG

0 likes · 8 min read

How to Optimize RAG Knowledge Base Construction: Parsing, Chunking, and Retrieval

Wu Shixiong's Large Model Academy

Nov 5, 2025 · Artificial Intelligence

Why Production-Ready RAG Is Ten Times Harder Than a Simple Demo

Building a Retrieval‑Augmented Generation (RAG) system may be straightforward in code, but making it reliable, accurate, and scalable in production involves challenges across data preparation, vector retrieval, query rewriting, generation control, and system integration, turning a demo into a truly useful AI service.

AILLMPrompt engineering

0 likes · 8 min read

Why Production-Ready RAG Is Ten Times Harder Than a Simple Demo

DataFunSummit

Nov 4, 2025 · Artificial Intelligence

How Tencent Leverages RAG, GraphRAG, and Agents to Power Large Language Model Applications

This article explores Tencent's large language model deployments across various business scenarios, detailing core use cases such as content generation, intelligent customer service, and role‑playing, and explains the underlying technologies—Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and Agent systems—that enable these applications.

AIAgentRAG

0 likes · 4 min read

How Tencent Leverages RAG, GraphRAG, and Agents to Power Large Language Model Applications

Wu Shixiong's Large Model Academy

Nov 4, 2025 · Artificial Intelligence

Why Financial RAG Fails and How to Solve Its Core Challenges

This article explains why Retrieval‑Augmented Generation (RAG) projects in the financial sector often underperform, highlighting data‑structure complexities, document‑parsing hurdles, chunking strategies, compliance constraints, evaluation metrics, and engineering requirements, and offers practical solutions and code examples.

EngineeringFinancial AIRAG

0 likes · 10 min read

Why Financial RAG Fails and How to Solve Its Core Challenges

dbaplus Community

Nov 3, 2025 · Artificial Intelligence

How RAG Turns Natural Language Queries into Accurate SQL for Data Platforms

This article explains how Retrieval‑Augmented Generation (RAG) combines vector databases with large language models to let non‑technical users ask natural‑language questions and receive precise SQL statements, detailing the workflow, architecture, chunking methods, performance gains, and remaining challenges.

Data PlatformLLMRAG

0 likes · 17 min read

How RAG Turns Natural Language Queries into Accurate SQL for Data Platforms

Instant Consumer Technology Team

Nov 3, 2025 · Artificial Intelligence

Large Language Models Power Big Data SRE Knowledge & Root‑Cause Automation

Facing the growing complexity of big‑data platforms, the SRE team adopted large‑language‑model agents to automate knowledge management and root‑cause analysis, employing Retrieval‑Augmented Generation, a vector store, and the Model Context Protocol to enable intelligent, scalable, and efficient incident diagnosis and resolution.

AIMCPRAG

0 likes · 12 min read

Large Language Models Power Big Data SRE Knowledge & Root‑Cause Automation

DataFunSummit

Nov 3, 2025 · Artificial Intelligence

How Tencent’s LLM Powers Real‑World AI: From RAG to Agents

This article examines Tencent's large language model applications across diverse business scenarios, detailing core use cases such as content generation, intelligent customer service, and role‑playing, and explains the three key technologies—Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and Agents—that enable these capabilities.

AI applicationsAgentLLM

0 likes · 4 min read

How Tencent’s LLM Powers Real‑World AI: From RAG to Agents

Wu Shixiong's Large Model Academy

Nov 2, 2025 · Artificial Intelligence

Why Document Parsing Is the Real Bottleneck in RAG Projects (And How to Fix It)

The article explains that in Retrieval‑Augmented Generation projects the hardest challenge lies in robust document parsing—handling PDFs, PPTs, scanned contracts, OCR errors, and preserving structure—to ensure high‑quality retrieval and avoid hallucinations.

AIOCRRAG

0 likes · 10 min read

Why Document Parsing Is the Real Bottleneck in RAG Projects (And How to Fix It)

Data Party THU

Nov 1, 2025 · Artificial Intelligence

How to Blend Process‑Oriented and Agent‑Centric AI into a Hybrid Intelligent Pipeline

This article analyzes two contrasting AI agent design paradigms—process‑driven workflow orchestration and autonomous agent intelligence—examines their strengths and limitations, and proposes a hybrid architecture that fuses deterministic pipelines with dynamic planning, tool use, and memory mechanisms to achieve both reliability and adaptability.

AIAgentHybrid

0 likes · 15 min read

How to Blend Process‑Oriented and Agent‑Centric AI into a Hybrid Intelligent Pipeline

Wu Shixiong's Large Model Academy

Nov 1, 2025 · Artificial Intelligence

Turn a Basic RAG Demo into a High‑Impact Interview Project

This guide shows how to evolve a simple Retrieval‑Augmented Generation prototype into a production‑grade system by strengthening data ingestion, optimizing retrieval with hybrid and reranking techniques, adding query rewriting, long‑context handling, reinforcement learning, and multimodal support, so candidates can demonstrate real engineering depth in interviews.

AILLMRAG

0 likes · 7 min read

Turn a Basic RAG Demo into a High‑Impact Interview Project

Rare Earth Juejin Tech Community

Oct 31, 2025 · Artificial Intelligence

Build a Private AI Knowledge Base with Ollama and FastGPT

This guide walks you through setting up a locally deployed AI system using Ollama and FastGPT, covering model selection, Docker deployment, configuration, knowledge‑base creation, and testing so your team can query internal documents securely and efficiently.

AIDockerFastGPT

0 likes · 25 min read

Build a Private AI Knowledge Base with Ollama and FastGPT

Wu Shixiong's Large Model Academy

Oct 30, 2025 · Interview Experience

How to Write a RAG Project Experience That Impresses Interviewers

This guide explains why typical RAG résumé entries fall flat and provides a step‑by‑step framework—including motivation, architecture, optimization, and impact metrics—to craft a compelling, interview‑ready description of Retrieval‑Augmented Generation projects.

AI ProjectInterview TipsRAG

0 likes · 10 min read

How to Write a RAG Project Experience That Impresses Interviewers

BirdNest Tech Talk

Oct 30, 2025 · Artificial Intelligence

Master LangChain Chains with LCEL: From Simple Jokes to RAG and Agent Pipelines

This guide explains how LangChain’s Expression Language (LCEL) lets you declaratively connect prompts, models, and output parsers into chains, walks through environment setup, dependency installation, and detailed code examples ranging from a basic joke generator to retrieval‑augmented generation and memory‑enabled agents.

AgentLCELLangChain

0 likes · 5 min read

Master LangChain Chains with LCEL: From Simple Jokes to RAG and Agent Pipelines

Alibaba Cloud Developer

Oct 30, 2025 · Artificial Intelligence

Why AI Agents Aren’t As Simple As They Appear: Engineering Challenges and Solutions

Building AI agents may seem straightforward with frameworks like LangChain, but hidden complexities in orchestration, memory management, reproducibility, and scalability turn simple demos into fragile systems, requiring systematic engineering, observability, and robust design to achieve reliable, production‑grade intelligent agents.

AI agentsAgent DesignLangChain

0 likes · 21 min read

Why AI Agents Aren’t As Simple As They Appear: Engineering Challenges and Solutions

DeWu Technology

Oct 29, 2025 · Artificial Intelligence

Why Chunking Can Make or Break Your RAG System – Practical Strategies & Code

This article explains how proper document chunking—choosing the right chunk size, overlap, and structure‑aware boundaries—directly impacts the relevance, factuality, and efficiency of Retrieval‑Augmented Generation pipelines, and provides multiple Python implementations ranging from simple fixed‑length splits to semantic and hybrid approaches.

EmbeddingLLMRAG

0 likes · 29 min read

Why Chunking Can Make or Break Your RAG System – Practical Strategies & Code

Wu Shixiong's Large Model Academy

Oct 29, 2025 · Artificial Intelligence

How to Build a Robust RAG Evaluation Framework for Finance Q&A

This guide outlines a five‑dimensional evaluation system—accuracy, credibility, latency, scalability, and user experience—providing concrete metrics, code examples, and practical steps to assess Retrieval‑Augmented Generation models in financial insurance question‑answering scenarios.

AIRAGfinance

0 likes · 11 min read

How to Build a Robust RAG Evaluation Framework for Finance Q&A

Alibaba Cloud Developer

Oct 28, 2025 · Artificial Intelligence

Automating Research with Spring AI: Inside the DeepResearch Java RAG System

The article introduces Spring AI Alibaba DeepResearch, a Java‑based system that automates end‑to‑end research by integrating multi‑agent workflows, configurable RAG pipelines, data ingestion, observability, and dynamic report generation, and provides detailed deployment and extension guidance.

AI workflowDeepResearchJava

0 likes · 20 min read

Automating Research with Spring AI: Inside the DeepResearch Java RAG System

AI Large Model Application Practice

Oct 27, 2025 · Artificial Intelligence

Why Context Engineering Is the Next Evolution Beyond Prompt Engineering

The article explains how traditional prompt engineering is giving way to Context Engineering and the Agentic Context Engineering (ACE) framework, which lets large language model agents continuously learn and improve through evolving, well‑structured context without fine‑tuning.

AI ArchitectureAgentic AIContext Engineering

0 likes · 12 min read

Why Context Engineering Is the Next Evolution Beyond Prompt Engineering

Bilibili Tech

Oct 27, 2025 · Artificial Intelligence

How Bilibili’s LLM-Powered System Cuts Game Localization Costs by 80%

Bilibili’s game algorithm team built a four‑layer, LLM‑based translation platform that automates terminology extraction, retrieval‑augmented generation, and quality assessment, dramatically reducing localization cycles by over 85% and costs by up to 80% while supporting ten languages and ensuring consistent, culturally‑accurate game text.

LLMRAGgame localization

0 likes · 20 min read

How Bilibili’s LLM-Powered System Cuts Game Localization Costs by 80%

Wu Shixiong's Large Model Academy

Oct 27, 2025 · Artificial Intelligence

Designing Effective Generation Modules for RAG: Prompt Engineering, Multi‑Document Fusion, and Hallucination Control

This article explains how to design and optimize the generation module of Retrieval‑Augmented Generation systems by building robust prompts, merging multi‑source information, controlling answer formats, and applying post‑generation verification to reduce hallucinations and improve enterprise‑grade performance.

AIGeneration ModuleHallucination Control

0 likes · 9 min read

Designing Effective Generation Modules for RAG: Prompt Engineering, Multi‑Document Fusion, and Hallucination Control

BirdNest Tech Talk

Oct 27, 2025 · Artificial Intelligence

How LangChain’s Indexing API Enables Efficient Incremental Updates for RAG Systems

This article explains how LangChain's Indexing API adds state management and synchronization to the classic load‑split‑embed‑store RAG pipeline, detailing the RecordManager component, the index function workflow, key parameters, implementation considerations, and best‑practice code examples for production‑grade vector stores.

FAISSIndexing APILangChain

0 likes · 12 min read

How LangChain’s Indexing API Enables Efficient Incremental Updates for RAG Systems

BirdNest Tech Talk

Oct 27, 2025 · Artificial Intelligence

Mastering LangChain Retrievers: From Basics to Multi‑Query and Ensemble Techniques

This article explains the concept of LangChain retrievers, compares them with vector stores, and provides step‑by‑step code examples for MultiQueryRetriever and EnsembleRetriever, showing how to boost recall and combine semantic and keyword search for robust RAG pipelines.

AIEnsembleRetrieverLangChain

0 likes · 9 min read

Mastering LangChain Retrievers: From Basics to Multi‑Query and Ensemble Techniques

Huawei Cloud Developer Alliance

Oct 27, 2025 · Artificial Intelligence

Master AI Agents and MCP: A Complete 4‑Month Learning Roadmap

This article presents a structured, step‑by‑step learning path that guides beginners from Python fundamentals through AI API mastery, Retrieval‑Augmented Generation, deep MCP protocol knowledge, and advanced multi‑agent development, complete with practical code examples and performance‑monitoring techniques.

AI agentsLangChainMCP protocol

0 likes · 14 min read

Master AI Agents and MCP: A Complete 4‑Month Learning Roadmap

Wu Shixiong's Large Model Academy

Oct 25, 2025 · Artificial Intelligence

How to Build a High‑Quality RAG Knowledge Base: A Step‑by‑Step Guide

This article breaks down the end‑to‑end engineering pipeline for constructing a Retrieval‑Augmented Generation (RAG) knowledge base, covering document parsing, data cleaning, semantic chunking, embedding, and index creation, plus practical optimization tips and a concise interview answer framework.

LLMRAGvector indexing

0 likes · 10 min read

How to Build a High‑Quality RAG Knowledge Base: A Step‑by‑Step Guide

AI2ML AI to Machine Learning

Oct 24, 2025 · Artificial Intelligence

Beyond RAG: Three Emerging Knowledge‑Engineering Strategies (ICL, Online Learning, SLM)

The article outlines three post‑RAG knowledge‑engineering approaches—In‑Context Learning with dynamic few‑shot selection, Online Learning encompassing Meta‑Learning and Lifelong Learning to quickly adapt to new tasks, and the Small Language Model path that combines fine‑tuned task‑specific experts with LLM‑SLM collaboration for efficient, privacy‑preserving inference.

In-Context LearningKnowledge EngineeringLLM

0 likes · 4 min read

Beyond RAG: Three Emerging Knowledge‑Engineering Strategies (ICL, Online Learning, SLM)

DataFunTalk

Oct 23, 2025 · Artificial Intelligence

How Tencent Leverages RAG and Agents to Supercharge Large Language Models

This article examines Tencent's large language model deployments across diverse business scenarios, detailing how Retrieval‑Augmented Generation, Supervised Fine‑Tuning, and autonomous agents boost model intelligence, reduce hallucinations, and enable sophisticated content creation, understanding, and interactive applications.

AI agentsRAGTencent

0 likes · 4 min read

How Tencent Leverages RAG and Agents to Supercharge Large Language Models

Huawei Cloud Developer Alliance

Oct 23, 2025 · Artificial Intelligence

Boost Your RAG Bot’s Accuracy: Hybrid Search, Query Rewriting, and Re‑ranking Explained

This article walks developers through three essential upgrades for Retrieval‑Augmented Generation systems—hybrid search combining vector and keyword retrieval, query rewriting to clarify conversational inputs, and re‑ranking with a cross‑encoder—providing step‑by‑step code examples using LangChain to dramatically improve answer quality.

AIHybrid SearchLangChain

0 likes · 9 min read

Boost Your RAG Bot’s Accuracy: Hybrid Search, Query Rewriting, and Re‑ranking Explained

Xuanwu Backend Tech Stack

Oct 22, 2025 · Artificial Intelligence

How Rerank Transforms Retrieval‑Augmented Generation for Accurate AI Answers

This article explains the limitations of basic Retrieval‑Augmented Generation (RAG), introduces Rerank technology as a two‑step refinement process, compares dual‑encoder and cross‑encoder methods, and reviews popular Rerank models to help developers build more precise AI‑driven retrieval systems.

RAGRerankRetrieval Augmented Generation

0 likes · 10 min read

How Rerank Transforms Retrieval‑Augmented Generation for Accurate AI Answers

JD Tech Talk

Oct 21, 2025 · Backend Development

How Backend Engineers Are Breaking Through AI with RAG Architectures

This article details a backend developer's two‑year AI journey, the challenges of rapid model advances, and how applying microservice principles to Retrieval‑Augmented Generation (RAG) creates a scalable, multi‑agent platform for insurance knowledge, memory, and intelligent agents.

Backend AIKnowledge BaseRAG

0 likes · 11 min read

How Backend Engineers Are Breaking Through AI with RAG Architectures

BirdNest Tech Talk

Oct 21, 2025 · Artificial Intelligence

How Vector Stores Enable Lightning‑Fast Semantic Search in LangChain

This article explains what vector stores are, outlines their core workflow of adding, querying, and searching embeddings, compares popular back‑ends like FAISS, Chroma, and Pinecone, and walks through a complete Chinese‑language example using LangChain’s FAISS integration with detailed code and result analysis.

AIFAISSLangChain

0 likes · 10 min read

How Vector Stores Enable Lightning‑Fast Semantic Search in LangChain

Alibaba Cloud Developer

Oct 20, 2025 · Artificial Intelligence

How LLM-Powered Agents Transform Secure Code Review in Enterprise Repositories

This article details the implementation of an LLM‑based code‑review agent in a C3‑level secure repository, describing its RAG‑enhanced knowledge base, CI pipeline integration, real‑world results, prompt engineering, and ongoing optimization to boost review efficiency and defect detection.

AI agentsCode reviewLLM

0 likes · 18 min read

How LLM-Powered Agents Transform Secure Code Review in Enterprise Repositories

BirdNest Tech Talk

Oct 16, 2025 · Artificial Intelligence

Mastering Text Splitting in LangChain: From Theory to Code

This guide explains why large documents must be broken into semantic chunks for LLMs, introduces core parameters like chunk_size and chunk_overlap, compares LangChain's various splitters, and walks through a complete Python example that loads a long text, configures a RecursiveCharacterTextSplitter, and inspects the resulting chunks.

EmbeddingLangChainRAG

0 likes · 9 min read

Mastering Text Splitting in LangChain: From Theory to Code