Tagged articles
891 articles
Page 4 of 9
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Jan 3, 2026 · Artificial Intelligence

Build Your Own AI Coding Assistant in 5 Minutes: A Hands‑On Guide

The article analyzes common pain points of traditional AI coding chats—repetitive context input, lengthy prompts, and generic answers—and demonstrates how to create a persistent, expert‑level AI coding assistant using Coco AI, with step‑by‑step configuration, example prompts, and future RAG enhancements.

AI AgentCoco AIDeepSeek
0 likes · 9 min read
Build Your Own AI Coding Assistant in 5 Minutes: A Hands‑On Guide
dbaplus Community
dbaplus Community
Jan 1, 2026 · Artificial Intelligence

Boost LLM Retrieval Accuracy with MCP – A Superior Alternative to RAG

This guide explains why traditional Retrieval‑Augmented Generation (RAG) struggles with precision, introduces the Model Context Protocol (MCP) as a standardized way for large language models to interact with external data sources, and provides step‑by‑step instructions for integrating MCP with MongoDB using Cherry Studio and VSCode +Cline.

FunctionCallMCPMongoDB
0 likes · 25 min read
Boost LLM Retrieval Accuracy with MCP – A Superior Alternative to RAG
Old Meng AI Explorer
Old Meng AI Explorer
Dec 30, 2025 · Artificial Intelligence

How UltraRAG Delivers One‑Click, No‑Code RAG Deployment and Boosts Retrieval Accuracy

UltraRAG, an open‑source RAG framework from THUNLP and NEUIR, consolidates data construction, model fine‑tuning, and evaluation into a zero‑code WebUI, offering multimodal knowledge‑base creation, one‑click optimization, robust multi‑dimensional evaluation, and micro‑service deployment that can raise retrieval accuracy by up to 30% and halve development time.

AIOpen-sourceRAG
0 likes · 10 min read
How UltraRAG Delivers One‑Click, No‑Code RAG Deployment and Boosts Retrieval Accuracy
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Dec 29, 2025 · Cloud Native

How a Visual Platform Cut Search Costs by 60% with All‑in‑Elasticsearch

This case study details how a major internet visual platform consolidated its log, keyword, and vector search workloads onto Alibaba Cloud Elasticsearch, eliminating three separate pipelines, reducing write‑costs by 60%, cutting storage expenses over 60%, and achieving multi‑fold performance gains through serverless scaling, FalconSeek engine optimizations, and unified monitoring.

Cost OptimizationElasticsearchRAG
0 likes · 10 min read
How a Visual Platform Cut Search Costs by 60% with All‑in‑Elasticsearch
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Dec 28, 2025 · Artificial Intelligence

Building an Elasticsearch‑Powered RAG Q&A System: Theory and Full Code Walkthrough

This article walks through the principles of Retrieval‑Augmented Generation (RAG) and provides a complete Python implementation using Elasticsearch, covering document chunking, semantic embedding, bulk indexing, hybrid BM25‑vector search, RRF result fusion, prompt design, LLM invocation, and a practical demo.

ElasticsearchHybrid SearchPrompt engineering
0 likes · 9 min read
Building an Elasticsearch‑Powered RAG Q&A System: Theory and Full Code Walkthrough
Architecture and Beyond
Architecture and Beyond
Dec 27, 2025 · Artificial Intelligence

Turning Claude Skill Folders into Scalable Industry Workflows

This article explains how Anthropic's Claude Skill folders let you package domain expertise, scripts, and resources into reusable modules, differentiate Skills from prompts, combine them with MCP tools and workflows, and build a robust mixed Agent‑Workflow architecture for reliable enterprise automation.

AI agentsClaudeMCP
0 likes · 18 min read
Turning Claude Skill Folders into Scalable Industry Workflows
AI Architecture Hub
AI Architecture Hub
Dec 27, 2025 · Artificial Intelligence

How GraphRAG Turns Knowledge Graphs into Smarter Retrieval for LLMs

GraphRAG extends traditional Retrieval‑Augmented Generation by building a knowledge graph from documents, extracting entities and relationships, performing community detection, and supporting both local and global searches, offering detailed step‑by‑step guidance, code examples, configuration tips, and a comparison with classic RAG approaches.

GraphRAGKnowledge GraphLLM
0 likes · 28 min read
How GraphRAG Turns Knowledge Graphs into Smarter Retrieval for LLMs
360 Tech Engineering
360 Tech Engineering
Dec 26, 2025 · Artificial Intelligence

15 Chunking Strategies to Supercharge Retrieval‑Augmented Generation

This article presents fifteen practical chunking techniques—ranging from line‑by‑line and fixed‑size chunking to semantic and hierarchical methods—explaining their principles, ideal use‑cases, concrete input examples, chunk outputs, and key advantages or cautions for improving Retrieval‑Augmented Generation with large language models.

AIData RetrievalLLM
0 likes · 28 min read
15 Chunking Strategies to Supercharge Retrieval‑Augmented Generation
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 26, 2025 · Artificial Intelligence

How to Build a Fully Automated Knowledge‑Extraction Pipeline for AI Agents with Python

This article presents a complete end‑to‑end pipeline that automatically extracts, generalizes, incrementally updates, and vector‑syncs knowledge from diverse sources such as tickets, documents, and SQL code, turning the traditionally labor‑intensive knowledge‑base construction for agents into a low‑effort, continuously maintainable Python‑driven solution.

LLMPythonRAG
0 likes · 15 min read
How to Build a Fully Automated Knowledge‑Extraction Pipeline for AI Agents with Python
Architect
Architect
Dec 25, 2025 · Artificial Intelligence

How GraphRAG Boosts Retrieval Accuracy with Knowledge Graphs – A Complete Guide

This article explains why traditional RAG suffers from hallucinations, introduces GraphRAG’s knowledge‑graph‑based approach, walks through its indexing and query pipelines—including text splitting, entity‑relation extraction, graph construction, community detection, and local vs. global retrieval—provides practical setup commands, Neo4j visualization steps, and compares its performance with classic RAG.

EmbeddingGraphRAGKnowledge Graph
0 likes · 27 min read
How GraphRAG Boosts Retrieval Accuracy with Knowledge Graphs – A Complete Guide
Open Source Tech Hub
Open Source Tech Hub
Dec 25, 2025 · Artificial Intelligence

Explore Symfony AI: Bringing Native AI Capabilities to PHP

Symfony AI v0.1.0 launches with a suite of PHP components that let developers integrate OpenAI‑style models, vector stores, autonomous agents, and chat persistence directly into Symfony apps, offering easy installation, rich demos, and a dedicated website for hands‑on experimentation.

AIOpenAIPHP
0 likes · 6 min read
Explore Symfony AI: Bringing Native AI Capabilities to PHP
AI Architecture Hub
AI Architecture Hub
Dec 24, 2025 · Artificial Intelligence

From LLMs to Autonomous Agents: The Three Evolution Stages of AI

This article explains the three evolutionary stages of AI—from large language models that generate text, through workflow‑enhanced systems using retrieval‑augmented generation, to fully autonomous agents capable of self‑directed decision‑making—while detailing the four core technologies that power each stage.

AI evolutionAgentEmbedding
0 likes · 9 min read
From LLMs to Autonomous Agents: The Three Evolution Stages of AI
PMTalk Product Manager Community
PMTalk Product Manager Community
Dec 24, 2025 · Artificial Intelligence

Why AI Hallucinates and How Product Managers Can Tame It

The article explains the internal and external causes of AI hallucinations, examines how pre‑training data flaws and fine‑tuning choices amplify them, and presents a five‑pronged technical toolbox—including RAG, prompt engineering, chain‑of‑thought, self‑verification, and safety APIs—plus risk‑based product strategies for different industries.

AI hallucinationPrompt engineeringRAG
0 likes · 12 min read
Why AI Hallucinates and How Product Managers Can Tame It
Tencent Cloud Developer
Tencent Cloud Developer
Dec 24, 2025 · Backend Development

How IMA Scaled Its AI Knowledge Base from Monolith to Micro‑services

This article walks through the end‑to‑end design of IMA's AI‑driven knowledge base, covering its definition, core business flow, architecture evolution, data ingestion pipelines, management challenges, asynchronous processing, permission modeling, and the business value demonstrated by the prototype.

AI ArchitectureData ConsistencyKnowledge Base
0 likes · 14 min read
How IMA Scaled Its AI Knowledge Base from Monolith to Micro‑services
DataFunTalk
DataFunTalk
Dec 23, 2025 · Artificial Intelligence

Unlocking AI Search: Agentic RAG, LLM‑Powered Recommendations, and Generative Ranking Explained

This article summarizes three cutting‑edge AI search and recommendation techniques—Alibaba Cloud's Agentic RAG architecture, Huawei's LLM‑enhanced recommendation system evolution, and Baidu's generative ranking model GRAB—detailing their challenges, design choices, performance gains, and practical deployment insights.

AIGenerative RankingMulti‑Agent
0 likes · 7 min read
Unlocking AI Search: Agentic RAG, LLM‑Powered Recommendations, and Generative Ranking Explained
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Dec 21, 2025 · Artificial Intelligence

Deploy and Explore Open WebUI: A Feature‑Rich Self‑Hosted AI Platform

Open WebUI is a self‑hosted, extensible AI platform that runs fully offline, supports multiple LLM back‑ends such as Ollama and OpenAI‑compatible APIs, offers built‑in RAG, role‑based access, multi‑model chat, markdown/LaTeX, image generation, and provides detailed Docker, pip, and Kubernetes installation guides with ready‑to‑run commands.

AI PlatformDockerLLM
0 likes · 11 min read
Deploy and Explore Open WebUI: A Feature‑Rich Self‑Hosted AI Platform
Architecture and Beyond
Architecture and Beyond
Dec 21, 2025 · Artificial Intelligence

Designing RAG for Industry‑Specific AI Agents: From Data to Safe Execution

This article explains how to build Retrieval‑Augmented Generation (RAG) for industry‑specific AI agents, covering required capabilities, metrics, data sources, indexing, hybrid retrieval, decision‑point integration, layered output, permission controls, rollout strategies, and common pitfalls to ensure reliable and secure automation.

Agent DesignKnowledge RetrievalRAG
0 likes · 17 min read
Designing RAG for Industry‑Specific AI Agents: From Data to Safe Execution
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Dec 20, 2025 · Artificial Intelligence

How to Build an Enterprise‑Grade Intelligent Document QA System with Everything plus RAG

This article walks through the need for fast, accurate answers from massive document collections, compares plain keyword search and pure LLM chat, and presents a hybrid Retrieval‑Augmented Generation solution built with open‑source components, detailing architecture, hybrid retrieval, prompt engineering, deployment, performance tuning, and common pitfalls.

ElasticsearchHybrid RetrievalPrompt engineering
0 likes · 12 min read
How to Build an Enterprise‑Grade Intelligent Document QA System with Everything plus RAG
Architect's Journey
Architect's Journey
Dec 19, 2025 · Artificial Intelligence

Why Context Engineering Is the Hottest AI Skill in 2025

The article explains how context engineering—building a dynamic system that supplies AI with user intent, dialogue history, long‑term memory, external knowledge and tool definitions—outperforms traditional prompt engineering, eliminates hallucinations, and enables AI to complete complex, end‑to‑end tasks.

AIAI agentsContext Engineering
0 likes · 8 min read
Why Context Engineering Is the Hottest AI Skill in 2025
PaperAgent
PaperAgent
Dec 18, 2025 · Artificial Intelligence

Can Ontology‑Aware KG‑RAG Double Table QA Performance on Industrial Standards?

This article presents an ontology‑aware knowledge‑graph RAG framework that transforms complex, hierarchical industrial standard documents into a graph of sections, atomic propositions, and refined triples, achieving nearly double F1 scores on table‑based QA tasks and robust performance on long documents.

Knowledge GraphLLMOntology
0 likes · 6 min read
Can Ontology‑Aware KG‑RAG Double Table QA Performance on Industrial Standards?
Architects' Tech Alliance
Architects' Tech Alliance
Dec 17, 2025 · Artificial Intelligence

Mastering Retrieval‑Augmented Generation: From Theory to Scalable Deployment

This guide explains how Retrieval‑Augmented Generation (RAG) overcomes LLM knowledge staleness, hallucination, and domain‑adaptation challenges by combining external knowledge bases with real‑time retrieval, and provides detailed architecture, optimization techniques, engineering practices, monitoring, cost‑control, and future trends for building production‑grade RAG systems.

AICloudflareLLM
0 likes · 15 min read
Mastering Retrieval‑Augmented Generation: From Theory to Scalable Deployment
Aikesheng Open Source Community
Aikesheng Open Source Community
Dec 16, 2025 · Databases

How to Build Predictive and Generative AI Apps with MySQL AI

MySQL AI adds built‑in LLMs, embeddings, vector storage, AutoML and a graphical console to on‑premise MySQL, enabling developers to create predictive and generative AI applications—including fraud detection, semantic search, RAG and NL2SQL—without external vector databases or GPUs.

AutoMLMySQL AIPredictive AI
0 likes · 15 min read
How to Build Predictive and Generative AI Apps with MySQL AI
JakartaEE China Community
JakartaEE China Community
Dec 16, 2025 · Artificial Intelligence

Build a Retrieval‑Augmented Generation (RAG) System with Langchain4j and Ollama 3

This guide walks through the importance of Retrieval‑Augmented Generation, outlines the core Langchain4j and Ollama 3 components, and provides a complete Java example—including Maven setup, document ingestion, embedding creation, similarity search, prompt construction, and response generation—to demonstrate a functional RAG pipeline.

EmbeddingJavaLLM
0 likes · 9 min read
Build a Retrieval‑Augmented Generation (RAG) System with Langchain4j and Ollama 3
PaperAgent
PaperAgent
Dec 14, 2025 · Artificial Intelligence

GPT‑5.2 vs Gemini 3 Pro: Coding Tests, NeurIPS 2025 Paper Insights, and RAG Refactor

The article evaluates GPT‑5.2 and Gemini 3 Pro on real‑world coding tasks, analyzes trends from the 6000 papers presented at NeurIPS 2025, and demonstrates how to extract and refactor the tree‑building component of the open‑source RAPTOR RAG system into an independent module.

AI model evaluationCode RefactoringGPT-5.2
0 likes · 5 min read
GPT‑5.2 vs Gemini 3 Pro: Coding Tests, NeurIPS 2025 Paper Insights, and RAG Refactor
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Dec 13, 2025 · Artificial Intelligence

Explore 100+ Open‑Source LLM Apps and How to Run Them Locally

This guide presents a curated collection of over a hundred open‑source large language model applications—including AI agents, RAG pipelines, and domain‑specific tools—explains their categories, showcases example projects, and provides step‑by‑step instructions to clone and run them on your own machine.

AI agentsGitHubLLM
0 likes · 8 min read
Explore 100+ Open‑Source LLM Apps and How to Run Them Locally
Fun with Large Models
Fun with Large Models
Dec 7, 2025 · Frontend Development

Building a Multimodal RAG Front‑End with Trae Solo: A Vibe‑Coding Guide

This article walks through a three‑step Vibe‑Coding workflow—structured prompt creation, prompt optimization with DeepSeek, and precise bug‑fix guidance—to automatically generate, refine, and extend a React + TypeScript front‑end for a multimodal RAG system using Trae Solo, covering architecture, streaming chat, and PDF citation features.

AI programmingLangChainRAG
0 likes · 22 min read
Building a Multimodal RAG Front‑End with Trae Solo: A Vibe‑Coding Guide
dbaplus Community
dbaplus Community
Dec 7, 2025 · Artificial Intelligence

How AI Agents Can Revolutionize Data Governance: A Step‑by‑Step Blueprint

This article explains how AI agents transform traditional data governance by introducing a four‑layer perception‑decision‑execution‑learning architecture, detailing the required technologies, tool integrations, code examples, deployment steps, team roles, security safeguards, and practical rollout strategies for enterprises seeking automated, intelligent data management.

AI AgentData GovernanceData Quality
0 likes · 10 min read
How AI Agents Can Revolutionize Data Governance: A Step‑by‑Step Blueprint
Old Meng AI Explorer
Old Meng AI Explorer
Dec 5, 2025 · Industry Insights

How Bisheng Turns Enterprise AI Deployment into a Zero‑Code, One‑Stop Process

Bisheng, an open‑source LLM DevOps platform, solves the fragmented, high‑threshold, and compliance‑heavy challenges of enterprise AI by offering a zero‑code visual workflow, all‑in‑one RAG/Agent capabilities, strict security controls, and high‑precision document parsing, enabling rapid, secure AI application rollout.

AI PlatformLLM DevOpsRAG
0 likes · 11 min read
How Bisheng Turns Enterprise AI Deployment into a Zero‑Code, One‑Stop Process
Instant Consumer Technology Team
Instant Consumer Technology Team
Dec 4, 2025 · Artificial Intelligence

How to Build an AI‑Powered Jira Assistant with LangGraph, RAG, and MCP

This article walks through the design and implementation of an AI‑driven Jira assistant that uses LangGraph as the agent brain, Retrieval‑Augmented Generation for knowledge access, and a Model Context Protocol (MCP) server to execute Jira operations, complete with architecture diagrams, code snippets, and practical use cases.

AI AgentJira AutomationLangGraph
0 likes · 12 min read
How to Build an AI‑Powered Jira Assistant with LangGraph, RAG, and MCP
DataFunTalk
DataFunTalk
Dec 4, 2025 · Artificial Intelligence

Agentic RAG, LLM‑Powered Recommendation, and Generative Ranking: Cutting‑Edge AI Search Techniques

This article reviews three advanced AI search solutions—Alibaba Cloud's Agentic RAG architecture for multi‑modal retrieval, Huawei's LLM‑enhanced recommendation system with factorized prompting, and Baidu's generative ranking model GRAB—detailing their technical challenges, design choices, performance gains, and deployment insights.

AI searchBaiduGenerative Ranking
0 likes · 8 min read
Agentic RAG, LLM‑Powered Recommendation, and Generative Ranking: Cutting‑Edge AI Search Techniques
Baidu MEUX
Baidu MEUX
Dec 3, 2025 · User Experience Design

Boost User Research with AI: Automating Short Feedback Classification & Long‑Form Insight Extraction

This article explains how AI large‑language models can automate short user‑feedback classification and extract insights from long interview texts, offering practical prompting tips, fine‑tuning strategies, and Retrieval‑Augmented Generation workflows to make user research faster, more accurate, and less labor‑intensive.

AIFeedback ClassificationPrompt engineering
0 likes · 11 min read
Boost User Research with AI: Automating Short Feedback Classification & Long‑Form Insight Extraction
macrozheng
macrozheng
Dec 3, 2025 · Databases

How Redis’s New Multithreaded Query Engine Boosts Vector Search Performance

Redis has introduced a multithreaded query engine that dramatically reduces latency and increases throughput—up to 16×—for vector similarity searches, enabling vertical scaling and better support for real‑time RAG applications compared to traditional single‑threaded architectures and competing vector databases.

RAGdatabase scalingmultithreading
0 likes · 6 min read
How Redis’s New Multithreaded Query Engine Boosts Vector Search Performance
Yiche Technology
Yiche Technology
Dec 3, 2025 · Artificial Intelligence

How Milvus Powered a Scalable AI Assistant for Car Queries with Vector Search

This article details how an automotive AI assistant migrated from keyword matching to a Milvus‑based vector retrieval system, overcoming semantic gaps, scaling to millions of daily queries, optimizing indexing, introducing multi‑vector and sparse‑vector search, and building a real‑time RAG pipeline with Flink.

AI AssistantMilvusRAG
0 likes · 12 min read
How Milvus Powered a Scalable AI Assistant for Car Queries with Vector Search
Data STUDIO
Data STUDIO
Dec 3, 2025 · Artificial Intelligence

Pixeltable: One Table to Power Multimodal AI with Declarative Python

Pixeltable introduces a unified table abstraction that treats images, text, embeddings and model outputs as columns, enabling declarative multimodal AI pipelines, eliminating glue code, supporting built‑in vector indexing, versioned experiments, extensible custom functions, and a concise 30‑line RAG implementation.

Multimodal AIPixeltablePython
0 likes · 15 min read
Pixeltable: One Table to Power Multimodal AI with Declarative Python
DataFunTalk
DataFunTalk
Dec 2, 2025 · Artificial Intelligence

How Agentic RAG, LLM‑Powered Recommendation, and Generative Ranking Are Redefining AI Search

This article reviews three cutting‑edge AI search and recommendation techniques—Alibaba Cloud's Agentic RAG architecture, Huawei Noah's LLM‑enhanced recommendation pipeline, and Baidu's GRAB generative ranking model—detailing their design challenges, multi‑modal retrieval strategies, performance gains, and real‑world deployment results.

AI agentsAI searchGenerative Ranking
0 likes · 8 min read
How Agentic RAG, LLM‑Powered Recommendation, and Generative Ranking Are Redefining AI Search
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Dec 2, 2025 · Artificial Intelligence

How LLMs Can Revolutionize Test Case Generation: Methods, Benefits, and Challenges

This article examines the shortcomings of manual test case creation, explains how large language models (LLMs) can dramatically improve efficiency, coverage, consistency, and knowledge sharing in software testing, outlines the key capabilities required, and presents a detailed end‑to‑end solution with practical steps, evaluation metrics, and future outlook.

AI automationKnowledge BaseLLM
0 likes · 20 min read
How LLMs Can Revolutionize Test Case Generation: Methods, Benefits, and Challenges
Instant Consumer Technology Team
Instant Consumer Technology Team
Dec 1, 2025 · Artificial Intelligence

Understanding AIGC, RAG, Function Calling, and the MCP Protocol: A Practical AI Guide

This article explains the fundamentals of AI‑generated content (AIGC), the Retrieval‑Augmented Generation (RAG) technique, Function Calling, autonomous agents, and the Model Context Protocol (MCP), highlighting their evolution, technical workflows, limitations, and real‑world examples for developers.

AIAIGCFunction Calling
0 likes · 19 min read
Understanding AIGC, RAG, Function Calling, and the MCP Protocol: A Practical AI Guide
Fun with Large Models
Fun with Large Models
Nov 30, 2025 · Artificial Intelligence

Multimodal RAG with LangChain: PDF Parsing, Chunking, and Citation Guide

This article walks through building a LangChain‑based multimodal RAG system that parses PDFs (both native and scanned), splits them into semantic chunks, stores embeddings in a vector database, and generates answers with precise source citations, complete with code samples and API integration.

FastAPILangChainMultimodal AI
0 likes · 20 min read
Multimodal RAG with LangChain: PDF Parsing, Chunking, and Citation Guide
Fun with Large Models
Fun with Large Models
Nov 27, 2025 · Artificial Intelligence

Mastering Coze Knowledge Base: A Step‑by‑Step Low‑Code Agent Guide

This article provides a comprehensive, hands‑on guide to Coze's knowledge base, covering its core concepts, key features, practical use‑case scenarios, detailed creation steps, configuration options, prompt design, testing methods, and a comparison with variables, memory, and databases.

CozeKnowledge BasePrompt engineering
0 likes · 15 min read
Mastering Coze Knowledge Base: A Step‑by‑Step Low‑Code Agent Guide
Old Meng AI Explorer
Old Meng AI Explorer
Nov 27, 2025 · Artificial Intelligence

How UltraRAG Turns RAG Deployment into a Zero‑Code, One‑Click Process

UltraRAG, an open‑source RAG framework co‑developed by Tsinghua and NEUIR, offers a zero‑code WebUI that streamlines data construction, model fine‑tuning, and multi‑dimensional evaluation, boosting retrieval accuracy by up to 30% and cutting deployment time by half across legal, medical, and research use cases.

AIOpen-sourceRAG
0 likes · 11 min read
How UltraRAG Turns RAG Deployment into a Zero‑Code, One‑Click Process
Java Tech Enthusiast
Java Tech Enthusiast
Nov 26, 2025 · Artificial Intelligence

How LLM, RAG, and AI Agents Work Together

The article clarifies how large language models (LLM), retrieval‑augmented generation (RAG), and AI agents complement each other, describing the brain‑like reasoning of LLMs, the dynamic knowledge access provided by RAG, and the autonomous action capabilities of AI agents, plus practical usage scenarios.

AI AgentLLMRAG
0 likes · 7 min read
How LLM, RAG, and AI Agents Work Together
Alibaba Cloud Developer
Alibaba Cloud Developer
Nov 26, 2025 · Artificial Intelligence

Unlocking AI-Powered Customer Service: From RAG to Deep Evaluation and Optimization

This article explores how the rapid growth of large language models reshapes intelligent customer service, detailing the evolution from rule‑based NLP bots to Retrieval‑Augmented Generation (RAG) and AI‑native agents, and presents a comprehensive framework for evaluating, diagnosing, and continuously improving chatbot performance using LLM‑driven metrics and context engineering.

AIContext EngineeringLLM evaluation
0 likes · 46 min read
Unlocking AI-Powered Customer Service: From RAG to Deep Evaluation and Optimization
PMTalk Product Manager Community
PMTalk Product Manager Community
Nov 25, 2025 · Product Management

Avoid the 3 Common AI Product Management Pitfalls: Prompt Engineering, RAG, and Fine‑Tuning

The article examines why AI product managers repeatedly fall into three traps—over‑relying on prompt engineering, blindly adopting Retrieval‑Augmented Generation, or costly fine‑tuning—by presenting real‑world failures, debunking myths, and offering a five‑layer decision framework with cost, data, resource, and risk analysis to choose the right solution.

AI product managementPrompt engineeringRAG
0 likes · 24 min read
Avoid the 3 Common AI Product Management Pitfalls: Prompt Engineering, RAG, and Fine‑Tuning
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Nov 22, 2025 · Artificial Intelligence

Why Your RAG System Slows Down Over Time and How to Fix It

The article explains why a production Retrieval‑Augmented Generation (RAG) system becomes slower as it runs—due to growing embedding costs, expanding vector databases, heavier re‑ranking, and larger prompts—and provides concrete engineering optimizations such as batching, async concurrency, caching, partitioned retrieval, HNSW tuning, replica scaling, answer caching, and prompt sparsification to keep performance stable.

AI EngineeringPerformance OptimizationRAG
0 likes · 10 min read
Why Your RAG System Slows Down Over Time and How to Fix It
JD Tech Talk
JD Tech Talk
Nov 21, 2025 · Artificial Intelligence

Mastering Chunking Strategies for Retrieval‑Augmented Generation

This article explains why effective chunking is crucial for RAG performance, compares seven major chunking strategies—including fixed‑size, semantic, recursive, document‑structure, agent‑driven, sentence, and paragraph methods—and offers practical guidance on selecting and optimizing chunks for real‑world AI applications.

AIRAGRetrieval Augmented Generation
0 likes · 10 min read
Mastering Chunking Strategies for Retrieval‑Augmented Generation
JD Cloud Developers
JD Cloud Developers
Nov 21, 2025 · Artificial Intelligence

Why Chunking Strategy Makes or Breaks RAG Performance

This article explains how different chunking methods—fixed size, semantic, recursive, document‑based, agent‑driven, sentence‑level, and paragraph‑level—affect Retrieval‑Augmented Generation, offering practical guidelines, metrics, and optimization tips for real‑world deployments.

AIRAGchunking
0 likes · 9 min read
Why Chunking Strategy Makes or Breaks RAG Performance
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Nov 21, 2025 · Artificial Intelligence

How to Build a Multi‑Layer Cache for Dynamic RAG Systems

This article explains why dynamic Retrieval‑Augmented Generation (RAG) requires a layered caching strategy rather than simple result caching, details a four‑level cache architecture—including embedding, search, answer, and pipeline caches—provides practical key‑generation and TTL guidelines, and outlines dirty‑data defenses to keep caches consistent and performant.

AI EngineeringLLMRAG
0 likes · 10 min read
How to Build a Multi‑Layer Cache for Dynamic RAG Systems
Baidu Maps Tech Team
Baidu Maps Tech Team
Nov 19, 2025 · Artificial Intelligence

Boosting Socio‑Economic Q&A: The ARAG Framework Merges Structured Data Analysis with RAG

ARAG introduces a novel Retrieval‑Augmented Generation framework that tightly integrates LLM‑driven structured data analysis with unstructured information retrieval, addressing the “structured + unstructured” reasoning gap in socio‑economic queries, and demonstrates superior accuracy, robustness, and hallucination resistance through extensive evaluations.

LLMRAGSocio-economic AI
0 likes · 12 min read
Boosting Socio‑Economic Q&A: The ARAG Framework Merges Structured Data Analysis with RAG
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Nov 19, 2025 · Artificial Intelligence

How to Build a Reliable Dynamic Incremental RAG Pipeline for Real‑Time Data

This article explains why dynamic incremental RAG is harder than static RAG, identifies the three main points where recall accuracy breaks, and presents a three‑stage engineering pipeline—including a quality‑control layer, two‑stage retrieval, and reference‑injection generation—to keep real‑time data retrieval both accurate and robust.

AIDynamic DataRAG
0 likes · 13 min read
How to Build a Reliable Dynamic Incremental RAG Pipeline for Real‑Time Data
Alibaba Cloud Developer
Alibaba Cloud Developer
Nov 19, 2025 · Artificial Intelligence

Building an AI-Powered Proofreading Agent for Media: Architecture, Prompt Engineering, and Evaluation

This article details a practical case study of designing, implementing, and evaluating an AI-driven proofreading agent for a media client, covering background challenges, a three‑layer architecture, prompt engineering techniques, RAG knowledge‑base construction, model selection, fine‑tuning, automated metrics, and lessons learned.

AIModel EvaluationProofreading
0 likes · 26 min read
Building an AI-Powered Proofreading Agent for Media: Architecture, Prompt Engineering, and Evaluation
JakartaEE China Community
JakartaEE China Community
Nov 18, 2025 · Artificial Intelligence

How to Build a Retrieval‑Augmented Generation (RAG) System with Langchain4j and Ollama 3

This article explains why Retrieval‑Augmented Generation improves LLM accuracy, outlines the key Langchain4j and Ollama3 components, and provides a step‑by‑step Java example—including Maven setup, document ingestion, embedding, similarity search, prompt creation, and response generation—to demonstrate a functional RAG pipeline.

EmbeddingJavaLLM
0 likes · 8 min read
How to Build a Retrieval‑Augmented Generation (RAG) System with Langchain4j and Ollama 3
Alipay Experience Technology
Alipay Experience Technology
Nov 18, 2025 · Mobile Development

Boosting KMP Native Cross‑Platform Development with AI Agents: Real‑World Practices

This article details how Alipay's engineering team built an AI‑Agent‑powered coding assistant for Kotlin Multiplatform (KMP) native cross‑platform development, covering architecture, UI generation from designs and images, RAG‑based knowledge retrieval, crash analysis, and future directions for AI‑driven software engineering.

AI AgentCompose MultiplatformKMP
0 likes · 20 min read
Boosting KMP Native Cross‑Platform Development with AI Agents: Real‑World Practices
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Nov 16, 2025 · Artificial Intelligence

How to Slash RAG First‑Token Latency: Practical Engineering Strategies

This guide breaks down the three layers of a RAG pipeline—embedding, vector retrieval, and system architecture—and provides concrete engineering tactics such as batch embedding, async concurrency, caching, ANN indexing, partitioning, connection pooling, and async pipelines to dramatically reduce Time‑to‑First‑Token latency.

Async PipelineEmbeddingRAG
0 likes · 10 min read
How to Slash RAG First‑Token Latency: Practical Engineering Strategies
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Nov 14, 2025 · Artificial Intelligence

How to Engineer Reliable Function Calls for LLM Agents: An End‑to‑End Framework

This article explains why function‑call accuracy is critical for LLM agents, identifies four common failure causes, and presents a systematic, five‑step engineering framework—including dynamic routing, chain‑of‑thought planning, result validation, memory injection, and log‑driven optimization—backed by concrete examples and quantitative improvements.

Function CallingInterview PreparationLLM
0 likes · 10 min read
How to Engineer Reliable Function Calls for LLM Agents: An End‑to‑End Framework
DataFunTalk
DataFunTalk
Nov 11, 2025 · Artificial Intelligence

How Alibaba Cloud’s AI Search Redefines Vector Retrieval and RAG

This article outlines Alibaba Cloud AI Search’s evolution, detailing its dual product lines—enhanced Elasticsearch and self‑developed OpenSearch—key Agentic RAG technologies, serverless architecture, vector and LLM‑driven search capabilities, and future directions in AI‑powered search.

AI searchAlibaba CloudElasticsearch
0 likes · 4 min read
How Alibaba Cloud’s AI Search Redefines Vector Retrieval and RAG
DaTaobao Tech
DaTaobao Tech
Nov 10, 2025 · Artificial Intelligence

How Tmall’s AI Transforms Test Case Generation for Faster, Smarter QA

This article details Tmall's technology team's deep AI‑driven testing practice, outlining industry challenges, the need for intelligent test case generation, and a comprehensive strategy that combines prompt engineering, RAG‑based knowledge bases, and platform integration to boost coverage, reduce manual effort, and accelerate release cycles.

AI testingKnowledge BasePrompt engineering
0 likes · 10 min read
How Tmall’s AI Transforms Test Case Generation for Faster, Smarter QA
Data Party THU
Data Party THU
Nov 9, 2025 · Artificial Intelligence

Mastering Chunking Strategies for Effective RAG: Fixed, Recursive, Semantic, Structured, and Delayed

This article walks through the core RAG pipeline, explains why chunking is the linchpin of retrieval quality, and provides detailed definitions, trade‑offs, and implementation examples for five chunking techniques—fixed, recursive, semantic, structure‑aware, and delayed—so you can choose the right approach for any document‑heavy AI application.

AILLMRAG
0 likes · 10 min read
Mastering Chunking Strategies for Effective RAG: Fixed, Recursive, Semantic, Structured, and Delayed
DataFunSummit
DataFunSummit
Nov 8, 2025 · Artificial Intelligence

How Tencent’s LLM Powers Real‑World AI Solutions with RAG and Agents

This article examines Tencent's large language model deployments across diverse business scenarios, detailing core use cases such as content generation, intelligent customer service, and role‑playing, while deep‑diving into the RAG, GraphRAG, and Agent technologies that enable smarter, more reliable AI applications.

AIAgentLLM
0 likes · 4 min read
How Tencent’s LLM Powers Real‑World AI Solutions with RAG and Agents
DataFunSummit
DataFunSummit
Nov 7, 2025 · Artificial Intelligence

How Tencent’s LLM Powers Content Creation, Smart Service, and Game NPCs

This article examines Tencent’s large language model deployments across content generation, intelligent customer service, and game role‑playing, and explains the underlying technologies—Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and Agent systems—highlighting how they enhance performance, explainability, and multi‑step reasoning in real‑world business scenarios.

AIAgentLLM
0 likes · 4 min read
How Tencent’s LLM Powers Content Creation, Smart Service, and Game NPCs
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Nov 6, 2025 · Artificial Intelligence

How to Optimize RAG Knowledge Base Construction: Parsing, Chunking, and Retrieval

This article explains why building a high‑quality RAG knowledge base is critical, outlines offline parsing techniques for multi‑format documents, presents semantic chunking strategies that preserve structure and context, and shows how to answer interview questions with a robust, production‑ready pipeline.

AI InterviewKnowledge BaseRAG
0 likes · 8 min read
How to Optimize RAG Knowledge Base Construction: Parsing, Chunking, and Retrieval
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Nov 5, 2025 · Artificial Intelligence

Why Production-Ready RAG Is Ten Times Harder Than a Simple Demo

Building a Retrieval‑Augmented Generation (RAG) system may be straightforward in code, but making it reliable, accurate, and scalable in production involves challenges across data preparation, vector retrieval, query rewriting, generation control, and system integration, turning a demo into a truly useful AI service.

AILLMPrompt engineering
0 likes · 8 min read
Why Production-Ready RAG Is Ten Times Harder Than a Simple Demo
DataFunSummit
DataFunSummit
Nov 4, 2025 · Artificial Intelligence

How Tencent Leverages RAG, GraphRAG, and Agents to Power Large Language Model Applications

This article explores Tencent's large language model deployments across various business scenarios, detailing core use cases such as content generation, intelligent customer service, and role‑playing, and explains the underlying technologies—Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and Agent systems—that enable these applications.

AIAgentRAG
0 likes · 4 min read
How Tencent Leverages RAG, GraphRAG, and Agents to Power Large Language Model Applications
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Nov 4, 2025 · Artificial Intelligence

Why Financial RAG Fails and How to Solve Its Core Challenges

This article explains why Retrieval‑Augmented Generation (RAG) projects in the financial sector often underperform, highlighting data‑structure complexities, document‑parsing hurdles, chunking strategies, compliance constraints, evaluation metrics, and engineering requirements, and offers practical solutions and code examples.

EngineeringFinancial AIRAG
0 likes · 10 min read
Why Financial RAG Fails and How to Solve Its Core Challenges
dbaplus Community
dbaplus Community
Nov 3, 2025 · Artificial Intelligence

How RAG Turns Natural Language Queries into Accurate SQL for Data Platforms

This article explains how Retrieval‑Augmented Generation (RAG) combines vector databases with large language models to let non‑technical users ask natural‑language questions and receive precise SQL statements, detailing the workflow, architecture, chunking methods, performance gains, and remaining challenges.

Data PlatformLLMRAG
0 likes · 17 min read
How RAG Turns Natural Language Queries into Accurate SQL for Data Platforms
Instant Consumer Technology Team
Instant Consumer Technology Team
Nov 3, 2025 · Artificial Intelligence

Large Language Models Power Big Data SRE Knowledge & Root‑Cause Automation

Facing the growing complexity of big‑data platforms, the SRE team adopted large‑language‑model agents to automate knowledge management and root‑cause analysis, employing Retrieval‑Augmented Generation, a vector store, and the Model Context Protocol to enable intelligent, scalable, and efficient incident diagnosis and resolution.

AIMCPRAG
0 likes · 12 min read
Large Language Models Power Big Data SRE Knowledge & Root‑Cause Automation
DataFunSummit
DataFunSummit
Nov 3, 2025 · Artificial Intelligence

How Tencent’s LLM Powers Real‑World AI: From RAG to Agents

This article examines Tencent's large language model applications across diverse business scenarios, detailing core use cases such as content generation, intelligent customer service, and role‑playing, and explains the three key technologies—Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and Agents—that enable these capabilities.

AI applicationsAgentLLM
0 likes · 4 min read
How Tencent’s LLM Powers Real‑World AI: From RAG to Agents
Data Party THU
Data Party THU
Nov 1, 2025 · Artificial Intelligence

How to Blend Process‑Oriented and Agent‑Centric AI into a Hybrid Intelligent Pipeline

This article analyzes two contrasting AI agent design paradigms—process‑driven workflow orchestration and autonomous agent intelligence—examines their strengths and limitations, and proposes a hybrid architecture that fuses deterministic pipelines with dynamic planning, tool use, and memory mechanisms to achieve both reliability and adaptability.

AIAgentHybrid
0 likes · 15 min read
How to Blend Process‑Oriented and Agent‑Centric AI into a Hybrid Intelligent Pipeline
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Nov 1, 2025 · Artificial Intelligence

Turn a Basic RAG Demo into a High‑Impact Interview Project

This guide shows how to evolve a simple Retrieval‑Augmented Generation prototype into a production‑grade system by strengthening data ingestion, optimizing retrieval with hybrid and reranking techniques, adding query rewriting, long‑context handling, reinforcement learning, and multimodal support, so candidates can demonstrate real engineering depth in interviews.

AILLMRAG
0 likes · 7 min read
Turn a Basic RAG Demo into a High‑Impact Interview Project
BirdNest Tech Talk
BirdNest Tech Talk
Oct 30, 2025 · Artificial Intelligence

Master LangChain Chains with LCEL: From Simple Jokes to RAG and Agent Pipelines

This guide explains how LangChain’s Expression Language (LCEL) lets you declaratively connect prompts, models, and output parsers into chains, walks through environment setup, dependency installation, and detailed code examples ranging from a basic joke generator to retrieval‑augmented generation and memory‑enabled agents.

AgentLCELLangChain
0 likes · 5 min read
Master LangChain Chains with LCEL: From Simple Jokes to RAG and Agent Pipelines
Alibaba Cloud Developer
Alibaba Cloud Developer
Oct 30, 2025 · Artificial Intelligence

Why AI Agents Aren’t As Simple As They Appear: Engineering Challenges and Solutions

Building AI agents may seem straightforward with frameworks like LangChain, but hidden complexities in orchestration, memory management, reproducibility, and scalability turn simple demos into fragile systems, requiring systematic engineering, observability, and robust design to achieve reliable, production‑grade intelligent agents.

AI agentsAgent DesignLangChain
0 likes · 21 min read
Why AI Agents Aren’t As Simple As They Appear: Engineering Challenges and Solutions
DeWu Technology
DeWu Technology
Oct 29, 2025 · Artificial Intelligence

Why Chunking Can Make or Break Your RAG System – Practical Strategies & Code

This article explains how proper document chunking—choosing the right chunk size, overlap, and structure‑aware boundaries—directly impacts the relevance, factuality, and efficiency of Retrieval‑Augmented Generation pipelines, and provides multiple Python implementations ranging from simple fixed‑length splits to semantic and hybrid approaches.

EmbeddingLLMRAG
0 likes · 29 min read
Why Chunking Can Make or Break Your RAG System – Practical Strategies & Code
Bilibili Tech
Bilibili Tech
Oct 27, 2025 · Artificial Intelligence

How Bilibili’s LLM-Powered System Cuts Game Localization Costs by 80%

Bilibili’s game algorithm team built a four‑layer, LLM‑based translation platform that automates terminology extraction, retrieval‑augmented generation, and quality assessment, dramatically reducing localization cycles by over 85% and costs by up to 80% while supporting ten languages and ensuring consistent, culturally‑accurate game text.

LLMRAGgame localization
0 likes · 20 min read
How Bilibili’s LLM-Powered System Cuts Game Localization Costs by 80%
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Oct 27, 2025 · Artificial Intelligence

Designing Effective Generation Modules for RAG: Prompt Engineering, Multi‑Document Fusion, and Hallucination Control

This article explains how to design and optimize the generation module of Retrieval‑Augmented Generation systems by building robust prompts, merging multi‑source information, controlling answer formats, and applying post‑generation verification to reduce hallucinations and improve enterprise‑grade performance.

AIGeneration ModuleHallucination Control
0 likes · 9 min read
Designing Effective Generation Modules for RAG: Prompt Engineering, Multi‑Document Fusion, and Hallucination Control
BirdNest Tech Talk
BirdNest Tech Talk
Oct 27, 2025 · Artificial Intelligence

How LangChain’s Indexing API Enables Efficient Incremental Updates for RAG Systems

This article explains how LangChain's Indexing API adds state management and synchronization to the classic load‑split‑embed‑store RAG pipeline, detailing the RecordManager component, the index function workflow, key parameters, implementation considerations, and best‑practice code examples for production‑grade vector stores.

FAISSIndexing APILangChain
0 likes · 12 min read
How LangChain’s Indexing API Enables Efficient Incremental Updates for RAG Systems
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Oct 27, 2025 · Artificial Intelligence

Master AI Agents and MCP: A Complete 4‑Month Learning Roadmap

This article presents a structured, step‑by‑step learning path that guides beginners from Python fundamentals through AI API mastery, Retrieval‑Augmented Generation, deep MCP protocol knowledge, and advanced multi‑agent development, complete with practical code examples and performance‑monitoring techniques.

AI agentsLangChainMCP protocol
0 likes · 14 min read
Master AI Agents and MCP: A Complete 4‑Month Learning Roadmap
AI2ML AI to Machine Learning
AI2ML AI to Machine Learning
Oct 24, 2025 · Artificial Intelligence

Beyond RAG: Three Emerging Knowledge‑Engineering Strategies (ICL, Online Learning, SLM)

The article outlines three post‑RAG knowledge‑engineering approaches—In‑Context Learning with dynamic few‑shot selection, Online Learning encompassing Meta‑Learning and Lifelong Learning to quickly adapt to new tasks, and the Small Language Model path that combines fine‑tuned task‑specific experts with LLM‑SLM collaboration for efficient, privacy‑preserving inference.

In-Context LearningKnowledge EngineeringLLM
0 likes · 4 min read
Beyond RAG: Three Emerging Knowledge‑Engineering Strategies (ICL, Online Learning, SLM)
DataFunTalk
DataFunTalk
Oct 23, 2025 · Artificial Intelligence

How Tencent Leverages RAG and Agents to Supercharge Large Language Models

This article examines Tencent's large language model deployments across diverse business scenarios, detailing how Retrieval‑Augmented Generation, Supervised Fine‑Tuning, and autonomous agents boost model intelligence, reduce hallucinations, and enable sophisticated content creation, understanding, and interactive applications.

AI agentsRAGTencent
0 likes · 4 min read
How Tencent Leverages RAG and Agents to Supercharge Large Language Models
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Oct 23, 2025 · Artificial Intelligence

Boost Your RAG Bot’s Accuracy: Hybrid Search, Query Rewriting, and Re‑ranking Explained

This article walks developers through three essential upgrades for Retrieval‑Augmented Generation systems—hybrid search combining vector and keyword retrieval, query rewriting to clarify conversational inputs, and re‑ranking with a cross‑encoder—providing step‑by‑step code examples using LangChain to dramatically improve answer quality.

AIHybrid SearchLangChain
0 likes · 9 min read
Boost Your RAG Bot’s Accuracy: Hybrid Search, Query Rewriting, and Re‑ranking Explained
Xuanwu Backend Tech Stack
Xuanwu Backend Tech Stack
Oct 22, 2025 · Artificial Intelligence

How Rerank Transforms Retrieval‑Augmented Generation for Accurate AI Answers

This article explains the limitations of basic Retrieval‑Augmented Generation (RAG), introduces Rerank technology as a two‑step refinement process, compares dual‑encoder and cross‑encoder methods, and reviews popular Rerank models to help developers build more precise AI‑driven retrieval systems.

RAGRerankRetrieval Augmented Generation
0 likes · 10 min read
How Rerank Transforms Retrieval‑Augmented Generation for Accurate AI Answers
JD Tech Talk
JD Tech Talk
Oct 21, 2025 · Backend Development

How Backend Engineers Are Breaking Through AI with RAG Architectures

This article details a backend developer's two‑year AI journey, the challenges of rapid model advances, and how applying microservice principles to Retrieval‑Augmented Generation (RAG) creates a scalable, multi‑agent platform for insurance knowledge, memory, and intelligent agents.

Backend AIKnowledge BaseRAG
0 likes · 11 min read
How Backend Engineers Are Breaking Through AI with RAG Architectures
BirdNest Tech Talk
BirdNest Tech Talk
Oct 21, 2025 · Artificial Intelligence

How Vector Stores Enable Lightning‑Fast Semantic Search in LangChain

This article explains what vector stores are, outlines their core workflow of adding, querying, and searching embeddings, compares popular back‑ends like FAISS, Chroma, and Pinecone, and walks through a complete Chinese‑language example using LangChain’s FAISS integration with detailed code and result analysis.

AIFAISSLangChain
0 likes · 10 min read
How Vector Stores Enable Lightning‑Fast Semantic Search in LangChain
BirdNest Tech Talk
BirdNest Tech Talk
Oct 16, 2025 · Artificial Intelligence

Mastering Text Splitting in LangChain: From Theory to Code

This guide explains why large documents must be broken into semantic chunks for LLMs, introduces core parameters like chunk_size and chunk_overlap, compares LangChain's various splitters, and walks through a complete Python example that loads a long text, configures a RecursiveCharacterTextSplitter, and inspects the resulting chunks.

EmbeddingLangChainRAG
0 likes · 9 min read
Mastering Text Splitting in LangChain: From Theory to Code