Tagged articles
894 articles
Page 6 of 9
Ops Development Stories
Ops Development Stories
Jul 14, 2025 · Artificial Intelligence

Mastering AIOps: Prompt Engineering, Function Calling, RAG, Graph RAG, and Local LLM Deployment

This comprehensive guide explores AIOps techniques such as prompt engineering, chat completions, memory management, function calling, fine‑tuning, retrieval‑augmented generation (RAG), graph‑based RAG, and practical steps for deploying open‑source large language models locally, providing code examples and best‑practice recommendations for modern DevOps environments.

Function CallingGraph RAGRAG
0 likes · 47 min read
Mastering AIOps: Prompt Engineering, Function Calling, RAG, Graph RAG, and Local LLM Deployment
Tencent Technical Engineering
Tencent Technical Engineering
Jul 14, 2025 · Artificial Intelligence

Demystifying AIGC, Agents, and MCP: Core Concepts and How They Interact

This article provides a concise overview of the latest AI concepts—including AIGC, Retrieval‑Augmented Generation, Function‑Calling models, intelligent agents, and the Model Context Protocol—explaining their principles, differences, and how they can be combined to build more powerful AI applications for developers outside the AI field.

AIGCFunction CallingLLM
0 likes · 15 min read
Demystifying AIGC, Agents, and MCP: Core Concepts and How They Interact
DaTaobao Tech
DaTaobao Tech
Jul 14, 2025 · Artificial Intelligence

Mastering AI Application Modes: Embedding, Copilot, and Agents Explained

This article explores practical AI engineering strategies, detailing the three AI application modes—Embedding, Copilot, and Agents—along with prompt engineering, model selection, function calling, RAG, workflow design, and multi‑agent architectures to boost business efficiency and user experience.

AIModel EvaluationPrompt engineering
0 likes · 25 min read
Mastering AI Application Modes: Embedding, Copilot, and Agents Explained
Data Thinking Notes
Data Thinking Notes
Jul 13, 2025 · Artificial Intelligence

How to Build an Enterprise Knowledge Base with Dify: Full Setup Guide

This article walks developers through the entire process of deploying Dify locally, configuring model providers, creating and segmenting a knowledge base with RAG, choosing indexing methods, and integrating the knowledge base into a chatbot application, complete with code snippets and visual guides.

AI deploymentDifyKnowledge Base
0 likes · 11 min read
How to Build an Enterprise Knowledge Base with Dify: Full Setup Guide
Architecture and Beyond
Architecture and Beyond
Jul 12, 2025 · Artificial Intelligence

What Exactly Is an AI Agent? History, Architecture, and Future Challenges

This article traces the evolution of AI agents from early expert systems to modern large‑language‑model‑driven assistants, explains their core perception, reasoning, memory, and action modules, compares thinking and execution models, and discusses current limitations such as hallucinations, reliability, cost, and security.

AI AgentMemory ArchitecturePrompt engineering
0 likes · 20 min read
What Exactly Is an AI Agent? History, Architecture, and Future Challenges
Architect
Architect
Jul 11, 2025 · Artificial Intelligence

How OpenAI’s Zero‑Vector Agentic RAG Redefines AI Knowledge Retrieval

OpenAI’s new non‑vectorized Agentic RAG approach replaces traditional vector search with a hierarchical, multi‑round content selection process, leveraging large‑context models like GPT‑4.1‑mini for efficient document loading, dynamic navigation, and accurate answer generation, while outlining model selection strategies, cost trade‑offs, and production considerations.

AI ArchitectureModel SelectionRAG
0 likes · 15 min read
How OpenAI’s Zero‑Vector Agentic RAG Redefines AI Knowledge Retrieval
Sanyou's Java Diary
Sanyou's Java Diary
Jul 10, 2025 · Artificial Intelligence

Demystifying AIGC, Agents, RAG, and MCP: Core AI Concepts Explained

This article provides a concise overview of the latest AI breakthroughs—including AIGC, multimodal technology, Retrieval‑Augmented Generation (RAG), intelligent agents with function‑calling models, and the Model Context Protocol (MCP)—explaining their principles, relationships, and practical implications for developers outside the AI field.

AIAIGCFunction Calling
0 likes · 16 min read
Demystifying AIGC, Agents, RAG, and MCP: Core AI Concepts Explained
DataFunSummit
DataFunSummit
Jul 10, 2025 · Artificial Intelligence

How Large Language Model AI Agents Transform Intelligent Operations and On‑Call Support

This article details the design and implementation of a large‑model‑driven intelligent operations dialogue system, covering intent recognition, routing, multi‑agent planning, RAG, workflow, ReAct, reflection, tree‑search techniques, evaluation challenges, and future multi‑agent collaboration for on‑call support.

AI agentsIntelligent OperationsRAG
0 likes · 23 min read
How Large Language Model AI Agents Transform Intelligent Operations and On‑Call Support
Tencent Cloud Developer
Tencent Cloud Developer
Jul 10, 2025 · Artificial Intelligence

Demystifying AIGC, Agents, and MCP: Essential AI Concepts for Developers

This article provides a concise, developer‑focused overview of emerging AI concepts—including AIGC, multimodal models, Retrieval‑Augmented Generation, intelligent agents, Function‑Calling, and the Model Context Protocol (MCP)—explaining their core principles, differences, and how they interrelate to enable advanced AI applications.

AIAIGCFunction Calling
0 likes · 16 min read
Demystifying AIGC, Agents, and MCP: Essential AI Concepts for Developers
JD Tech Talk
JD Tech Talk
Jul 8, 2025 · Artificial Intelligence

How AI Can Turn a Code Maze into a Knowledge Highway for New Developers

New developer Li Ming’s frustrating onboarding experience highlights hidden business rules, undocumented code, and poor knowledge transfer, prompting him to build an AI‑driven knowledge base that links code changes, requirements, and operational docs, ultimately streamlining troubleshooting, accelerating feature development, and improving knowledge retention across teams.

AIRAGcode retrieval
0 likes · 18 min read
How AI Can Turn a Code Maze into a Knowledge Highway for New Developers
JD Cloud Developers
JD Cloud Developers
Jul 8, 2025 · Artificial Intelligence

How AI Can Turn a Code Maze into a Knowledge Hub for New Developers

This article follows a new developer named Li Ming as he confronts undocumented code, hidden business rules, and fragmented knowledge, then demonstrates how leveraging large‑language models to index, associate, and retrieve code, requirements, and operational data can create an intelligent knowledge base that streamlines onboarding, reduces errors, and enhances collaboration across development, testing, and product teams.

AIRAGsoftware development
0 likes · 19 min read
How AI Can Turn a Code Maze into a Knowledge Hub for New Developers
AI Algorithm Path
AI Algorithm Path
Jul 3, 2025 · Artificial Intelligence

Exploring Advanced, Graph, and Agentic RAG: The Evolution of Retrieval‑Augmented Generation

This article examines how Retrieval‑Augmented Generation (RAG) has progressed from simple keyword‑based retrieval to advanced semantic methods, modular architectures, graph‑enhanced reasoning, and autonomous agentic systems, highlighting each approach's workflow, benefits, limitations, and the shift toward dynamic AI decision‑making.

AIAgentic RAGGraph RAG
0 likes · 7 min read
Exploring Advanced, Graph, and Agentic RAG: The Evolution of Retrieval‑Augmented Generation
Instant Consumer Technology Team
Instant Consumer Technology Team
Jul 3, 2025 · Artificial Intelligence

Why Buying an AI Appliance Is a Strategic Pitfall for Enterprises

Enterprises rushing to purchase DeepSeek AI appliances and smart‑agent platforms often face hidden technical, data, and organizational challenges that turn promised "plug‑and‑play" solutions into costly missteps, highlighting the need for realistic strategy, robust data governance, and continuous capability building.

AI capability buildingAI deploymentData Governance
0 likes · 28 min read
Why Buying an AI Appliance Is a Strategic Pitfall for Enterprises
AI Large Model Application Practice
AI Large Model Application Practice
Jul 2, 2025 · Artificial Intelligence

Build a PPT‑Powered RAG Engine with Visual Models and MCP Server

This article explains how to construct a Retrieval‑Augmented Generation (RAG) pipeline for multi‑page PPT documents by converting slides to images, extracting content with a vision model, indexing with LlamaIndex and Chroma, and exposing the functionality through an MCP Server with tools for adding, querying, and managing PPTs.

LlamaIndexMCP ServerPPT
0 likes · 13 min read
Build a PPT‑Powered RAG Engine with Visual Models and MCP Server
Ops Development Stories
Ops Development Stories
Jul 1, 2025 · Artificial Intelligence

From Lean to AIOps: How AI is Transforming Modern Operations

This comprehensive guide walks through the evolution from Lean and Agile practices to DevOps and finally AIOps, explaining core concepts, key algorithms, the role of large language models, RAG‑based root‑cause analysis, and practical implementation steps for intelligent operations.

Large Language ModelsLeanRAG
0 likes · 19 min read
From Lean to AIOps: How AI is Transforming Modern Operations
DataFunTalk
DataFunTalk
Jun 29, 2025 · Artificial Intelligence

Large Models Boost Douyin User Experience: Expert Insights

In an interview at the DA Digital Intelligence Conference, ByteDance AI specialist Cai Conghuai explains how large language models, combined with techniques like SFT, DPO, and RAG, are reshaping Douyin's user‑experience signal detection, root‑cause analysis, and evaluation, while outlining future AI‑agent breakthroughs.

AIDPOLarge Language Models
0 likes · 12 min read
Large Models Boost Douyin User Experience: Expert Insights
Architect
Architect
Jun 28, 2025 · Artificial Intelligence

How MultiAgentPPT Generates Slides with AI Agents: Architecture and Code Walkthrough

This article examines the MultiAgentPPT project, detailing its multi‑agent workflow, the four core agents that generate outlines, split topics, conduct research, and summarize results, and explains how the system retrieves data via a WeChat crawler and constructs prompts for LLM‑driven PPT creation.

AI agentsMultiAgentPPTPPT generation
0 likes · 6 min read
How MultiAgentPPT Generates Slides with AI Agents: Architecture and Code Walkthrough
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jun 27, 2025 · Artificial Intelligence

Build a Powerful AI Search RAG Application with PAI‑LangStudio, Qwen3 & Elasticsearch

This guide walks you through using the PAI‑LangStudio platform together with the Qwen3 large language model and Elasticsearch to create a full‑stack AI Search RAG solution, covering prerequisites, step‑by‑step configuration of model services, database connections, runtimes, knowledge bases, workflow creation, testing, and deployment for production use.

AI searchElasticsearchPAI‑LangStudio
0 likes · 11 min read
Build a Powerful AI Search RAG Application with PAI‑LangStudio, Qwen3 & Elasticsearch
360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
Jun 27, 2025 · Operations

How AI‑Powered Ops‑Nexus Transforms Intelligent Operations for 100k+ Servers

This article details the design, technology choices, functional modules, core implementation, performance optimizations, and future roadmap of Ops‑Nexus, an AI‑driven intelligent operations platform that streamlines alarm analysis, log processing, and host health checks for large‑scale monitoring environments.

AI OpsIntelligent OperationsLLM
0 likes · 12 min read
How AI‑Powered Ops‑Nexus Transforms Intelligent Operations for 100k+ Servers
Su San Talks Tech
Su San Talks Tech
Jun 26, 2025 · Artificial Intelligence

Master Spring AI Alibaba 1.0: Upgrade Guide, New Features & Real‑World Code

This article walks you through what Spring AI Alibaba 1.0 offers, highlights its major updates such as the Graph multi‑agent framework and ecosystem integrations, and provides a step‑by‑step upgrade path with Maven dependency changes, code fixes, and configuration adjustments for Java developers.

AI FrameworkMCPMulti-Agent
0 likes · 20 min read
Master Spring AI Alibaba 1.0: Upgrade Guide, New Features & Real‑World Code
DeWu Technology
DeWu Technology
Jun 25, 2025 · Artificial Intelligence

Engineering Large Language Models with Spring AI: From Basics to RAG and Function Calls

This article walks through the fundamentals of large language models, their stateless and structured-output nature, explains how Spring‑AI provides a Java‑friendly API for model integration, covers RAG architecture, the MCP protocol, and demonstrates end‑to‑end code examples for building intelligent agents.

AI integrationFunction CallingLarge Language Models
0 likes · 15 min read
Engineering Large Language Models with Spring AI: From Basics to RAG and Function Calls
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Jun 24, 2025 · Artificial Intelligence

How Transformers and Mixture-of-Experts Power Large Language Models

This article explores the role of Transformers and Mixture‑of‑Experts in large models, outlines five fine‑tuning methods, compares traditional and agentic RAG, presents classic agent design patterns, text‑chunking strategies, levels of intelligent agent systems, and explains KV‑caching techniques.

Fine-tuningLarge Language ModelsMixture of Experts
0 likes · 2 min read
How Transformers and Mixture-of-Experts Power Large Language Models
Fun with Large Models
Fun with Large Models
Jun 23, 2025 · Artificial Intelligence

Boost RAG Answer Accuracy: Detailed Step‑by‑Step GraphRAG Knowledge‑Graph Construction

This article walks through the complete GraphRAG knowledge‑graph building pipeline—text splitting, entity extraction, relation mining, community clustering, and report generation—using a concrete example from the book “The Age of Big Data,” and explains why each step improves retrieval and answer quality.

GraphRAGRAGcommunity clustering
0 likes · 20 min read
Boost RAG Answer Accuracy: Detailed Step‑by‑Step GraphRAG Knowledge‑Graph Construction
Tech Freedom Circle
Tech Freedom Circle
Jun 21, 2025 · Artificial Intelligence

How MCP + LLM + Agent Architecture Becomes the AI Agent’s Neural Hub and New Infrastructure

The article explains the Model Context Protocol (MCP) as a zero‑code bridge that lets large language models seamlessly access databases, external APIs, and execute code, detailing its benefits for developers and everyday users, its core components, step‑by‑step workflow, real‑world examples, and how it outperforms traditional APIs in modern AI agent systems.

AI AgentLLMMCP
0 likes · 37 min read
How MCP + LLM + Agent Architecture Becomes the AI Agent’s Neural Hub and New Infrastructure
Data Thinking Notes
Data Thinking Notes
Jun 19, 2025 · Artificial Intelligence

Andrew Ng on Building Agentic AI Systems: Tools, MCP, and Practical Insights

In a candid conversation, Andrew Ng and Harrison Chase explore the evolving landscape of AI agents, discussing modular toolchains, the emerging MCP standard, challenges of agent‑to‑agent communication, voice interaction latency, and the importance of rapid, technically skilled execution for successful AI product development.

AI agentsLangChainMCP
0 likes · 19 min read
Andrew Ng on Building Agentic AI Systems: Tools, MCP, and Practical Insights
DataFunSummit
DataFunSummit
Jun 19, 2025 · Artificial Intelligence

How Large Models Are Revolutionizing Douyin’s User Experience – Expert Insights

In a detailed interview, ByteDance AI specialist Cai Conghuai explains how large‑model techniques such as SFT, DPO and RAG address Douyin’s multimodal user‑experience challenges, improve signal detection, root‑cause analysis, and outline future AI‑agent breakthroughs for content platforms.

AI AlgorithmsMultimodal LearningRAG
0 likes · 11 min read
How Large Models Are Revolutionizing Douyin’s User Experience – Expert Insights
Fun with Large Models
Fun with Large Models
Jun 19, 2025 · Artificial Intelligence

How GraphRAG Boosts Answer Accuracy with Knowledge Graphs (Part 1)

This article explains GraphRAG’s architecture, compares it with traditional RAG, and presents experimental results showing that GraphRAG’s knowledge‑graph‑driven retrieval markedly improves answer accuracy, especially on low‑match, multi‑paragraph queries.

GraphRAGLarge Language ModelsPerformance Evaluation
0 likes · 11 min read
How GraphRAG Boosts Answer Accuracy with Knowledge Graphs (Part 1)
Tencent Technical Engineering
Tencent Technical Engineering
Jun 16, 2025 · Artificial Intelligence

Mastering RAG and AI Agents: Practical Tips, Code Samples, and Evaluation Strategies

This comprehensive guide walks you through the fundamentals of Retrieval‑Augmented Generation (RAG) and AI agents, explains their inner workings, shares optimization tricks, provides ready‑to‑run code snippets, and demonstrates how to evaluate performance with metrics such as recall, faithfulness, and answer relevance.

AI agentsLLMPrompt engineering
0 likes · 36 min read
Mastering RAG and AI Agents: Practical Tips, Code Samples, and Evaluation Strategies
ITPUB
ITPUB
Jun 15, 2025 · Artificial Intelligence

How to Build a High‑Performance Enterprise RAG System with Model Context Protocol (MCP)

This article presents a step‑by‑step guide for constructing a scalable enterprise Retrieval‑Augmented Generation (RAG) solution using the Model Context Protocol (MCP), covering architecture comparison, system design, Milvus‑backed knowledge store, Python client implementation, deployment scripts, code examples, and best‑practice recommendations.

KnowledgeBaseLLMMCP
0 likes · 22 min read
How to Build a High‑Performance Enterprise RAG System with Model Context Protocol (MCP)
TAL Education Technology
TAL Education Technology
Jun 13, 2025 · Operations

How Large Language Models Are Revolutionizing Fault Localization

This article explores how the rapid rise of large language models and techniques like Retrieval‑Augmented Generation, Chain‑of‑Thought prompting, and multi‑agent architectures can dramatically improve the speed, accuracy, and automation of fault localization in modern operations environments.

Agent ArchitectureCoTFault Localization
0 likes · 14 min read
How Large Language Models Are Revolutionizing Fault Localization
Instant Consumer Technology Team
Instant Consumer Technology Team
Jun 12, 2025 · Artificial Intelligence

How to Build a Production-Ready RAG System with Qwen3 Embedding and Reranker Models

This guide walks through using Alibaba's new Qwen3-Embedding and Qwen3-Reranker models to build a two‑stage Retrieval‑Augmented Generation pipeline with Milvus, covering environment setup, data ingestion, vector indexing, reranking, and LLM‑driven answer generation, demonstrating production‑grade performance across multilingual queries.

EmbeddingLLMMilvus
0 likes · 19 min read
How to Build a Production-Ready RAG System with Qwen3 Embedding and Reranker Models
DataFunSummit
DataFunSummit
Jun 12, 2025 · Artificial Intelligence

How Alibaba Cloud’s AI Search Evolves with Agentic RAG and Multi‑Model Innovations

This article details Alibaba Cloud AI Search’s development journey, covering its dual product lines, the evolution of Agentic RAG technology, multi‑agent architectures, vector retrieval breakthroughs, GPU‑accelerated indexing, NL2SQL capabilities, deployment models, and future directions for AI‑driven search solutions.

AI searchGPU AccelerationOpenSearch
0 likes · 33 min read
How Alibaba Cloud’s AI Search Evolves with Agentic RAG and Multi‑Model Innovations
Zuoyebang Tech Team
Zuoyebang Tech Team
Jun 12, 2025 · Information Security

How AI‑Powered RAG and Agents Are Revolutionizing Enterprise Security Operations

This article explains how the rise of AI large‑model technology and Retrieval‑Augmented Generation (RAG) combined with autonomous AI agents enable a three‑layer network‑boundary defense, address deep operational challenges such as alert overload and response latency, and dramatically improve incident‑response efficiency in large‑scale enterprises.

AI agentsAI securityLarge Language Models
0 likes · 16 min read
How AI‑Powered RAG and Agents Are Revolutionizing Enterprise Security Operations
Data Thinking Notes
Data Thinking Notes
Jun 11, 2025 · Artificial Intelligence

How RAG‑Powered AI Boosted Government Data Labeling Efficiency by 5×

This case study details how a government‑focused AI system using retrieval‑augmented generation (RAG) and advanced preprocessing algorithms increased data labeling speed by up to five times, raised accuracy above 95%, and produced high‑quality enterprise, spatial, and economic datasets.

AIGovernmentRAG
0 likes · 5 min read
How RAG‑Powered AI Boosted Government Data Labeling Efficiency by 5×
Sohu Tech Products
Sohu Tech Products
Jun 11, 2025 · Artificial Intelligence

How DeepSeek and TiDB AI Are Redefining Data Engines for the Large‑Model Era

This article explores DeepSeek's open‑source large‑model breakthroughs, PingCAP's AI‑enhanced database roadmap, TiDB.AI's retrieval‑augmented generation framework, the unified TiDB data engine, and practical Q&A insights on knowledge‑graph construction, vector search, and AI‑driven SQL generation.

AIDeepSeekRAG
0 likes · 15 min read
How DeepSeek and TiDB AI Are Redefining Data Engines for the Large‑Model Era
Alibaba Cloud Developer
Alibaba Cloud Developer
Jun 10, 2025 · Artificial Intelligence

How AI Application Architectures Evolve: From Simple LLM Calls to Guardrails, Routing, and Agents

This article traces the evolution of AI application architectures—from the earliest minimal user‑LLM interaction to advanced designs featuring context enhancement, input/output guardrails, intent routing, model gateways, caching strategies, agent capabilities, monitoring, and inference performance optimizations—providing practical insights and references for developers.

AI ArchitectureInference OptimizationLLM
0 likes · 21 min read
How AI Application Architectures Evolve: From Simple LLM Calls to Guardrails, Routing, and Agents
Data Thinking Notes
Data Thinking Notes
Jun 8, 2025 · Artificial Intelligence

Explore the Complete AI Large Model Technology Landscape: Architecture Diagrams Across Industries

This article presents a panoramic view of AI large‑model technologies, showcasing a series of architecture diagrams that illustrate general model frameworks, RAG knowledge‑base structures, agricultural and retail applications, IoT integration, compliance and risk‑management setups, agent platforms, and CRM‑enhanced solutions.

AIRAGarchitecture
0 likes · 3 min read
Explore the Complete AI Large Model Technology Landscape: Architecture Diagrams Across Industries
Qborfy AI
Qborfy AI
Jun 7, 2025 · Artificial Intelligence

Build a Retrieval‑Augmented Generation (RAG) Chatbot with LangChain and Streamlit

This guide walks through the complete process of creating a RAG‑powered question‑answering bot using LangChain, Streamlit, and vector‑store embeddings, covering theory, architecture, data loading, chunking, vector indexing, retrieval, LLM integration, and full code implementation with practical examples.

ChatbotLangChainPython
0 likes · 13 min read
Build a Retrieval‑Augmented Generation (RAG) Chatbot with LangChain and Streamlit
AI2ML AI to Machine Learning
AI2ML AI to Machine Learning
Jun 6, 2025 · Artificial Intelligence

Tackling the Top Challenges of Retrieval‑Augmented Generation (RAG)

The article enumerates common pitfalls of Retrieval‑Augmented Generation—such as missing content, low‑rank document misses, context limits, format errors, incomplete answers, scalability bottlenecks, complex PDF extraction, data‑quality issues, domain adaptation gaps, hallucinations, and feedback‑loop deficiencies—and offers concrete mitigation strategies ranging from data cleaning and prompt design to hybrid search, hierarchical retrieval, document compression, and automated evaluation.

Data QualityHybrid SearchLLM
0 likes · 9 min read
Tackling the Top Challenges of Retrieval‑Augmented Generation (RAG)
IT Services Circle
IT Services Circle
Jun 6, 2025 · Artificial Intelligence

Master Retrieval‑Augmented Generation (RAG): From Basics to Advanced Practices

This article introduces Retrieval‑Augmented Generation (RAG), explains its core components—knowledge embedding, retriever, and generator—covers practical system construction, optimization techniques, evaluation metrics, and advanced paradigms such as GraphRAG and Multi‑Modal RAG, while highlighting a comprehensive guidebook for hands‑on implementation.

AIKnowledge RetrievalRAG
0 likes · 12 min read
Master Retrieval‑Augmented Generation (RAG): From Basics to Advanced Practices
Didi Tech
Didi Tech
Jun 5, 2025 · Artificial Intelligence

Unlocking Modern AI Application Architecture: From RAG to Agents and MCP

This article surveys the evolution of AI applications, explains large language model fundamentals, outlines architectural challenges, and introduces three core patterns—Retrieval‑Augmented Generation (RAG), autonomous Agents, and Model Context Protocol (MCP)—while providing practical LangChain code snippets and integration guidance.

AILLMLangChain
0 likes · 28 min read
Unlocking Modern AI Application Architecture: From RAG to Agents and MCP
Fighter's World
Fighter's World
Jun 2, 2025 · Artificial Intelligence

Why Is Context King for Large Language Models?

This article provides a comprehensive technical analysis of LLM context, covering its definition, types, tokenization, window‑size evolution, diminishing returns, management techniques such as RAG, CoT, memory‑as‑a‑service, and future challenges like multimodal fusion, privacy, and autonomous agent memory.

Agent MemoryLLMMemory-as-a-Service
0 likes · 48 min read
Why Is Context King for Large Language Models?
dbaplus Community
dbaplus Community
May 31, 2025 · Artificial Intelligence

How RAG is Shaping the Future of AI-Powered User Experience

Amid the rapid rise of large language models, this article examines RAG’s development, technical hurdles, core strategies, and future outlook, illustrating how Alibaba’s Chatbot and Copilot projects boost retrieval accuracy to 90% and generation precision to 85% while tackling data quality, heterogeneous retrieval, and evaluation challenges.

AI searchEvaluation MetricsRAG
0 likes · 27 min read
How RAG is Shaping the Future of AI-Powered User Experience
ITFLY8 Architecture Home
ITFLY8 Architecture Home
May 30, 2025 · Artificial Intelligence

Explore the Full Spectrum of AI Large Model Architectures

This article presents a comprehensive visual collection of AI large‑model architecture diagrams, covering general frameworks, RAG knowledge‑base systems, agriculture, e‑commerce recommendation, IoT, compliance risk management, agent platforms, and CRM integration, offering a panoramic view of modern AI infrastructure.

AIIoTRAG
0 likes · 3 min read
Explore the Full Spectrum of AI Large Model Architectures
Instant Consumer Technology Team
Instant Consumer Technology Team
May 29, 2025 · Artificial Intelligence

API vs GUI Agents: How to Choose the Right LLM Automation Approach

This article examines the evolution of large language model agents, contrasting API‑based agents that use predefined function calls with GUI‑based agents that interact with visual interfaces, and explores hybrid strategies, orchestration tools, RAG techniques, and practical guidelines for selecting the optimal paradigm.

API vs GUIHybrid automationLLM agents
0 likes · 34 min read
API vs GUI Agents: How to Choose the Right LLM Automation Approach
DevOps
DevOps
May 28, 2025 · Artificial Intelligence

Google Proposes a “Sufficient Context” Framework to Strengthen Enterprise Retrieval‑Augmented Generation Systems

Google researchers introduce a “sufficient context” framework that classifies retrieved passages as adequate or inadequate for answering a query, enabling large language models in enterprise RAG systems to decide when to answer, refuse, or request more information, thereby improving accuracy and reducing hallucinations.

AI reliabilityEnterprise AILarge Language Models
0 likes · 9 min read
Google Proposes a “Sufficient Context” Framework to Strengthen Enterprise Retrieval‑Augmented Generation Systems
Sohu Tech Products
Sohu Tech Products
May 28, 2025 · Artificial Intelligence

Introducing AIFlowy: An Open‑Source Java‑Based One‑Stop AI Application Development Platform

AIFlowy is a Java‑powered, open‑source, enterprise‑grade AI platform that offers a bot for natural‑language interaction, extensible plugins, a knowledge‑base with RAG support, and visual workflow automation, enabling developers to quickly build and customize AI applications for domestic B2B scenarios.

AIBotKnowledge Base
0 likes · 10 min read
Introducing AIFlowy: An Open‑Source Java‑Based One‑Stop AI Application Development Platform
phodal
phodal
May 28, 2025 · Artificial Intelligence

Boost Code Retrieval with AutoDev’s Pre‑Generated Context Worker

The article explains how AutoDev’s Context Worker pre‑generates semantic code context to improve RAG performance, outlines the limitations of vector‑based retrieval, describes the tool’s multi‑language AST analysis, knowledge‑graph construction, and provides command‑line usage examples for integrating the generated context into AI‑driven development workflows.

AIASTCLI
0 likes · 8 min read
Boost Code Retrieval with AutoDev’s Pre‑Generated Context Worker
Coder Circle
Coder Circle
May 28, 2025 · Artificial Intelligence

Core AI Concepts Every Spring AI Developer Should Know

This article explains fundamental AI concepts—including models, prompts, prompt templates, embeddings, tokens, structured output, data integration, RAG, and tool calling—and shows how Spring AI simplifies their use for Java developers building intelligent applications.

AI modelsPrompt engineeringRAG
0 likes · 13 min read
Core AI Concepts Every Spring AI Developer Should Know
Programmer DD
Programmer DD
May 21, 2025 · Artificial Intelligence

What’s New in Spring AI 1.0 GA? A Deep Dive into Java AI Features

Spring AI 1.0 GA introduces a comprehensive suite of AI capabilities for Java developers, including a ChatClient supporting 20 models, vector‑store integrations, RAG pipelines, advanced chat memory, @Tool function calling, model evaluation, observability, Model Context Protocol, and autonomous agents, with examples for major cloud providers.

AI modelsMCPRAG
0 likes · 6 min read
What’s New in Spring AI 1.0 GA? A Deep Dive into Java AI Features
Java Architecture Diary
Java Architecture Diary
May 21, 2025 · Artificial Intelligence

Spring AI 1.0 Launch: Production‑Ready Java AI Framework Unveiled

Spring AI 1.0, the first production‑grade Java AI framework, introduces ready‑to‑use APIs, seamless model integration, enterprise‑level RAG engine, smart tool calling, and three development modes, empowering developers to rapidly build, customize, and fully control AI applications with major model providers like OpenAI, Anthropic, DeepSeek.

AI FrameworkDeepSeekJava AI
0 likes · 13 min read
Spring AI 1.0 Launch: Production‑Ready Java AI Framework Unveiled
DeWu Technology
DeWu Technology
May 19, 2025 · Artificial Intelligence

AI-Powered Automated Test Case Generation: Design, Implementation, and Future Plans

This article presents a comprehensive AI-driven solution for automatically generating functional test cases, detailing the AI background, design scheme, core components such as PRD parsing, test‑point generation, test‑case creation, knowledge‑base construction, implementation results, and future development directions.

AIKnowledge BaseLLM
0 likes · 7 min read
AI-Powered Automated Test Case Generation: Design, Implementation, and Future Plans
Tencent Technical Engineering
Tencent Technical Engineering
May 19, 2025 · Artificial Intelligence

RAG, Agents, and Multimodal Large Models: Evolution, Challenges, and Future Trends

This article examines the evolution of large model technologies—including Retrieval‑Augmented Generation, AI agents, and multimodal models—detailing their technical foundations, practical challenges, industry applications, and future development trends, offering a comprehensive perspective for AI practitioners and researchers.

AI AgentKnowledge RetrievalMultimodal
0 likes · 14 min read
RAG, Agents, and Multimodal Large Models: Evolution, Challenges, and Future Trends
DataFunSummit
DataFunSummit
May 13, 2025 · Artificial Intelligence

Integrating Large Language Models and Knowledge Graphs for Financial Applications: Challenges, Solutions, and Future Directions

This talk explores the technical challenges of applying large language models and knowledge graphs in finance, discusses solutions such as RAG enhancements, graph‑guided retrieval, multimodal extensions, and presents future research directions including multimodal graph integration, agentic systems, and decision‑making applications.

AIMultimodalRAG
0 likes · 33 min read
Integrating Large Language Models and Knowledge Graphs for Financial Applications: Challenges, Solutions, and Future Directions
DataFunSummit
DataFunSummit
May 9, 2025 · Artificial Intelligence

Practical Experience Building Zhihu Direct Answer: An AI‑Powered Search Product

This article presents a comprehensive overview of Zhihu Direct Answer, describing its AI‑driven search architecture, RAG framework, query understanding, retrieval, chunking, reranking, generation, evaluation mechanisms, engineering optimizations, and the professional edition, while sharing concrete performance‑boosting practices and future development plans.

AIGenerationProduct Development
0 likes · 14 min read
Practical Experience Building Zhihu Direct Answer: An AI‑Powered Search Product
Alibaba Cloud Native
Alibaba Cloud Native
May 9, 2025 · Artificial Intelligence

Build a Retrieval‑Augmented Generation (RAG) App with LangChain, Higress, and Elasticsearch

This tutorial walks through building a Retrieval‑Augmented Generation (RAG) system by combining LangChain for document processing, Elasticsearch’s vector store with the ELSER v2 model for semantic search, and Higress as a cloud‑native AI gateway, complete with deployment scripts, code examples, and query testing.

AIHigressLangChain
0 likes · 15 min read
Build a Retrieval‑Augmented Generation (RAG) App with LangChain, Higress, and Elasticsearch
phodal
phodal
May 9, 2025 · Artificial Intelligence

Why Pre‑Generated Context Is the Key to Faster, More Accurate AI Code Retrieval

The article examines how pre‑generating structured context for codebases can overcome the uncertainty and quality issues of traditional Retrieval‑Augmented Generation, outlines the technical and business challenges of RAG, compares existing code‑search tools, and introduces AutoDev’s Context Worker as a practical solution.

AILLMRAG
0 likes · 11 min read
Why Pre‑Generated Context Is the Key to Faster, More Accurate AI Code Retrieval
Youzan Coder
Youzan Coder
May 8, 2025 · Artificial Intelligence

Building and Optimizing a Store Smart Assistant with Aily: Architecture, Workflow, and Practical Lessons

The article details how Youzan’s Store Smart Assistant was built on the Feishu Aily platform, describing why Aily was chosen, the three‑stage development process, deep system integration, practical tips for knowledge‑base management and model stability, and the resulting efficiency gains such as handling 80% of routine queries.

AI AssistantAily platformKnowledge Base
0 likes · 24 min read
Building and Optimizing a Store Smart Assistant with Aily: Architecture, Workflow, and Practical Lessons
ITPUB
ITPUB
Apr 28, 2025 · Artificial Intelligence

How Large Language Models are Transforming Automotive Operations and Optimization

In this interview, an automotive industry expert explains how large language models and advanced operations‑optimization techniques are reshaping vehicle design, production planning, logistics, and customer services, while also discussing implementation challenges, team requirements, and future AI‑driven opportunities.

AI adoptionAutomotive AILarge Language Models
0 likes · 15 min read
How Large Language Models are Transforming Automotive Operations and Optimization
DevOps
DevOps
Apr 27, 2025 · Artificial Intelligence

Large Model Technologies: RAG, AI Agents, Multimodal Applications, and Future Trends

This article examines how Retrieval‑Augmented Generation (RAG), AI agents, and multimodal large‑model techniques are reshaping AI‑industry integration, discusses their technical challenges and practical implementations, and outlines future development directions across algorithms, products, and domain‑specific applications.

AI agentsArtificial IntelligenceMultimodal
0 likes · 14 min read
Large Model Technologies: RAG, AI Agents, Multimodal Applications, and Future Trends
Fun with Large Models
Fun with Large Models
Apr 25, 2025 · Artificial Intelligence

Why Your RAG System Underperforms and How to Boost Its Effectiveness by 20%

This article analyzes common shortcomings of RAG pipelines—data preparation, retrieval, and LLM generation—and provides concrete optimization techniques such as advanced chunking, embedding model selection, retrieval parameter tuning, rerank models, and prompt engineering, promising up to a 20% performance gain.

EmbeddingPrompt engineeringRAG
0 likes · 17 min read
Why Your RAG System Underperforms and How to Boost Its Effectiveness by 20%
DataFunTalk
DataFunTalk
Apr 24, 2025 · Artificial Intelligence

Is Retrieval‑Augmented Generation (RAG) Dead Yet?

This article explains the original purpose of Retrieval‑Augmented Generation, why it remains essential despite advances in large‑context LLMs, and how combining RAG with fine‑tuning, longer context windows, and model‑context protocols yields more scalable, accurate, and privacy‑preserving AI systems.

AIContext WindowKnowledge Retrieval
0 likes · 9 min read
Is Retrieval‑Augmented Generation (RAG) Dead Yet?
Tencent Cloud Developer
Tencent Cloud Developer
Apr 24, 2025 · Industry Insights

How RAG, AI Agents, and Multimodal Models Are Reshaping Industry – Trends, Challenges, and Real‑World Cases

The article analyzes the rapid evolution of large‑model technologies—Retrieval‑Augmented Generation, autonomous agents, and multimodal AI—detailing their technical foundations, practical challenges, industry applications such as unified multimodal tasks, open‑world detection, and video moderation, and forecasting future development directions.

AI agentsMultimodal AIRAG
0 likes · 15 min read
How RAG, AI Agents, and Multimodal Models Are Reshaping Industry – Trends, Challenges, and Real‑World Cases
Big Data Technology & Architecture
Big Data Technology & Architecture
Apr 22, 2025 · Artificial Intelligence

Introduction to Retrieval‑Augmented Generation (RAG) and Vector Indexing with StarRocks and DeepSeek

This article explains the fundamentals of Retrieval‑Augmented Generation, demonstrates how to create and query vector indexes using StarRocks, shows how DeepSeek provides embeddings and answer generation, and walks through a complete end‑to‑end RAG pipeline with code examples and a web UI.

AIDeepSeekEmbedding
0 likes · 20 min read
Introduction to Retrieval‑Augmented Generation (RAG) and Vector Indexing with StarRocks and DeepSeek
DevOps
DevOps
Apr 20, 2025 · Artificial Intelligence

Building a Medical Knowledge Base with RAG: A Step‑by‑Step Example

This article demonstrates how to construct an AI‑powered medical knowledge base for diabetes treatment by preprocessing literature, performing semantic chunking, generating BioBERT embeddings, storing them in a FAISS vector database, and using a RAG framework together with a knowledge graph to retrieve and generate accurate answers.

BioBERTFAISSRAG
0 likes · 12 min read
Building a Medical Knowledge Base with RAG: A Step‑by‑Step Example
DaTaobao Tech
DaTaobao Tech
Apr 18, 2025 · Frontend Development

How AI Is Transforming Frontend Development: From Design‑to‑Code to Automated Testing

This article explores how AI-driven tools are reshaping frontend engineering by automating design‑to‑code conversion, interface‑to‑data‑model mapping, private component integration, code fitting, AI code review, and automated test regression, and it evaluates their impact on efficiency and future development workflows.

AICodeGenerationRAG
0 likes · 37 min read
How AI Is Transforming Frontend Development: From Design‑to‑Code to Automated Testing
Fun with Large Models
Fun with Large Models
Apr 18, 2025 · Artificial Intelligence

How RAG Works: From Data Prep to LLM Generation Explained

This article breaks down Retrieval‑Augmented Generation (RAG) into its three core stages—data preparation, data retrieval, and LLM generation—showing how document chunking, embedding, vector databases, similarity search, and optional re‑ranking combine to let large language models produce more accurate, knowledge‑grounded answers.

EmbeddingLLMRAG
0 likes · 9 min read
How RAG Works: From Data Prep to LLM Generation Explained
Data Thinking Notes
Data Thinking Notes
Apr 17, 2025 · Artificial Intelligence

How Dify Accelerates Generative AI App Development with Low‑Code and Modular Design

Dify is an open‑source LLM application platform that blends BaaS and LLMOps, offering low‑code development, modular components, extensive model support, and advanced retrieval features, while also detailing its current limitations and recent enhancements such as MySQL integration and Elasticsearch‑based RAG capabilities.

AIElasticsearchLLM
0 likes · 7 min read
How Dify Accelerates Generative AI App Development with Low‑Code and Modular Design
Spring Full-Stack Practical Cases
Spring Full-Stack Practical Cases
Apr 10, 2025 · Artificial Intelligence

Build a RAG-Powered Knowledge Base with Spring Boot, Milvus, and Ollama

This guide walks through creating a Retrieval‑Augmented Generation (RAG) system using Spring Boot 3.4.2, Milvus vector database, and the bge‑m3 embedding model via Ollama, covering environment setup, dependency configuration, vector store operations, and integration with a large language model to deliver refined, similarity‑based answers.

EmbeddingLLMMilvus
0 likes · 11 min read
Build a RAG-Powered Knowledge Base with Spring Boot, Milvus, and Ollama
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Apr 10, 2025 · Artificial Intelligence

Building a Pet Hospital AI Assistant with RAG and LLMs

This article walks through the motivation, core concepts of Retrieval‑Augmented Generation, and a step‑by‑step guide to constructing a pet‑hospital AI assistant on Alibaba Cloud using LLMs, vector databases, and automated pipelines, complete with code examples and practical tips.

AI AssistantAlibaba CloudLLM
0 likes · 18 min read
Building a Pet Hospital AI Assistant with RAG and LLMs
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Apr 8, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Boost AI Accuracy?

This article explains Retrieval‑Augmented Generation (RAG), its three‑step workflow of retrieval, augmentation, and generation, its key advantages such as improved accuracy and explainability, and compares RAG with traditional pre‑trained models, fine‑tuned models, hybrid models, knowledge‑distillation methods, and RLHF, while also covering vector, full‑text, and hybrid retrieval modes and the role of rerank models.

AIKnowledge RetrievalRAG
0 likes · 18 min read
What Is Retrieval‑Augmented Generation (RAG) and How Does It Boost AI Accuracy?
Alibaba Cloud Developer
Alibaba Cloud Developer
Apr 8, 2025 · Artificial Intelligence

Unlocking LLM Secrets: From Prompt Basics to RAG and Tool Integration

This article introduces the fundamental paradigms of large language models, explaining how simple prompts, messages, and tools like RAG and ReAct enable powerful applications, while providing practical code examples, translation strategies, and insights on prompt engineering, tool usage, and model fine‑tuning.

AILLM applicationsLarge Language Models
0 likes · 23 min read
Unlocking LLM Secrets: From Prompt Basics to RAG and Tool Integration
dbaplus Community
dbaplus Community
Apr 7, 2025 · Databases

How Do LLMs Tackle Oracle Bad Block Errors? A Hands‑On Evaluation

This article presents a hands‑on evaluation of several large language models—including Mistral‑Small, Deepseek‑r1, Llama 3.3 and ChatGPT‑4‑go—on Oracle database bad‑block errors, RAG‑based document retrieval, and log‑driven reasoning, revealing performance gaps, scoring results, and practical DBA implications.

AILLM evaluationOracle
0 likes · 11 min read
How Do LLMs Tackle Oracle Bad Block Errors? A Hands‑On Evaluation
Beijing SF i-TECH City Technology Team
Beijing SF i-TECH City Technology Team
Apr 7, 2025 · Artificial Intelligence

LLM Application in Text Information Detection and Extraction: A Case Study of Blue-Collar Recruitment Data Processing

This article explores the application of Large Language Models (LLM) in text information detection and extraction, focusing on blue-collar recruitment data processing. It details the implementation of LLM through prompt engineering, RAG enhancement, and model fine-tuning to improve data cleaning efficiency and accuracy.

AI applicationsLLMPrompt engineering
0 likes · 31 min read
LLM Application in Text Information Detection and Extraction: A Case Study of Blue-Collar Recruitment Data Processing
DataFunSummit
DataFunSummit
Apr 7, 2025 · Artificial Intelligence

Bridging the Gap Between Large Models and Real‑World Applications with RAG and Agents

This article examines how Retrieval‑Augmented Generation (RAG) and multi‑agent technologies narrow the gap between large language models and practical deployment, highlighting their roles in operations automation, financial risk control, intelligent data governance, database localization, edge inference, and future AI‑driven solutions.

Data GovernanceLarge Language ModelsOperations Automation
0 likes · 8 min read
Bridging the Gap Between Large Models and Real‑World Applications with RAG and Agents
JD Cloud Developers
JD Cloud Developers
Apr 7, 2025 · Artificial Intelligence

Why Bigger Prompts Fail: Modular Strategies for Building Efficient AI Agents

This article explains why overloading prompts and tools harms AI‑Agent performance, and offers practical modular design, intent‑driven instruction splitting, and efficient context management strategies such as curated function‑call tools and dynamic RAG to reduce token costs, improve response speed, and avoid hallucinations.

AI AgentLLMPrompt engineering
0 likes · 13 min read
Why Bigger Prompts Fail: Modular Strategies for Building Efficient AI Agents
Big Data Technology & Architecture
Big Data Technology & Architecture
Apr 3, 2025 · Artificial Intelligence

Understanding Model Context Protocol (MCP), Retrieval-Augmented Generation (RAG), and Vector Databases for LLM Integration

This article explains the Model Context Protocol (MCP) as a standard for LLM‑data integration, describes Retrieval‑Augmented Generation (RAG) techniques to reduce hallucinations, and introduces vector databases like Milvus that store high‑dimensional embeddings for efficient AI retrieval tasks.

LLMMCPMilvus
0 likes · 7 min read
Understanding Model Context Protocol (MCP), Retrieval-Augmented Generation (RAG), and Vector Databases for LLM Integration
DevOps
DevOps
Apr 2, 2025 · Artificial Intelligence

Understanding Retrieval-Augmented Generation (RAG): Concepts, Evolution, and Types

This article explains Retrieval‑Augmented Generation (RAG), its role in mitigating large language model knowledge cutoff and hallucination, outlines the evolution from naive to advanced, modular, graph, and agentic RAG, and discusses future directions such as intelligent and multi‑modal RAG systems.

Artificial IntelligenceKnowledge RetrievalLLM
0 likes · 10 min read
Understanding Retrieval-Augmented Generation (RAG): Concepts, Evolution, and Types
AntTech
AntTech
Apr 2, 2025 · Artificial Intelligence

PEAR: Position-Embedding-Agnostic Attention Re-weighting Enhances Retrieval-Augmented Generation with Zero Inference Overhead

The PEAR framework introduces a position‑embedding‑agnostic attention re‑weighting method that detects and suppresses detrimental attention heads in large language models, dramatically improving retrieval‑augmented generation performance without adding any inference overhead, as demonstrated on multiple RAG benchmarks and LLM families.

Attention Re-weightingLLMPEAR
0 likes · 6 min read
PEAR: Position-Embedding-Agnostic Attention Re-weighting Enhances Retrieval-Augmented Generation with Zero Inference Overhead
Tencent Cloud Developer
Tencent Cloud Developer
Apr 2, 2025 · Artificial Intelligence

Understanding Retrieval‑Augmented Generation (RAG): Concepts, Types, and Development

Retrieval‑Augmented Generation (RAG) enhances large language models by fetching up‑to‑date external knowledge before generation, mitigating knowledge‑cutoff limits and hallucinations through a retrieval step (using text, vector, or graph methods) and a generation step, evolving from naive single‑method approaches to advanced, modular, graph‑based, and agentic systems that enable adaptive, multi‑hop reasoning and future intelligent, multimodal pipelines.

AIAgentic AIKnowledge Retrieval
0 likes · 9 min read
Understanding Retrieval‑Augmented Generation (RAG): Concepts, Types, and Development
Architect
Architect
Apr 1, 2025 · Artificial Intelligence

When to Fine‑Tune Large Language Models vs. Relying on Prompting and RAG

The article explains why most projects should start with prompt engineering or simple agent workflows, outlines the scenarios where model fine‑tuning adds real value, compares fine‑tuning with Retrieval‑Augmented Generation, and offers practical criteria for deciding which approach to adopt.

AI deploymentLarge Language ModelsLoRA
0 likes · 9 min read
When to Fine‑Tune Large Language Models vs. Relying on Prompting and RAG
Architect
Architect
Mar 30, 2025 · Artificial Intelligence

What Is Retrieval-Augmented Generation? A Deep Dive into RAG Techniques

This article provides a comprehensive survey of Retrieval‑Augmented Generation (RAG), covering its basic principles, key components, seven technical variants, challenges, evaluation methods, and future research directions across multimodal, graph‑based, and agentic extensions.

AI SurveyKnowledge RetrievalLarge Language Models
0 likes · 9 min read
What Is Retrieval-Augmented Generation? A Deep Dive into RAG Techniques