Tagged articles

894 articles

Page 6 of 9

Jul 15, 2025 · Artificial Intelligence

Getting Started with LangChain & LangGraph: Core Concepts of AI Agents

This article introduces AI Agents and explains why LangChain is the leading framework, detailing its core concepts, three‑layer architecture, key features, comparison with other agent frameworks, and showcasing popular projects built with LangChain and LangGraph.

AI AgentLLMLangChain

0 likes · 10 min read

Getting Started with LangChain & LangGraph: Core Concepts of AI Agents

Ops Development Stories

Jul 14, 2025 · Artificial Intelligence

Mastering AIOps: Prompt Engineering, Function Calling, RAG, Graph RAG, and Local LLM Deployment

This comprehensive guide explores AIOps techniques such as prompt engineering, chat completions, memory management, function calling, fine‑tuning, retrieval‑augmented generation (RAG), graph‑based RAG, and practical steps for deploying open‑source large language models locally, providing code examples and best‑practice recommendations for modern DevOps environments.

Function CallingGraph RAGRAG

0 likes · 47 min read

Mastering AIOps: Prompt Engineering, Function Calling, RAG, Graph RAG, and Local LLM Deployment

Tencent Technical Engineering

Jul 14, 2025 · Artificial Intelligence

Demystifying AIGC, Agents, and MCP: Core Concepts and How They Interact

This article provides a concise overview of the latest AI concepts—including AIGC, Retrieval‑Augmented Generation, Function‑Calling models, intelligent agents, and the Model Context Protocol—explaining their principles, differences, and how they can be combined to build more powerful AI applications for developers outside the AI field.

AIGCFunction CallingLLM

0 likes · 15 min read

Demystifying AIGC, Agents, and MCP: Core Concepts and How They Interact

DaTaobao Tech

Jul 14, 2025 · Artificial Intelligence

Mastering AI Application Modes: Embedding, Copilot, and Agents Explained

This article explores practical AI engineering strategies, detailing the three AI application modes—Embedding, Copilot, and Agents—along with prompt engineering, model selection, function calling, RAG, workflow design, and multi‑agent architectures to boost business efficiency and user experience.

AIModel EvaluationPrompt engineering

0 likes · 25 min read

Mastering AI Application Modes: Embedding, Copilot, and Agents Explained

Data Thinking Notes

Jul 13, 2025 · Artificial Intelligence

How to Build an Enterprise Knowledge Base with Dify: Full Setup Guide

This article walks developers through the entire process of deploying Dify locally, configuring model providers, creating and segmenting a knowledge base with RAG, choosing indexing methods, and integrating the knowledge base into a chatbot application, complete with code snippets and visual guides.

AI deploymentDifyKnowledge Base

0 likes · 11 min read

How to Build an Enterprise Knowledge Base with Dify: Full Setup Guide

Architecture and Beyond

Jul 12, 2025 · Artificial Intelligence

What Exactly Is an AI Agent? History, Architecture, and Future Challenges

This article traces the evolution of AI agents from early expert systems to modern large‑language‑model‑driven assistants, explains their core perception, reasoning, memory, and action modules, compares thinking and execution models, and discusses current limitations such as hallucinations, reliability, cost, and security.

AI AgentMemory ArchitecturePrompt engineering

0 likes · 20 min read

Architect

Jul 11, 2025 · Artificial Intelligence

How OpenAI’s Zero‑Vector Agentic RAG Redefines AI Knowledge Retrieval

OpenAI’s new non‑vectorized Agentic RAG approach replaces traditional vector search with a hierarchical, multi‑round content selection process, leveraging large‑context models like GPT‑4.1‑mini for efficient document loading, dynamic navigation, and accurate answer generation, while outlining model selection strategies, cost trade‑offs, and production considerations.

AI ArchitectureModel SelectionRAG

0 likes · 15 min read

How OpenAI’s Zero‑Vector Agentic RAG Redefines AI Knowledge Retrieval

Sanyou's Java Diary

Jul 10, 2025 · Artificial Intelligence

Demystifying AIGC, Agents, RAG, and MCP: Core AI Concepts Explained

This article provides a concise overview of the latest AI breakthroughs—including AIGC, multimodal technology, Retrieval‑Augmented Generation (RAG), intelligent agents with function‑calling models, and the Model Context Protocol (MCP)—explaining their principles, relationships, and practical implications for developers outside the AI field.

AIAIGCFunction Calling

0 likes · 16 min read

Demystifying AIGC, Agents, RAG, and MCP: Core AI Concepts Explained

DataFunSummit

Jul 10, 2025 · Artificial Intelligence

How Large Language Model AI Agents Transform Intelligent Operations and On‑Call Support

This article details the design and implementation of a large‑model‑driven intelligent operations dialogue system, covering intent recognition, routing, multi‑agent planning, RAG, workflow, ReAct, reflection, tree‑search techniques, evaluation challenges, and future multi‑agent collaboration for on‑call support.

AI agentsIntelligent OperationsRAG

0 likes · 23 min read

How Large Language Model AI Agents Transform Intelligent Operations and On‑Call Support

Tencent Cloud Developer

Jul 10, 2025 · Artificial Intelligence

Demystifying AIGC, Agents, and MCP: Essential AI Concepts for Developers

This article provides a concise, developer‑focused overview of emerging AI concepts—including AIGC, multimodal models, Retrieval‑Augmented Generation, intelligent agents, Function‑Calling, and the Model Context Protocol (MCP)—explaining their core principles, differences, and how they interrelate to enable advanced AI applications.

AIAIGCFunction Calling

0 likes · 16 min read

Demystifying AIGC, Agents, and MCP: Essential AI Concepts for Developers

Huolala Tech

Jul 9, 2025 · Artificial Intelligence

How HuoLala Overcomes Large‑Model Limits with Retrieval‑Augmented Vector Databases

This article details HuoLala's journey from confronting large‑model knowledge gaps and latency issues to adopting Retrieval‑Augmented Generation (RAG) with vector databases, outlining their architecture challenges, selection criteria, deployment scenarios, and future plans for AI‑driven logistics.

AIElasticsearchMilvus

0 likes · 14 min read

How HuoLala Overcomes Large‑Model Limits with Retrieval‑Augmented Vector Databases

Fun with Large Models

Jul 9, 2025 · Artificial Intelligence

Boosting RAG Accuracy with GraphRAG: Large‑Scale KG Build and Mode Comparison

This article demonstrates how to construct a knowledge graph from a 58,099‑character document using GraphRAG, measures the indexing time on an M2 PRO machine, and compares Global, Local, and Drift query modes, highlighting three scenarios where Local mode is unsuitable.

GraphRAGRAGglobal mode

0 likes · 11 min read

Boosting RAG Accuracy with GraphRAG: Large‑Scale KG Build and Mode Comparison

JD Tech Talk

Jul 8, 2025 · Artificial Intelligence

How AI Can Turn a Code Maze into a Knowledge Highway for New Developers

New developer Li Ming’s frustrating onboarding experience highlights hidden business rules, undocumented code, and poor knowledge transfer, prompting him to build an AI‑driven knowledge base that links code changes, requirements, and operational docs, ultimately streamlining troubleshooting, accelerating feature development, and improving knowledge retention across teams.

AIRAGcode retrieval

0 likes · 18 min read

How AI Can Turn a Code Maze into a Knowledge Highway for New Developers

JD Cloud Developers

Jul 8, 2025 · Artificial Intelligence

How AI Can Turn a Code Maze into a Knowledge Hub for New Developers

This article follows a new developer named Li Ming as he confronts undocumented code, hidden business rules, and fragmented knowledge, then demonstrates how leveraging large‑language models to index, associate, and retrieve code, requirements, and operational data can create an intelligent knowledge base that streamlines onboarding, reduces errors, and enhances collaboration across development, testing, and product teams.

AIRAGsoftware development

0 likes · 19 min read

How AI Can Turn a Code Maze into a Knowledge Hub for New Developers

Architect's Alchemy Furnace

Jul 7, 2025 · Artificial Intelligence

How to Build an AI‑Powered Knowledge Retrieval Workflow with Dify and Embedding Models

This guide explains how to organize a knowledge base, select embedding and rerank models, clean user queries, and construct a Dify DSL workflow that iteratively retrieves and merges information before handing it to a large language model for answer generation.

AI workflowDifyKnowledge Retrieval

0 likes · 23 min read

How to Build an AI‑Powered Knowledge Retrieval Workflow with Dify and Embedding Models

AI Algorithm Path

Jul 3, 2025 · Artificial Intelligence

Exploring Advanced, Graph, and Agentic RAG: The Evolution of Retrieval‑Augmented Generation

This article examines how Retrieval‑Augmented Generation (RAG) has progressed from simple keyword‑based retrieval to advanced semantic methods, modular architectures, graph‑enhanced reasoning, and autonomous agentic systems, highlighting each approach's workflow, benefits, limitations, and the shift toward dynamic AI decision‑making.

AIAgentic RAGGraph RAG

0 likes · 7 min read

Exploring Advanced, Graph, and Agentic RAG: The Evolution of Retrieval‑Augmented Generation

Instant Consumer Technology Team

Jul 3, 2025 · Artificial Intelligence

Why Buying an AI Appliance Is a Strategic Pitfall for Enterprises

Enterprises rushing to purchase DeepSeek AI appliances and smart‑agent platforms often face hidden technical, data, and organizational challenges that turn promised "plug‑and‑play" solutions into costly missteps, highlighting the need for realistic strategy, robust data governance, and continuous capability building.

AI capability buildingAI deploymentData Governance

0 likes · 28 min read

Why Buying an AI Appliance Is a Strategic Pitfall for Enterprises

AI Large Model Application Practice

Jul 2, 2025 · Artificial Intelligence

Build a PPT‑Powered RAG Engine with Visual Models and MCP Server

This article explains how to construct a Retrieval‑Augmented Generation (RAG) pipeline for multi‑page PPT documents by converting slides to images, extracting content with a vision model, indexing with LlamaIndex and Chroma, and exposing the functionality through an MCP Server with tools for adding, querying, and managing PPTs.

LlamaIndexMCP ServerPPT

0 likes · 13 min read

Build a PPT‑Powered RAG Engine with Visual Models and MCP Server

Instant Consumer Technology Team

Jul 1, 2025 · Artificial Intelligence

Choosing Between Gemini CLI and Claude Code: A Deep AI Coding Tool Comparison

This article examines the rise of terminal AI coding assistants, comparing Google’s Gemini CLI and Anthropic’s Claude Code across features, context length, cost, performance, platform support, and usage tips, helping developers decide which tool best fits their productivity and project needs.

AI CodingClaude CodeGemini CLI

0 likes · 14 min read

Choosing Between Gemini CLI and Claude Code: A Deep AI Coding Tool Comparison

Ops Development Stories

Jul 1, 2025 · Artificial Intelligence

From Lean to AIOps: How AI is Transforming Modern Operations

This comprehensive guide walks through the evolution from Lean and Agile practices to DevOps and finally AIOps, explaining core concepts, key algorithms, the role of large language models, RAG‑based root‑cause analysis, and practical implementation steps for intelligent operations.

Large Language ModelsLeanRAG

0 likes · 19 min read

From Lean to AIOps: How AI is Transforming Modern Operations

Ops Development Stories

Jun 30, 2025 · Artificial Intelligence

Build a Private AI Knowledge Assistant with n8n: Zero‑Code RAG in 30 Minutes

This guide shows how to create a fully local Retrieval‑Augmented Generation (RAG) system using n8n, Docker, Ollama and the free Qwen3 embedding model, enabling secure, up‑to‑date AI assistants that answer enterprise questions without exposing any proprietary data.

AI AssistantDockerEmbedding

0 likes · 17 min read

Build a Private AI Knowledge Assistant with n8n: Zero‑Code RAG in 30 Minutes

DataFunTalk

Jun 29, 2025 · Artificial Intelligence

Large Models Boost Douyin User Experience: Expert Insights

In an interview at the DA Digital Intelligence Conference, ByteDance AI specialist Cai Conghuai explains how large language models, combined with techniques like SFT, DPO, and RAG, are reshaping Douyin's user‑experience signal detection, root‑cause analysis, and evaluation, while outlining future AI‑agent breakthroughs.

AIDPOLarge Language Models

0 likes · 12 min read

Large Models Boost Douyin User Experience: Expert Insights

Architect

Jun 28, 2025 · Artificial Intelligence

How MultiAgentPPT Generates Slides with AI Agents: Architecture and Code Walkthrough

This article examines the MultiAgentPPT project, detailing its multi‑agent workflow, the four core agents that generate outlines, split topics, conduct research, and summarize results, and explains how the system retrieves data via a WeChat crawler and constructs prompts for LLM‑driven PPT creation.

AI agentsMultiAgentPPTPPT generation

0 likes · 6 min read

How MultiAgentPPT Generates Slides with AI Agents: Architecture and Code Walkthrough

Alibaba Cloud Big Data AI Platform

Jun 27, 2025 · Artificial Intelligence

Build a Powerful AI Search RAG Application with PAI‑LangStudio, Qwen3 & Elasticsearch

This guide walks you through using the PAI‑LangStudio platform together with the Qwen3 large language model and Elasticsearch to create a full‑stack AI Search RAG solution, covering prerequisites, step‑by‑step configuration of model services, database connections, runtimes, knowledge bases, workflow creation, testing, and deployment for production use.

AI searchElasticsearchPAI‑LangStudio

0 likes · 11 min read

Build a Powerful AI Search RAG Application with PAI‑LangStudio, Qwen3 & Elasticsearch

360 Zhihui Cloud Developer

Jun 27, 2025 · Operations

How AI‑Powered Ops‑Nexus Transforms Intelligent Operations for 100k+ Servers

This article details the design, technology choices, functional modules, core implementation, performance optimizations, and future roadmap of Ops‑Nexus, an AI‑driven intelligent operations platform that streamlines alarm analysis, log processing, and host health checks for large‑scale monitoring environments.

AI OpsIntelligent OperationsLLM

0 likes · 12 min read

How AI‑Powered Ops‑Nexus Transforms Intelligent Operations for 100k+ Servers

AI Algorithm Path

Jun 26, 2025 · Artificial Intelligence

The 10 Essential Components of a Retrieval‑Augmented Generation (RAG) System

This guide breaks down the ten core building blocks of a production‑ready RAG pipeline—from input handling and vector stores to prompt engineering, LLM inference, observability, and evaluation—showing why each piece matters, common pitfalls, and practical best‑practice recommendations.

LLMPrompt engineeringRAG

0 likes · 9 min read

The 10 Essential Components of a Retrieval‑Augmented Generation (RAG) System

Su San Talks Tech

Jun 26, 2025 · Artificial Intelligence

Master Spring AI Alibaba 1.0: Upgrade Guide, New Features & Real‑World Code

This article walks you through what Spring AI Alibaba 1.0 offers, highlights its major updates such as the Graph multi‑agent framework and ecosystem integrations, and provides a step‑by‑step upgrade path with Maven dependency changes, code fixes, and configuration adjustments for Java developers.

AI FrameworkMCPMulti-Agent

0 likes · 20 min read

Master Spring AI Alibaba 1.0: Upgrade Guide, New Features & Real‑World Code

DeWu Technology

Jun 25, 2025 · Artificial Intelligence

Engineering Large Language Models with Spring AI: From Basics to RAG and Function Calls

This article walks through the fundamentals of large language models, their stateless and structured-output nature, explains how Spring‑AI provides a Java‑friendly API for model integration, covers RAG architecture, the MCP protocol, and demonstrates end‑to‑end code examples for building intelligent agents.

AI integrationFunction CallingLarge Language Models

0 likes · 15 min read

Engineering Large Language Models with Spring AI: From Basics to RAG and Function Calls

ITFLY8 Architecture Home

Jun 24, 2025 · Artificial Intelligence

How Transformers and Mixture-of-Experts Power Large Language Models

This article explores the role of Transformers and Mixture‑of‑Experts in large models, outlines five fine‑tuning methods, compares traditional and agentic RAG, presents classic agent design patterns, text‑chunking strategies, levels of intelligent agent systems, and explains KV‑caching techniques.

Fine-tuningLarge Language ModelsMixture of Experts

0 likes · 2 min read

How Transformers and Mixture-of-Experts Power Large Language Models

Fun with Large Models

Jun 23, 2025 · Artificial Intelligence

Boost RAG Answer Accuracy: Detailed Step‑by‑Step GraphRAG Knowledge‑Graph Construction

This article walks through the complete GraphRAG knowledge‑graph building pipeline—text splitting, entity extraction, relation mining, community clustering, and report generation—using a concrete example from the book “The Age of Big Data,” and explains why each step improves retrieval and answer quality.

GraphRAGRAGcommunity clustering

0 likes · 20 min read

Boost RAG Answer Accuracy: Detailed Step‑by‑Step GraphRAG Knowledge‑Graph Construction

Tech Freedom Circle

Jun 21, 2025 · Artificial Intelligence

How MCP + LLM + Agent Architecture Becomes the AI Agent’s Neural Hub and New Infrastructure

The article explains the Model Context Protocol (MCP) as a zero‑code bridge that lets large language models seamlessly access databases, external APIs, and execute code, detailing its benefits for developers and everyday users, its core components, step‑by‑step workflow, real‑world examples, and how it outperforms traditional APIs in modern AI agent systems.

AI AgentLLMMCP

0 likes · 37 min read

How MCP + LLM + Agent Architecture Becomes the AI Agent’s Neural Hub and New Infrastructure

Data Thinking Notes

Jun 19, 2025 · Artificial Intelligence

Andrew Ng on Building Agentic AI Systems: Tools, MCP, and Practical Insights

In a candid conversation, Andrew Ng and Harrison Chase explore the evolving landscape of AI agents, discussing modular toolchains, the emerging MCP standard, challenges of agent‑to‑agent communication, voice interaction latency, and the importance of rapid, technically skilled execution for successful AI product development.

AI agentsLangChainMCP

0 likes · 19 min read

Andrew Ng on Building Agentic AI Systems: Tools, MCP, and Practical Insights

DataFunSummit

Jun 19, 2025 · Artificial Intelligence

How Large Models Are Revolutionizing Douyin’s User Experience – Expert Insights

In a detailed interview, ByteDance AI specialist Cai Conghuai explains how large‑model techniques such as SFT, DPO and RAG address Douyin’s multimodal user‑experience challenges, improve signal detection, root‑cause analysis, and outline future AI‑agent breakthroughs for content platforms.

AI AlgorithmsMultimodal LearningRAG

0 likes · 11 min read

How Large Models Are Revolutionizing Douyin’s User Experience – Expert Insights

Fun with Large Models

Jun 19, 2025 · Artificial Intelligence

How GraphRAG Boosts Answer Accuracy with Knowledge Graphs (Part 1)

This article explains GraphRAG’s architecture, compares it with traditional RAG, and presents experimental results showing that GraphRAG’s knowledge‑graph‑driven retrieval markedly improves answer accuracy, especially on low‑match, multi‑paragraph queries.

GraphRAGLarge Language ModelsPerformance Evaluation

0 likes · 11 min read

How GraphRAG Boosts Answer Accuracy with Knowledge Graphs (Part 1)

Tencent Technical Engineering

Jun 16, 2025 · Artificial Intelligence

Mastering RAG and AI Agents: Practical Tips, Code Samples, and Evaluation Strategies

This comprehensive guide walks you through the fundamentals of Retrieval‑Augmented Generation (RAG) and AI agents, explains their inner workings, shares optimization tricks, provides ready‑to‑run code snippets, and demonstrates how to evaluate performance with metrics such as recall, faithfulness, and answer relevance.

AI agentsLLMPrompt engineering

0 likes · 36 min read

Mastering RAG and AI Agents: Practical Tips, Code Samples, and Evaluation Strategies

ITPUB

Jun 15, 2025 · Artificial Intelligence

How to Build a High‑Performance Enterprise RAG System with Model Context Protocol (MCP)

This article presents a step‑by‑step guide for constructing a scalable enterprise Retrieval‑Augmented Generation (RAG) solution using the Model Context Protocol (MCP), covering architecture comparison, system design, Milvus‑backed knowledge store, Python client implementation, deployment scripts, code examples, and best‑practice recommendations.

KnowledgeBaseLLMMCP

0 likes · 22 min read

How to Build a High‑Performance Enterprise RAG System with Model Context Protocol (MCP)

TAL Education Technology

Jun 13, 2025 · Operations

How Large Language Models Are Revolutionizing Fault Localization

This article explores how the rapid rise of large language models and techniques like Retrieval‑Augmented Generation, Chain‑of‑Thought prompting, and multi‑agent architectures can dramatically improve the speed, accuracy, and automation of fault localization in modern operations environments.

Agent ArchitectureCoTFault Localization

0 likes · 14 min read

How Large Language Models Are Revolutionizing Fault Localization

Instant Consumer Technology Team

Jun 12, 2025 · Artificial Intelligence

How to Build a Production-Ready RAG System with Qwen3 Embedding and Reranker Models

This guide walks through using Alibaba's new Qwen3-Embedding and Qwen3-Reranker models to build a two‑stage Retrieval‑Augmented Generation pipeline with Milvus, covering environment setup, data ingestion, vector indexing, reranking, and LLM‑driven answer generation, demonstrating production‑grade performance across multilingual queries.

EmbeddingLLMMilvus

0 likes · 19 min read

How to Build a Production-Ready RAG System with Qwen3 Embedding and Reranker Models

DataFunSummit

Jun 12, 2025 · Artificial Intelligence

How Alibaba Cloud’s AI Search Evolves with Agentic RAG and Multi‑Model Innovations

This article details Alibaba Cloud AI Search’s development journey, covering its dual product lines, the evolution of Agentic RAG technology, multi‑agent architectures, vector retrieval breakthroughs, GPU‑accelerated indexing, NL2SQL capabilities, deployment models, and future directions for AI‑driven search solutions.

AI searchGPU AccelerationOpenSearch

0 likes · 33 min read

How Alibaba Cloud’s AI Search Evolves with Agentic RAG and Multi‑Model Innovations

Zuoyebang Tech Team

Jun 12, 2025 · Information Security

How AI‑Powered RAG and Agents Are Revolutionizing Enterprise Security Operations

This article explains how the rise of AI large‑model technology and Retrieval‑Augmented Generation (RAG) combined with autonomous AI agents enable a three‑layer network‑boundary defense, address deep operational challenges such as alert overload and response latency, and dramatically improve incident‑response efficiency in large‑scale enterprises.

AI agentsAI securityLarge Language Models

0 likes · 16 min read

How AI‑Powered RAG and Agents Are Revolutionizing Enterprise Security Operations

Data Thinking Notes

Jun 11, 2025 · Artificial Intelligence

How RAG‑Powered AI Boosted Government Data Labeling Efficiency by 5×

This case study details how a government‑focused AI system using retrieval‑augmented generation (RAG) and advanced preprocessing algorithms increased data labeling speed by up to five times, raised accuracy above 95%, and produced high‑quality enterprise, spatial, and economic datasets.

AIGovernmentRAG

0 likes · 5 min read

How RAG‑Powered AI Boosted Government Data Labeling Efficiency by 5×

Sohu Tech Products

Jun 11, 2025 · Artificial Intelligence

How DeepSeek and TiDB AI Are Redefining Data Engines for the Large‑Model Era

This article explores DeepSeek's open‑source large‑model breakthroughs, PingCAP's AI‑enhanced database roadmap, TiDB.AI's retrieval‑augmented generation framework, the unified TiDB data engine, and practical Q&A insights on knowledge‑graph construction, vector search, and AI‑driven SQL generation.

AIDeepSeekRAG

0 likes · 15 min read

How DeepSeek and TiDB AI Are Redefining Data Engines for the Large‑Model Era

Alibaba Cloud Developer

Jun 10, 2025 · Artificial Intelligence

How AI Application Architectures Evolve: From Simple LLM Calls to Guardrails, Routing, and Agents

This article traces the evolution of AI application architectures—from the earliest minimal user‑LLM interaction to advanced designs featuring context enhancement, input/output guardrails, intent routing, model gateways, caching strategies, agent capabilities, monitoring, and inference performance optimizations—providing practical insights and references for developers.

AI ArchitectureInference OptimizationLLM

0 likes · 21 min read

How AI Application Architectures Evolve: From Simple LLM Calls to Guardrails, Routing, and Agents

Data Thinking Notes

Jun 8, 2025 · Artificial Intelligence

Explore the Complete AI Large Model Technology Landscape: Architecture Diagrams Across Industries

This article presents a panoramic view of AI large‑model technologies, showcasing a series of architecture diagrams that illustrate general model frameworks, RAG knowledge‑base structures, agricultural and retail applications, IoT integration, compliance and risk‑management setups, agent platforms, and CRM‑enhanced solutions.

AIRAGarchitecture

0 likes · 3 min read

Explore the Complete AI Large Model Technology Landscape: Architecture Diagrams Across Industries

Qborfy AI

Jun 7, 2025 · Artificial Intelligence

Build a Retrieval‑Augmented Generation (RAG) Chatbot with LangChain and Streamlit

This guide walks through the complete process of creating a RAG‑powered question‑answering bot using LangChain, Streamlit, and vector‑store embeddings, covering theory, architecture, data loading, chunking, vector indexing, retrieval, LLM integration, and full code implementation with practical examples.

ChatbotLangChainPython

0 likes · 13 min read

Build a Retrieval‑Augmented Generation (RAG) Chatbot with LangChain and Streamlit

AI2ML AI to Machine Learning

Jun 6, 2025 · Artificial Intelligence

Tackling the Top Challenges of Retrieval‑Augmented Generation (RAG)

The article enumerates common pitfalls of Retrieval‑Augmented Generation—such as missing content, low‑rank document misses, context limits, format errors, incomplete answers, scalability bottlenecks, complex PDF extraction, data‑quality issues, domain adaptation gaps, hallucinations, and feedback‑loop deficiencies—and offers concrete mitigation strategies ranging from data cleaning and prompt design to hybrid search, hierarchical retrieval, document compression, and automated evaluation.

Data QualityHybrid SearchLLM

0 likes · 9 min read

Tackling the Top Challenges of Retrieval‑Augmented Generation (RAG)

IT Services Circle

Jun 6, 2025 · Artificial Intelligence

Master Retrieval‑Augmented Generation (RAG): From Basics to Advanced Practices

This article introduces Retrieval‑Augmented Generation (RAG), explains its core components—knowledge embedding, retriever, and generator—covers practical system construction, optimization techniques, evaluation metrics, and advanced paradigms such as GraphRAG and Multi‑Modal RAG, while highlighting a comprehensive guidebook for hands‑on implementation.

AIKnowledge RetrievalRAG

0 likes · 12 min read

Master Retrieval‑Augmented Generation (RAG): From Basics to Advanced Practices

Didi Tech

Jun 5, 2025 · Artificial Intelligence

Unlocking Modern AI Application Architecture: From RAG to Agents and MCP

This article surveys the evolution of AI applications, explains large language model fundamentals, outlines architectural challenges, and introduces three core patterns—Retrieval‑Augmented Generation (RAG), autonomous Agents, and Model Context Protocol (MCP)—while providing practical LangChain code snippets and integration guidance.

AILLMLangChain

0 likes · 28 min read

Unlocking Modern AI Application Architecture: From RAG to Agents and MCP

Instant Consumer Technology Team

Jun 4, 2025 · Artificial Intelligence

Unlocking Retrieval-Augmented Generation: Theory, Practice, and Future Trends

This comprehensive article examines Retrieval‑Augmented Generation (RAG), covering its historical evolution, core theory, implementation variants, practical code examples, diverse applications, current controversies, and future research directions within the AI and NLP landscape.

Artificial IntelligenceGenerative ModelsRAG

0 likes · 21 min read

Unlocking Retrieval-Augmented Generation: Theory, Practice, and Future Trends

Full-Stack Internet Architecture

Jun 4, 2025 · Artificial Intelligence

Key AI Concepts for Spring AI: Models, Prompts, Embeddings, Tokens, Structured Output, and RAG

This article introduces essential AI concepts—including models, prompts and prompt templates, embeddings, tokens, structured output, and Retrieval‑Augmented Generation—explaining their meanings and relevance for effectively using Spring AI in real‑world applications.

AIPrompt engineeringRAG

0 likes · 7 min read

Key AI Concepts for Spring AI: Models, Prompts, Embeddings, Tokens, Structured Output, and RAG

Satori Komeiji's Programming Classroom

Jun 3, 2025 · Artificial Intelligence

Everything You Need to Know About Retrieval‑Augmented Generation (RAG)

The article explains Retrieval‑Augmented Generation (RAG) by describing how a programmer, frustrated with oversized prompts for a large language model, discovers that retrieving relevant document fragments, embedding them, and feeding the augmented context to the model yields accurate, fact‑based answers.

AIEmbeddingRAG

0 likes · 6 min read

Everything You Need to Know About Retrieval‑Augmented Generation (RAG)

Fighter's World

Jun 2, 2025 · Artificial Intelligence

Why Is Context King for Large Language Models?

This article provides a comprehensive technical analysis of LLM context, covering its definition, types, tokenization, window‑size evolution, diminishing returns, management techniques such as RAG, CoT, memory‑as‑a‑service, and future challenges like multimodal fusion, privacy, and autonomous agent memory.

Agent MemoryLLMMemory-as-a-Service

0 likes · 48 min read

Why Is Context King for Large Language Models?

dbaplus Community

May 31, 2025 · Artificial Intelligence

How RAG is Shaping the Future of AI-Powered User Experience

Amid the rapid rise of large language models, this article examines RAG’s development, technical hurdles, core strategies, and future outlook, illustrating how Alibaba’s Chatbot and Copilot projects boost retrieval accuracy to 90% and generation precision to 85% while tackling data quality, heterogeneous retrieval, and evaluation challenges.

AI searchEvaluation MetricsRAG

0 likes · 27 min read

How RAG is Shaping the Future of AI-Powered User Experience

ITFLY8 Architecture Home

May 30, 2025 · Artificial Intelligence

Explore the Full Spectrum of AI Large Model Architectures

This article presents a comprehensive visual collection of AI large‑model architecture diagrams, covering general frameworks, RAG knowledge‑base systems, agriculture, e‑commerce recommendation, IoT, compliance risk management, agent platforms, and CRM integration, offering a panoramic view of modern AI infrastructure.

AIIoTRAG

0 likes · 3 min read

Explore the Full Spectrum of AI Large Model Architectures

Instant Consumer Technology Team

May 29, 2025 · Artificial Intelligence

API vs GUI Agents: How to Choose the Right LLM Automation Approach

This article examines the evolution of large language model agents, contrasting API‑based agents that use predefined function calls with GUI‑based agents that interact with visual interfaces, and explores hybrid strategies, orchestration tools, RAG techniques, and practical guidelines for selecting the optimal paradigm.

API vs GUIHybrid automationLLM agents

0 likes · 34 min read

API vs GUI Agents: How to Choose the Right LLM Automation Approach

DevOps

May 28, 2025 · Artificial Intelligence

Google Proposes a “Sufficient Context” Framework to Strengthen Enterprise Retrieval‑Augmented Generation Systems

Google researchers introduce a “sufficient context” framework that classifies retrieved passages as adequate or inadequate for answering a query, enabling large language models in enterprise RAG systems to decide when to answer, refuse, or request more information, thereby improving accuracy and reducing hallucinations.

AI reliabilityEnterprise AILarge Language Models

0 likes · 9 min read

Google Proposes a “Sufficient Context” Framework to Strengthen Enterprise Retrieval‑Augmented Generation Systems

Sohu Tech Products

May 28, 2025 · Artificial Intelligence

Introducing AIFlowy: An Open‑Source Java‑Based One‑Stop AI Application Development Platform

AIFlowy is a Java‑powered, open‑source, enterprise‑grade AI platform that offers a bot for natural‑language interaction, extensible plugins, a knowledge‑base with RAG support, and visual workflow automation, enabling developers to quickly build and customize AI applications for domestic B2B scenarios.

AIBotKnowledge Base

0 likes · 10 min read

Introducing AIFlowy: An Open‑Source Java‑Based One‑Stop AI Application Development Platform

phodal

May 28, 2025 · Artificial Intelligence

Boost Code Retrieval with AutoDev’s Pre‑Generated Context Worker

The article explains how AutoDev’s Context Worker pre‑generates semantic code context to improve RAG performance, outlines the limitations of vector‑based retrieval, describes the tool’s multi‑language AST analysis, knowledge‑graph construction, and provides command‑line usage examples for integrating the generated context into AI‑driven development workflows.

AIASTCLI

0 likes · 8 min read

Boost Code Retrieval with AutoDev’s Pre‑Generated Context Worker

Coder Circle

May 28, 2025 · Artificial Intelligence

Core AI Concepts Every Spring AI Developer Should Know

This article explains fundamental AI concepts—including models, prompts, prompt templates, embeddings, tokens, structured output, data integration, RAG, and tool calling—and shows how Spring AI simplifies their use for Java developers building intelligent applications.

AI modelsPrompt engineeringRAG

0 likes · 13 min read

Core AI Concepts Every Spring AI Developer Should Know

Architect

May 23, 2025 · Artificial Intelligence

How We Won the RAG Challenge: Multi‑Router & Dynamic Knowledge Base Techniques Revealed

This article details the end‑to‑end design, parsing tricks, vector database setup, retrieval strategies, prompt engineering, and LLM reranking that powered the winning solution in a company‑annual‑report question‑answering competition.

FAISSLLMPDF parsing

0 likes · 37 min read

How We Won the RAG Challenge: Multi‑Router & Dynamic Knowledge Base Techniques Revealed

Eric Tech Circle

May 22, 2025 · Artificial Intelligence

Build a Fast, Zero‑Cost Local AI Knowledge Base with Ollama, Cherry Studio, and Qwen‑3

This guide walks you through building a high‑performance local AI knowledge base using Ollama, Cherry Studio, and the Qwen‑3 model, covering RAG fundamentals, model selection, document preparation, system configuration, and step‑by‑step UI operations for non‑programmers.

AIKnowledge BaseLocal-LLM

0 likes · 13 min read

Build a Fast, Zero‑Cost Local AI Knowledge Base with Ollama, Cherry Studio, and Qwen‑3

Programmer DD

May 21, 2025 · Artificial Intelligence

What’s New in Spring AI 1.0 GA? A Deep Dive into Java AI Features

Spring AI 1.0 GA introduces a comprehensive suite of AI capabilities for Java developers, including a ChatClient supporting 20 models, vector‑store integrations, RAG pipelines, advanced chat memory, @Tool function calling, model evaluation, observability, Model Context Protocol, and autonomous agents, with examples for major cloud providers.

AI modelsMCPRAG

0 likes · 6 min read

What’s New in Spring AI 1.0 GA? A Deep Dive into Java AI Features

Java Tech Enthusiast

May 21, 2025 · Artificial Intelligence

How ChatGPT's New Memory Feature Works: Technical Analysis and Implementation Details

The article provides a detailed technical breakdown of OpenAI's new ChatGPT memory feature, explaining its two memory modes, underlying sub‑systems, possible implementation approaches using vector stores and scheduled jobs, and early user feedback highlighting both benefits and bugs.

AIChatGPTRAG

0 likes · 8 min read

How ChatGPT's New Memory Feature Works: Technical Analysis and Implementation Details

Java Architecture Diary

May 21, 2025 · Artificial Intelligence

Spring AI 1.0 Launch: Production‑Ready Java AI Framework Unveiled

Spring AI 1.0, the first production‑grade Java AI framework, introduces ready‑to‑use APIs, seamless model integration, enterprise‑level RAG engine, smart tool calling, and three development modes, empowering developers to rapidly build, customize, and fully control AI applications with major model providers like OpenAI, Anthropic, DeepSeek.

AI FrameworkDeepSeekJava AI

0 likes · 13 min read

Spring AI 1.0 Launch: Production‑Ready Java AI Framework Unveiled

DeWu Technology

May 19, 2025 · Artificial Intelligence

AI-Powered Automated Test Case Generation: Design, Implementation, and Future Plans

This article presents a comprehensive AI-driven solution for automatically generating functional test cases, detailing the AI background, design scheme, core components such as PRD parsing, test‑point generation, test‑case creation, knowledge‑base construction, implementation results, and future development directions.

AIKnowledge BaseLLM

0 likes · 7 min read

AI-Powered Automated Test Case Generation: Design, Implementation, and Future Plans

Tencent Technical Engineering

May 19, 2025 · Artificial Intelligence

RAG, Agents, and Multimodal Large Models: Evolution, Challenges, and Future Trends

This article examines the evolution of large model technologies—including Retrieval‑Augmented Generation, AI agents, and multimodal models—detailing their technical foundations, practical challenges, industry applications, and future development trends, offering a comprehensive perspective for AI practitioners and researchers.

AI AgentKnowledge RetrievalMultimodal

0 likes · 14 min read

RAG, Agents, and Multimodal Large Models: Evolution, Challenges, and Future Trends

DataFunSummit

May 13, 2025 · Artificial Intelligence

Integrating Large Language Models and Knowledge Graphs for Financial Applications: Challenges, Solutions, and Future Directions

This talk explores the technical challenges of applying large language models and knowledge graphs in finance, discusses solutions such as RAG enhancements, graph‑guided retrieval, multimodal extensions, and presents future research directions including multimodal graph integration, agentic systems, and decision‑making applications.

AIMultimodalRAG

0 likes · 33 min read

Integrating Large Language Models and Knowledge Graphs for Financial Applications: Challenges, Solutions, and Future Directions

DataFunSummit

May 9, 2025 · Artificial Intelligence

Practical Experience Building Zhihu Direct Answer: An AI‑Powered Search Product

This article presents a comprehensive overview of Zhihu Direct Answer, describing its AI‑driven search architecture, RAG framework, query understanding, retrieval, chunking, reranking, generation, evaluation mechanisms, engineering optimizations, and the professional edition, while sharing concrete performance‑boosting practices and future development plans.

AIGenerationProduct Development

0 likes · 14 min read

Practical Experience Building Zhihu Direct Answer: An AI‑Powered Search Product

Alibaba Cloud Native

May 9, 2025 · Artificial Intelligence

Build a Retrieval‑Augmented Generation (RAG) App with LangChain, Higress, and Elasticsearch

This tutorial walks through building a Retrieval‑Augmented Generation (RAG) system by combining LangChain for document processing, Elasticsearch’s vector store with the ELSER v2 model for semantic search, and Higress as a cloud‑native AI gateway, complete with deployment scripts, code examples, and query testing.

AIHigressLangChain

0 likes · 15 min read

Build a Retrieval‑Augmented Generation (RAG) App with LangChain, Higress, and Elasticsearch

phodal

May 9, 2025 · Artificial Intelligence

Why Pre‑Generated Context Is the Key to Faster, More Accurate AI Code Retrieval

The article examines how pre‑generating structured context for codebases can overcome the uncertainty and quality issues of traditional Retrieval‑Augmented Generation, outlines the technical and business challenges of RAG, compares existing code‑search tools, and introduces AutoDev’s Context Worker as a practical solution.

AILLMRAG

0 likes · 11 min read

Why Pre‑Generated Context Is the Key to Faster, More Accurate AI Code Retrieval

Youzan Coder

May 8, 2025 · Artificial Intelligence

Building and Optimizing a Store Smart Assistant with Aily: Architecture, Workflow, and Practical Lessons

The article details how Youzan’s Store Smart Assistant was built on the Feishu Aily platform, describing why Aily was chosen, the three‑stage development process, deep system integration, practical tips for knowledge‑base management and model stability, and the resulting efficiency gains such as handling 80% of routine queries.

AI AssistantAily platformKnowledge Base

0 likes · 24 min read

Building and Optimizing a Store Smart Assistant with Aily: Architecture, Workflow, and Practical Lessons

Alibaba Cloud Big Data AI Platform

May 6, 2025 · Artificial Intelligence

Build a Powerful RAG‑Enabled AI Q&A App with PAI‑LangStudio and Qwen3

This guide walks you through using Alibaba Cloud's PAI‑LangStudio together with the Qwen3 large language model to create an AI‑powered question‑answering system that combines Retrieval‑Augmented Generation, web search, secure deployment, and flexible customization for production use.

LangStudioMilvusQwen3

0 likes · 10 min read

Build a Powerful RAG‑Enabled AI Q&A App with PAI‑LangStudio and Qwen3

ITPUB

Apr 28, 2025 · Artificial Intelligence

How Large Language Models are Transforming Automotive Operations and Optimization

In this interview, an automotive industry expert explains how large language models and advanced operations‑optimization techniques are reshaping vehicle design, production planning, logistics, and customer services, while also discussing implementation challenges, team requirements, and future AI‑driven opportunities.

AI adoptionAutomotive AILarge Language Models

0 likes · 15 min read

How Large Language Models are Transforming Automotive Operations and Optimization

DevOps

Apr 27, 2025 · Artificial Intelligence

Large Model Technologies: RAG, AI Agents, Multimodal Applications, and Future Trends

This article examines how Retrieval‑Augmented Generation (RAG), AI agents, and multimodal large‑model techniques are reshaping AI‑industry integration, discusses their technical challenges and practical implementations, and outlines future development directions across algorithms, products, and domain‑specific applications.

AI agentsArtificial IntelligenceMultimodal

0 likes · 14 min read

Large Model Technologies: RAG, AI Agents, Multimodal Applications, and Future Trends

Fun with Large Models

Apr 25, 2025 · Artificial Intelligence

Why Your RAG System Underperforms and How to Boost Its Effectiveness by 20%

This article analyzes common shortcomings of RAG pipelines—data preparation, retrieval, and LLM generation—and provides concrete optimization techniques such as advanced chunking, embedding model selection, retrieval parameter tuning, rerank models, and prompt engineering, promising up to a 20% performance gain.

EmbeddingPrompt engineeringRAG

0 likes · 17 min read

Why Your RAG System Underperforms and How to Boost Its Effectiveness by 20%

DataFunTalk

Apr 24, 2025 · Artificial Intelligence

Is Retrieval‑Augmented Generation (RAG) Dead Yet?

This article explains the original purpose of Retrieval‑Augmented Generation, why it remains essential despite advances in large‑context LLMs, and how combining RAG with fine‑tuning, longer context windows, and model‑context protocols yields more scalable, accurate, and privacy‑preserving AI systems.

AIContext WindowKnowledge Retrieval

0 likes · 9 min read

Is Retrieval‑Augmented Generation (RAG) Dead Yet?

Tencent Cloud Developer

Apr 24, 2025 · Industry Insights

How RAG, AI Agents, and Multimodal Models Are Reshaping Industry – Trends, Challenges, and Real‑World Cases

The article analyzes the rapid evolution of large‑model technologies—Retrieval‑Augmented Generation, autonomous agents, and multimodal AI—detailing their technical foundations, practical challenges, industry applications such as unified multimodal tasks, open‑world detection, and video moderation, and forecasting future development directions.

AI agentsMultimodal AIRAG

0 likes · 15 min read

How RAG, AI Agents, and Multimodal Models Are Reshaping Industry – Trends, Challenges, and Real‑World Cases

Big Data Technology & Architecture

Apr 22, 2025 · Artificial Intelligence

Introduction to Retrieval‑Augmented Generation (RAG) and Vector Indexing with StarRocks and DeepSeek

This article explains the fundamentals of Retrieval‑Augmented Generation, demonstrates how to create and query vector indexes using StarRocks, shows how DeepSeek provides embeddings and answer generation, and walks through a complete end‑to‑end RAG pipeline with code examples and a web UI.

AIDeepSeekEmbedding

0 likes · 20 min read

Introduction to Retrieval‑Augmented Generation (RAG) and Vector Indexing with StarRocks and DeepSeek

Alibaba Cloud Developer

Apr 21, 2025 · Artificial Intelligence

Building a Private Knowledge‑Base Q&A Agent with MCP: A Hands‑On Guide

This article explores the rapid rise of AI agents, explains the Model Context Protocol (MCP) standard, and provides a step‑by‑step implementation of a private knowledge‑base Q&A system using Python and Java, covering knowledge construction, retrieval, architecture, and deployment details.

AI AgentKnowledge BaseMCP

0 likes · 9 min read

Building a Private Knowledge‑Base Q&A Agent with MCP: A Hands‑On Guide

DevOps

Apr 20, 2025 · Artificial Intelligence

Building a Medical Knowledge Base with RAG: A Step‑by‑Step Example

This article demonstrates how to construct an AI‑powered medical knowledge base for diabetes treatment by preprocessing literature, performing semantic chunking, generating BioBERT embeddings, storing them in a FAISS vector database, and using a RAG framework together with a knowledge graph to retrieve and generate accurate answers.

BioBERTFAISSRAG

0 likes · 12 min read

Building a Medical Knowledge Base with RAG: A Step‑by‑Step Example

DaTaobao Tech

Apr 18, 2025 · Frontend Development

How AI Is Transforming Frontend Development: From Design‑to‑Code to Automated Testing

This article explores how AI-driven tools are reshaping frontend engineering by automating design‑to‑code conversion, interface‑to‑data‑model mapping, private component integration, code fitting, AI code review, and automated test regression, and it evaluates their impact on efficiency and future development workflows.

AICodeGenerationRAG

0 likes · 37 min read

How AI Is Transforming Frontend Development: From Design‑to‑Code to Automated Testing

Fun with Large Models

Apr 18, 2025 · Artificial Intelligence

How RAG Works: From Data Prep to LLM Generation Explained

This article breaks down Retrieval‑Augmented Generation (RAG) into its three core stages—data preparation, data retrieval, and LLM generation—showing how document chunking, embedding, vector databases, similarity search, and optional re‑ranking combine to let large language models produce more accurate, knowledge‑grounded answers.

EmbeddingLLMRAG

0 likes · 9 min read

How RAG Works: From Data Prep to LLM Generation Explained

Data Thinking Notes

Apr 17, 2025 · Artificial Intelligence

How Dify Accelerates Generative AI App Development with Low‑Code and Modular Design

Dify is an open‑source LLM application platform that blends BaaS and LLMOps, offering low‑code development, modular components, extensive model support, and advanced retrieval features, while also detailing its current limitations and recent enhancements such as MySQL integration and Elasticsearch‑based RAG capabilities.

AIElasticsearchLLM

0 likes · 7 min read

How Dify Accelerates Generative AI App Development with Low‑Code and Modular Design

Java Architecture Diary

Apr 14, 2025 · Artificial Intelligence

How to Empower LLMs with a Private SearXNG Search Engine for Real‑Time Knowledge

This guide explains why large language models need private search capabilities, outlines the benefits of a self‑hosted SearXNG engine, provides step‑by‑step Docker deployment, and demonstrates Java integration using LangChain4j for both basic queries and retrieval‑augmented generation (RAG).

DockerLLMLangChain4j

0 likes · 6 min read

How to Empower LLMs with a Private SearXNG Search Engine for Real‑Time Knowledge

Data Thinking Notes

Apr 13, 2025 · Artificial Intelligence

How to Build a Retrieval‑Augmented Generation Knowledge Base with DeepSeek and RAGFlow

This guide walks you through the fundamentals of Retrieval‑Augmented Generation, introduces the open‑source RAGFlow framework, details installation steps, shows how to integrate DeepSeek LLMs, and explores practical application scenarios such as intelligent customer service and enterprise document QA.

AIDeepSeekLLM

0 likes · 11 min read

How to Build a Retrieval‑Augmented Generation Knowledge Base with DeepSeek and RAGFlow

Spring Full-Stack Practical Cases

Apr 10, 2025 · Artificial Intelligence

Build a RAG-Powered Knowledge Base with Spring Boot, Milvus, and Ollama

This guide walks through creating a Retrieval‑Augmented Generation (RAG) system using Spring Boot 3.4.2, Milvus vector database, and the bge‑m3 embedding model via Ollama, covering environment setup, dependency configuration, vector store operations, and integration with a large language model to deliver refined, similarity‑based answers.

EmbeddingLLMMilvus

0 likes · 11 min read

Build a RAG-Powered Knowledge Base with Spring Boot, Milvus, and Ollama

Alibaba Cloud Big Data AI Platform

Apr 10, 2025 · Artificial Intelligence

Building a Pet Hospital AI Assistant with RAG and LLMs

This article walks through the motivation, core concepts of Retrieval‑Augmented Generation, and a step‑by‑step guide to constructing a pet‑hospital AI assistant on Alibaba Cloud using LLMs, vector databases, and automated pipelines, complete with code examples and practical tips.

AI AssistantAlibaba CloudLLM

0 likes · 18 min read

Building a Pet Hospital AI Assistant with RAG and LLMs

Architect's Alchemy Furnace

Apr 8, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Boost AI Accuracy?

This article explains Retrieval‑Augmented Generation (RAG), its three‑step workflow of retrieval, augmentation, and generation, its key advantages such as improved accuracy and explainability, and compares RAG with traditional pre‑trained models, fine‑tuned models, hybrid models, knowledge‑distillation methods, and RLHF, while also covering vector, full‑text, and hybrid retrieval modes and the role of rerank models.

AIKnowledge RetrievalRAG

0 likes · 18 min read

What Is Retrieval‑Augmented Generation (RAG) and How Does It Boost AI Accuracy?

Alibaba Cloud Developer

Apr 8, 2025 · Artificial Intelligence

Unlocking LLM Secrets: From Prompt Basics to RAG and Tool Integration

This article introduces the fundamental paradigms of large language models, explaining how simple prompts, messages, and tools like RAG and ReAct enable powerful applications, while providing practical code examples, translation strategies, and insights on prompt engineering, tool usage, and model fine‑tuning.

AILLM applicationsLarge Language Models

0 likes · 23 min read

Unlocking LLM Secrets: From Prompt Basics to RAG and Tool Integration

dbaplus Community

Apr 7, 2025 · Databases

How Do LLMs Tackle Oracle Bad Block Errors? A Hands‑On Evaluation

This article presents a hands‑on evaluation of several large language models—including Mistral‑Small, Deepseek‑r1, Llama 3.3 and ChatGPT‑4‑go—on Oracle database bad‑block errors, RAG‑based document retrieval, and log‑driven reasoning, revealing performance gaps, scoring results, and practical DBA implications.

AILLM evaluationOracle

0 likes · 11 min read

How Do LLMs Tackle Oracle Bad Block Errors? A Hands‑On Evaluation

Beijing SF i-TECH City Technology Team

Apr 7, 2025 · Artificial Intelligence

LLM Application in Text Information Detection and Extraction: A Case Study of Blue-Collar Recruitment Data Processing

This article explores the application of Large Language Models (LLM) in text information detection and extraction, focusing on blue-collar recruitment data processing. It details the implementation of LLM through prompt engineering, RAG enhancement, and model fine-tuning to improve data cleaning efficiency and accuracy.

AI applicationsLLMPrompt engineering

0 likes · 31 min read

LLM Application in Text Information Detection and Extraction: A Case Study of Blue-Collar Recruitment Data Processing

DataFunSummit

Apr 7, 2025 · Artificial Intelligence

Bridging the Gap Between Large Models and Real‑World Applications with RAG and Agents

This article examines how Retrieval‑Augmented Generation (RAG) and multi‑agent technologies narrow the gap between large language models and practical deployment, highlighting their roles in operations automation, financial risk control, intelligent data governance, database localization, edge inference, and future AI‑driven solutions.

Data GovernanceLarge Language ModelsOperations Automation

0 likes · 8 min read

Bridging the Gap Between Large Models and Real‑World Applications with RAG and Agents

JD Tech Talk

Apr 7, 2025 · Artificial Intelligence

Best Practices for Building AI Agents: Prompt Design, Tool Management, and Context Optimization

This article explains how to develop robust AI agents by breaking down large prompts, selecting appropriate tools, managing context efficiently, and applying modular design principles to reduce token costs, avoid hallucinations, and improve overall performance and reliability.

AI AgentLLMPrompt engineering

0 likes · 11 min read

Best Practices for Building AI Agents: Prompt Design, Tool Management, and Context Optimization

JD Cloud Developers

Apr 7, 2025 · Artificial Intelligence

Why Bigger Prompts Fail: Modular Strategies for Building Efficient AI Agents

This article explains why overloading prompts and tools harms AI‑Agent performance, and offers practical modular design, intent‑driven instruction splitting, and efficient context management strategies such as curated function‑call tools and dynamic RAG to reduce token costs, improve response speed, and avoid hallucinations.

AI AgentLLMPrompt engineering

0 likes · 13 min read

Why Bigger Prompts Fail: Modular Strategies for Building Efficient AI Agents

Big Data Technology & Architecture

Apr 3, 2025 · Artificial Intelligence

Understanding Model Context Protocol (MCP), Retrieval-Augmented Generation (RAG), and Vector Databases for LLM Integration

This article explains the Model Context Protocol (MCP) as a standard for LLM‑data integration, describes Retrieval‑Augmented Generation (RAG) techniques to reduce hallucinations, and introduces vector databases like Milvus that store high‑dimensional embeddings for efficient AI retrieval tasks.

LLMMCPMilvus

0 likes · 7 min read

Understanding Model Context Protocol (MCP), Retrieval-Augmented Generation (RAG), and Vector Databases for LLM Integration

DevOps

Apr 2, 2025 · Artificial Intelligence

Understanding Retrieval-Augmented Generation (RAG): Concepts, Evolution, and Types

This article explains Retrieval‑Augmented Generation (RAG), its role in mitigating large language model knowledge cutoff and hallucination, outlines the evolution from naive to advanced, modular, graph, and agentic RAG, and discusses future directions such as intelligent and multi‑modal RAG systems.

Artificial IntelligenceKnowledge RetrievalLLM

0 likes · 10 min read

Understanding Retrieval-Augmented Generation (RAG): Concepts, Evolution, and Types

AntTech

Apr 2, 2025 · Artificial Intelligence

PEAR: Position-Embedding-Agnostic Attention Re-weighting Enhances Retrieval-Augmented Generation with Zero Inference Overhead

The PEAR framework introduces a position‑embedding‑agnostic attention re‑weighting method that detects and suppresses detrimental attention heads in large language models, dramatically improving retrieval‑augmented generation performance without adding any inference overhead, as demonstrated on multiple RAG benchmarks and LLM families.

Attention Re-weightingLLMPEAR

0 likes · 6 min read

PEAR: Position-Embedding-Agnostic Attention Re-weighting Enhances Retrieval-Augmented Generation with Zero Inference Overhead

Tencent Cloud Developer

Apr 2, 2025 · Artificial Intelligence

Understanding Retrieval‑Augmented Generation (RAG): Concepts, Types, and Development

Retrieval‑Augmented Generation (RAG) enhances large language models by fetching up‑to‑date external knowledge before generation, mitigating knowledge‑cutoff limits and hallucinations through a retrieval step (using text, vector, or graph methods) and a generation step, evolving from naive single‑method approaches to advanced, modular, graph‑based, and agentic systems that enable adaptive, multi‑hop reasoning and future intelligent, multimodal pipelines.

AIAgentic AIKnowledge Retrieval

0 likes · 9 min read

Understanding Retrieval‑Augmented Generation (RAG): Concepts, Types, and Development

Architect

Apr 1, 2025 · Artificial Intelligence

When to Fine‑Tune Large Language Models vs. Relying on Prompting and RAG

The article explains why most projects should start with prompt engineering or simple agent workflows, outlines the scenarios where model fine‑tuning adds real value, compares fine‑tuning with Retrieval‑Augmented Generation, and offers practical criteria for deciding which approach to adopt.

AI deploymentLarge Language ModelsLoRA

0 likes · 9 min read

When to Fine‑Tune Large Language Models vs. Relying on Prompting and RAG

Architect

Mar 30, 2025 · Artificial Intelligence

What Is Retrieval-Augmented Generation? A Deep Dive into RAG Techniques

This article provides a comprehensive survey of Retrieval‑Augmented Generation (RAG), covering its basic principles, key components, seven technical variants, challenges, evaluation methods, and future research directions across multimodal, graph‑based, and agentic extensions.

AI SurveyKnowledge RetrievalLarge Language Models

0 likes · 9 min read

What Is Retrieval-Augmented Generation? A Deep Dive into RAG Techniques