Tagged articles

Vector Store

38 articles · Page 1 of 1

Jun 24, 2026 · Artificial Intelligence

How to Safely Delete Data in RAG Systems: Governance Best Practices

The article explains why data deletion is the most delicate stage in RAG governance, outlines four deletion categories, details the multi‑layer removal process across vector indexes, metadata, raw storage, backups, caches and session history, and proposes proactive lifecycle strategies to ensure compliance and auditability.

AIData GovernanceRAG

0 likes · 8 min read

How to Safely Delete Data in RAG Systems: Governance Best Practices

Coder Trainee

Jun 20, 2026 · Artificial Intelligence

Java RAG Tutorial: Vector Search and Knowledge‑Base Integration

This article explains how to equip a Java application with Retrieval‑Augmented Generation (RAG) so large language models can access private PDFs, Word files, and internal documents, covering the core architecture, two implementation paths using LangChain4j and Spring AI, vector‑store options, and practical tuning techniques.

JavaLangChain4jRAG

0 likes · 12 min read

Java RAG Tutorial: Vector Search and Knowledge‑Base Integration

James' Growth Diary

May 13, 2026 · Artificial Intelligence

Multimodal RAG: A Complete Guide to Ingesting Images, Tables, and PDFs

This article examines the blind spot of pure‑text RAG for visual content, compares three multimodal ingestion strategies—CLIP embeddings, image‑to‑text captioning with a MultiVectorRetriever, and ColPali visual retrieval—covers table‑specific handling, presents end‑to‑end TypeScript implementations, and lists common pitfalls to avoid when deploying production‑grade multimodal RAG pipelines.

CLIPColPaliImage Captioning

0 likes · 22 min read

Multimodal RAG: A Complete Guide to Ingesting Images, Tables, and PDFs

IT Services Circle

May 11, 2026 · Artificial Intelligence

Can Claude’s Code Generation Replace Agent Memory Systems? Understanding CLAUDE.md, Memory, and RAG

The article explains why large language model agents need dedicated memory systems to overcome limited context windows, outlines short‑term and long‑term memory architectures, storage forms, functional categories, lifecycle operations, control‑policy research, compares leading products, and presents best‑practice engineering guidelines for building scalable, privacy‑aware agent memory pipelines.

Agent MemoryControl PolicyRAG

0 likes · 25 min read

Can Claude’s Code Generation Replace Agent Memory Systems? Understanding CLAUDE.md, Memory, and RAG

James' Growth Diary

May 10, 2026 · Artificial Intelligence

Syncing Vectors with Changing Documents: Add, Update, Delete Made Simple

This article walks through why keeping a vector store consistent with a mutable knowledge base is challenging, explains the three failure points, introduces hash‑based incremental syncing, shows idempotent add, proper update and soft‑delete workflows, covers embedding model upgrades, and presents a production‑grade event‑driven architecture with common pitfalls and remedies.

Hash DeduplicationLangChainRAG

0 likes · 17 min read

Syncing Vectors with Changing Documents: Add, Update, Delete Made Simple

AgentGuide

Apr 17, 2026 · Artificial Intelligence

Designing Short‑Term and Long‑Term Memory for AI Agents: Key Strategies and Trade‑offs

The article explains how to split an AI agent's memory into short‑term and long‑term layers, compares fixed‑window truncation with rolling summarisation for session memory, and details building a vector‑based long‑term store, its benefits, drawbacks, and governance practices.

Agent MemoryLLMShort-term Memory

0 likes · 6 min read

Designing Short‑Term and Long‑Term Memory for AI Agents: Key Strategies and Trade‑offs

AI Step-by-Step

Apr 14, 2026 · Artificial Intelligence

How Hermes Memory Splits Knowledge for Efficient Agent Recall

The article analyzes Hermes' memory architecture, showing how it separates user preferences, environmental facts, conversation history, and procedural skills into distinct storage layers—file‑based defaults for high‑frequency data and vector‑based augmentation for large‑scale semantic retrieval—thereby improving reliability, transparency, and maintainability of LLM agents.

AgentFile MemoryHermes

0 likes · 12 min read

How Hermes Memory Splits Knowledge for Efficient Agent Recall

Alibaba Cloud Big Data AI Platform

Mar 31, 2026 · Artificial Intelligence

How to Build a Production‑Ready AI Memory System with Mem0 and Elasticsearch

This guide explains how to overcome the stateless nature of large language models by using the Mem0 framework together with Elasticsearch to create a persistent, vector‑searchable memory layer, covering architecture, real‑world scenarios, step‑by‑step deployment, and integration with the OpenClaw agent framework.

AI memoryElasticsearchLLM

0 likes · 15 min read

How to Build a Production‑Ready AI Memory System with Mem0 and Elasticsearch

AI Tech Publishing

Mar 28, 2026 · Artificial Intelligence

Designing Agent Memory Systems: Four Types, Three Strategies, and Full Python Implementation

This article breaks down agentic memory into four distinct types—In‑context, External, Episodic, and Semantic/Parametric—explains three forgetting strategies (time decay, importance scoring, periodic consolidation), shows how memory flows through an agent loop, and provides complete Python code using OpenAI embeddings and ChromaDB for a production‑ready memory layer.

Agent MemoryChromaDBLLM

0 likes · 22 min read

Designing Agent Memory Systems: Four Types, Three Strategies, and Full Python Implementation

LuTiao Programming

Mar 10, 2026 · Artificial Intelligence

Why Most Java Teams Miss the Quiet AI Revolution Brought by Spring AI

Spring AI eliminates SDK fragmentation, side‑car Python services, and operational complexity by providing a unified AI abstraction layer for Java, enabling seamless model switching, RAG, tool calling, and offering concrete performance data and best‑practice guidance for production use.

AI integrationChatClientJava

0 likes · 12 min read

Why Most Java Teams Miss the Quiet AI Revolution Brought by Spring AI

Data STUDIO

Feb 22, 2026 · Artificial Intelligence

Building AI Agents with LangGraph: Implementing RAG and Long‑Term Memory

This tutorial walks through adding Retrieval‑Augmented Generation (RAG) and persistent long‑term memory to a LangGraph AI agent, covering concepts, step‑by‑step code for document loading, vector store creation, prompt engineering, memory management, and best‑practice pitfalls.

AI AgentEmbeddingLangChain

0 likes · 16 min read

Building AI Agents with LangGraph: Implementing RAG and Long‑Term Memory

Alibaba Cloud Native

Dec 27, 2025 · Artificial Intelligence

Unlocking AI Agent Memory: Short‑Term vs Long‑Term Strategies and Framework Integration

This article explains how AI agents overcome context window limits by using memory systems, distinguishes short‑term (session) and long‑term (cross‑session) memory, compares implementations in Google ADK, LangChain and AgentScope, and outlines context‑engineering techniques, core components, challenges, and emerging trends.

AI memoryLLMShort-term Memory

0 likes · 20 min read

Unlocking AI Agent Memory: Short‑Term vs Long‑Term Strategies and Framework Integration

Fun with Large Models

Nov 30, 2025 · Artificial Intelligence

Multimodal RAG with LangChain: PDF Parsing, Chunking, and Citation Guide

This article walks through building a LangChain‑based multimodal RAG system that parses PDFs (both native and scanned), splits them into semantic chunks, stores embeddings in a vector database, and generates answers with precise source citations, complete with code samples and API integration.

FastAPILangChainMultimodal AI

0 likes · 20 min read

Multimodal RAG with LangChain: PDF Parsing, Chunking, and Citation Guide

BirdNest Tech Talk

Oct 27, 2025 · Artificial Intelligence

How LangChain’s Indexing API Enables Efficient Incremental Updates for RAG Systems

This article explains how LangChain's Indexing API adds state management and synchronization to the classic load‑split‑embed‑store RAG pipeline, detailing the RecordManager component, the index function workflow, key parameters, implementation considerations, and best‑practice code examples for production‑grade vector stores.

FAISSIndexing APILangChain

0 likes · 12 min read

How LangChain’s Indexing API Enables Efficient Incremental Updates for RAG Systems

Data Party THU

Sep 11, 2025 · Artificial Intelligence

How ComRAG Revolutionizes Real‑Time Community QA with Dynamic Vector Stores

ComRAG tackles the static‑knowledge gaps, uneven QA quality, and storage explosion of community question‑answer platforms by integrating a static documentation vector store with dual dynamic CQA stores managed via a centroid‑based memory, delivering higher accuracy, lower latency, and scalable storage for industrial retrieval‑augmented generation.

Community QADynamic RetrievalLLM

0 likes · 7 min read

How ComRAG Revolutionizes Real‑Time Community QA with Dynamic Vector Stores

Ops Development & AI Practice

Jul 29, 2025 · Artificial Intelligence

Building a Retrieval‑Augmented Generation QA Bot to Keep LLMs Up‑to‑Date

This article explains how to create a RAG‑based intelligent QA system that fetches the latest documentation (e.g., PlantUML) before querying Gemini, detailing knowledge‑base creation, embedding, vector store management, LangChain integration, and deployment tips.

AI assistantEmbeddingGemini

0 likes · 8 min read

Building a Retrieval‑Augmented Generation QA Bot to Keep LLMs Up‑to‑Date

Spring Full-Stack Practical Cases

Jul 23, 2025 · Backend Development

Build a Spring Boot 3 Web QA App with Vector Store and AI in Minutes

This guide walks through creating a Spring Boot 3 application that loads web pages into a vector database using Spring AI, configures AI models and embeddings, and exposes REST endpoints for storing documents and answering queries with retrieval‑augmented generation.

MilvusSpring AISpring Boot

0 likes · 10 min read

Build a Spring Boot 3 Web QA App with Vector Store and AI in Minutes

Ops Development Stories

Jun 30, 2025 · Artificial Intelligence

Build a Private AI Knowledge Assistant with n8n: Zero‑Code RAG in 30 Minutes

This guide shows how to create a fully local Retrieval‑Augmented Generation (RAG) system using n8n, Docker, Ollama and the free Qwen3 embedding model, enabling secure, up‑to‑date AI assistants that answer enterprise questions without exposing any proprietary data.

AI assistantDockerEmbedding

0 likes · 17 min read

Build a Private AI Knowledge Assistant with n8n: Zero‑Code RAG in 30 Minutes

Programmer DD

May 21, 2025 · Artificial Intelligence

What’s New in Spring AI 1.0 GA? A Deep Dive into Java AI Features

Spring AI 1.0 GA introduces a comprehensive suite of AI capabilities for Java developers, including a ChatClient supporting 20 models, vector‑store integrations, RAG pipelines, advanced chat memory, @Tool function calling, model evaluation, observability, Model Context Protocol, and autonomous agents, with examples for major cloud providers.

AI modelsJavaMCP

0 likes · 6 min read

What’s New in Spring AI 1.0 GA? A Deep Dive into Java AI Features

Java Tech Enthusiast

May 21, 2025 · Artificial Intelligence

How ChatGPT's New Memory Feature Works: Technical Analysis and Implementation Details

The article provides a detailed technical breakdown of OpenAI's new ChatGPT memory feature, explaining its two memory modes, underlying sub‑systems, possible implementation approaches using vector stores and scheduled jobs, and early user feedback highlighting both benefits and bugs.

AIChatGPTRAG

0 likes · 8 min read

How ChatGPT's New Memory Feature Works: Technical Analysis and Implementation Details

Qborfy AI

Mar 29, 2025 · Artificial Intelligence

Mastering LangChain: Build LLM Apps with Chains, Agents, and Vector Stores

This tutorial walks through the limitations of simple prompt usage, introduces LangChain as a framework for building full‑featured LLM applications, explains its core concepts and components, and provides step‑by‑step code examples for installing, configuring, and running a basic LangChain demo.

AI ApplicationAgentsLLM

0 likes · 11 min read

Mastering LangChain: Build LLM Apps with Chains, Agents, and Vector Stores

Alibaba Cloud Native

Mar 19, 2025 · Artificial Intelligence

Mastering Retrieval‑Augmented Generation with Spring AI: A Complete Guide

This article explains the Retrieval‑Augmented Generation (RAG) paradigm, walks through its four core steps, and provides a detailed Spring AI implementation—including configuration, vector storage, REST controller, multi‑query expansion, query rewriting, document joining, and error handling—plus best‑practice recommendations for production deployments.

AIJavaRAG

0 likes · 23 min read

Mastering Retrieval‑Augmented Generation with Spring AI: A Complete Guide

AI Large Model Application Practice

Oct 30, 2024 · Artificial Intelligence

How to Efficiently Incrementally Update Knowledge in RAG Applications

Incremental knowledge updates in Retrieval‑Augmented Generation (RAG) systems can be achieved by using document‑level or chunk‑level strategies, leveraging hash fingerprints, record managers, and framework‑specific APIs such as LangChain’s index() with cleanup modes or LlamaIndex’s ingestion pipeline, reducing redundant computation and cost.

LangChainLlamaIndexRAG

0 likes · 12 min read

How to Efficiently Incrementally Update Knowledge in RAG Applications

21CTO

Aug 17, 2024 · Artificial Intelligence

Vector Store vs Vector Database: Which Powers Your AI Apps Better?

This guide explains the differences between vector stores and vector databases, covering vector embeddings, performance, scalability, integration, and ideal use‑cases, helping developers choose the right tool—or a hybrid approach—for AI applications.

AI embeddingsScalable ArchitectureVector Store

0 likes · 12 min read

Vector Store vs Vector Database: Which Powers Your AI Apps Better?

Eric Tech Circle

Apr 11, 2024 · Artificial Intelligence

Build a Generative AI RAG App with Spring AI in Minutes

This guide walks you through setting up Spring AI, configuring model providers and vector stores, initializing a Spring Boot project, adding OpenAI credentials, and running a complete RAG (Retrieval‑Augmented Generation) demo with code snippets and sample API calls.

JavaOpenAIRAG

0 likes · 15 min read

Build a Generative AI RAG App with Spring AI in Minutes

Sohu Tech Products

Mar 13, 2024 · Artificial Intelligence

Build a Minimal Retrieval‑Augmented Generation (Tiny‑RAG) from Scratch

This step‑by‑step guide explains how to implement a lightweight Retrieval‑Augmented Generation system—Tiny‑RAG—by creating embedding classes, loading and chunking documents, building a simple vector store, performing similarity search, and integrating a large language model for answer generation, complete with runnable Python code.

EmbeddingLLMPython

0 likes · 14 min read

Build a Minimal Retrieval‑Augmented Generation (Tiny‑RAG) from Scratch

Rare Earth Juejin Tech Community

Nov 29, 2023 · Artificial Intelligence

Building a Private LLM‑Powered Knowledge Base with LangChain and ChatGLM3

This article explains how to migrate personal notes into a private knowledge base by combining a large language model with an external vector store, detailing the concepts of tokenization, embedding, vector databases, and step‑by‑step deployment using LangChain‑Chatchat and the open‑source ChatGLM3 model.

ChatGLM3EmbeddingKnowledge Base

0 likes · 10 min read

Building a Private LLM‑Powered Knowledge Base with LangChain and ChatGLM3

Rare Earth Juejin Tech Community

Nov 24, 2023 · Artificial Intelligence

Step-by-Step Guide to Deploying LangChain‑Chatchat with ChatGLM‑2 on a Local Machine

This article provides a comprehensive tutorial on setting up the LangChain‑Chatchat open‑source project, covering environment preparation, model and embedding downloads, configuration files, database initialization, API service launch, and example curl commands for interacting with both the large language model and the local knowledge base.

APIChatGLMEmbedding

0 likes · 9 min read

Step-by-Step Guide to Deploying LangChain‑Chatchat with ChatGLM‑2 on a Local Machine

Baobao Algorithm Notes

Nov 7, 2023 · Artificial Intelligence

A Complete Technical Guide to LLM Foundations, Advanced Topics, Fine‑Tuning, and LangChain Applications

This article provides an in‑depth technical overview of large language models (LLMs), covering core model families, architectural differences, emergent abilities, common challenges such as repetition and token limits, detailed fine‑tuning strategies including PEFT, practical guidance for training custom models, and a thorough introduction to the LangChain framework with code examples, core concepts, and troubleshooting tips for building LLM‑powered applications.

LLMLangChainVector Store

0 likes · 97 min read

A Complete Technical Guide to LLM Foundations, Advanced Topics, Fine‑Tuning, and LangChain Applications

dbaplus Community

Oct 14, 2023 · Artificial Intelligence

Demystifying Retrieval‑Augmented Generation: From Theory to Working Chatbot

This guide explains the Retrieval‑Augmented Generation (RAG) technique, detailing how user queries are matched to private knowledge bases, how relevant passages are retrieved, and how large language models use those passages to generate context‑aware answers, complete with code examples and practical tips.

ChatbotEmbeddingLLM

0 likes · 19 min read

Demystifying Retrieval‑Augmented Generation: From Theory to Working Chatbot

ByteFE

Oct 11, 2023 · Artificial Intelligence

CR Copilot: An Open‑Source LLM‑Based Code Review Assistant with Private Knowledge Base

This article describes the design and implementation of a code‑review assistant powered by open‑source large language models and a privately hosted knowledge base, covering background, pain points, system architecture, model selection, vector‑store integration, prompt engineering, diff parsing, and practical reflections.

AIKnowledge BaseLLM

0 likes · 24 min read

CR Copilot: An Open‑Source LLM‑Based Code Review Assistant with Private Knowledge Base

UCloud Tech

Sep 11, 2023 · Artificial Intelligence

Build a Soul‑Healing Chatbot with LangChain & Llama 2: A Step‑by‑Step Guide

This article walks through constructing a domain‑specific, soul‑healing chatbot using LangChain and Llama 2, comparing fine‑tuning versus external knowledge bases, detailing environment setup, data loading, text splitting, embedding with a Chinese model, vector store creation, prompt engineering, inference, and optimization strategies.

Knowledge BaseLangChainLlama 2

0 likes · 14 min read

Build a Soul‑Healing Chatbot with LangChain & Llama 2: A Step‑by‑Step Guide

JD Cloud Developers

Aug 15, 2023 · Artificial Intelligence

Unlocking LangChain: A Deep Dive into LLM‑Powered Application Development

This article explains what LangChain is, outlines its core components such as Models, Indexes, Chains, Memory and Agents, provides practical code examples for building summarization and QA pipelines, and discusses future directions for LLM‑centric development.

AgentsLLMLangChain

0 likes · 22 min read

Unlocking LangChain: A Deep Dive into LLM‑Powered Application Development

Architect's Guide

Aug 10, 2023 · Artificial Intelligence

Getting Started with LangChain: Building LLM Applications in Python

This tutorial introduces LangChain, an open‑source Python framework that provides unified model access, prompt management, memory, retrieval, and tool integration, enabling developers to quickly prototype AI‑driven applications using large language models and various external data sources.

AgentsLLMLangChain

0 likes · 13 min read

Getting Started with LangChain: Building LLM Applications in Python

MoonWebTeam

Jul 28, 2023 · Artificial Intelligence

Unlocking LangChain: A Complete Guide to Building LLM Applications

This article introduces LangChain, explains its architecture and core components, and provides step‑by‑step Python examples for chat models, embeddings, prompts, indexes, chains, memory, agents, and practical use‑cases such as QA bots, web search, summarization, and persistent vector stores.

LLMLangChainPython

0 likes · 34 min read

Unlocking LangChain: A Complete Guide to Building LLM Applications

Alibaba Cloud Big Data AI Platform

Jul 26, 2023 · Artificial Intelligence

Build a Custom LLM Chatbot with Hologres Vector Store and LangChain

This guide shows how to create a domain‑specific chatbot by combining large language models, prompt‑tuning, and Alibaba Cloud's Hologres real‑time vector store, covering prerequisites, environment setup, code configuration, vector data preparation, verification, and optional DingTalk integration.

ChatbotHologresLLM

0 likes · 12 min read

Build a Custom LLM Chatbot with Hologres Vector Store and LangChain

Volcano Engine Developer Services

Jul 21, 2023 · Artificial Intelligence

Build a Custom AI Q&A System Using Volcano Engine Cloud Search & LangChain

This guide explains how to create a domain‑specific intelligent Q&A system by leveraging prompt‑tuning, Volcano Engine Cloud Search vector store, and LangChain, covering model selection, data embedding, vector indexing, retrieval, and LLM integration with full code examples.

AI Q&AEmbeddingLLM

0 likes · 8 min read

Build a Custom AI Q&A System Using Volcano Engine Cloud Search & LangChain

21CTO

May 31, 2023 · Artificial Intelligence

How to Build a Private, Offline GPT with Python – Step‑by‑Step Guide

This tutorial explains how to set up PrivateGPT, a Python‑based offline LLM solution that runs locally without sending any data to the cloud, covering environment preparation, model download, repository cloning, data ingestion, and interactive querying.

Offline AIPrivateGPTPython

0 likes · 5 min read

How to Build a Private, Offline GPT with Python – Step‑by‑Step Guide