Tagged articles

LangChain

346 articles · Page 1 of 4
Code Mala Tang
Code Mala Tang
Jun 28, 2026 · Artificial Intelligence

7 Essential Things to Know About MCP AI (Multi‑Context Prompting)

MCP AI, a multi‑context prompting approach, replaces linear chat interactions by maintaining several active contexts that the model can switch between, solving context‑window limits, improving coherence, and enabling system‑level workflows, while requiring proper role definition, rules, and feedback loops.

AI ArchitectureClaudeCrewAI
0 likes · 7 min read
7 Essential Things to Know About MCP AI (Multi‑Context Prompting)
Fun with Large Models
Fun with Large Models
Jun 27, 2026 · Artificial Intelligence

Quick Guide to LangChain DeepAgents: Exploring the Production‑Grade DeepAgents Code Framework

This article provides a comprehensive walkthrough of the DeepAgents Code repository, explaining its client‑server architecture, module organization, technology stack—including DeepAgents SDK, Textual UI, SQLite persistence, and streaming protocol—and the design rationale behind building a production‑ready AI agent framework.

AI AgentsDeepAgentsLangChain
0 likes · 14 min read
Quick Guide to LangChain DeepAgents: Exploring the Production‑Grade DeepAgents Code Framework
AI Architecture Hub
AI Architecture Hub
Jun 19, 2026 · Artificial Intelligence

Mastering Loop Engineering: Building Robust AI Agents with LangChain

This article explains how to construct reliable, high‑value AI agents by stacking four loop layers—agent, validation, event‑driven, and optimization—using LangChain’s create_agent, RubricMiddleware, LangSmith Deployment, and Engine, while discussing trade‑offs, human oversight, and future RL‑based enhancements.

AI AgentsAgent validationEvent-driven automation
0 likes · 9 min read
Mastering Loop Engineering: Building Robust AI Agents with LangChain
Coder Trainee
Coder Trainee
Jun 16, 2026 · Artificial Intelligence

Building a Data Analysis AI Agent: From Basics to Real‑World Implementation

This article walks through the design and implementation of a data‑analysis AI agent that converts natural‑language queries into SQL, executes them on a SQLite sales database, generates visualizations, and produces insight reports, complete with architecture diagrams and full Python code examples.

AI AgentData VisualizationLLM
0 likes · 9 min read
Building a Data Analysis AI Agent: From Basics to Real‑World Implementation
DeepHub IMBA
DeepHub IMBA
Jun 16, 2026 · Artificial Intelligence

10 Essential LangChain & LangGraph Concepts Every AI Engineer Must Master

The article outlines ten core concepts—State, Node, Chain vs Graph, Routing, Retrieval, Structured Output, Streaming, Memory, Checkpointing, and Human‑in‑the‑Loop—explaining why they are crucial for building reliable, scalable AI agents and showing concrete Python examples for each.

AI AgentsLangChainLangGraph
0 likes · 11 min read
10 Essential LangChain & LangGraph Concepts Every AI Engineer Must Master
ZhiKe AI
ZhiKe AI
Jun 16, 2026 · Artificial Intelligence

What Is LangChain? Turning Scattered LLM Steps into Standardized Components

LangChain is an LLM application framework that standardizes development steps into reusable components linked by a unified syntax (LCEL), offering modules such as Models, Prompts, Chains, Agents, Tools, and Memory, and shows measurable benefits like 17% lower latency and halved development time for multi‑step workflows.

AI FrameworkAgentsLLM
0 likes · 4 min read
What Is LangChain? Turning Scattered LLM Steps into Standardized Components
Coder Trainee
Coder Trainee
Jun 15, 2026 · Artificial Intelligence

Building a Smart AI Coding Assistant: From Design to Real‑World Use

This tutorial walks through the functional planning, project layout, core Python code, context management, FastAPI service, and execution steps needed to create a full‑featured AI coding assistant that can generate, explain, refactor, fix, test, and review code.

AI AgentFastAPILangChain
0 likes · 9 min read
Building a Smart AI Coding Assistant: From Design to Real‑World Use
SpringMeng
SpringMeng
Jun 14, 2026 · Artificial Intelligence

How I Built an AI Contract Review System for 60,000 RMB in One Month

In 45 days a two‑person team delivered an AI‑powered contract review platform that parses PDFs, extracts key clauses, flags risks, and integrates with enterprise tools, using Python, FastAPI, LangChain, large language models, vector databases and OCR technologies.

AIContract ReviewFastAPI
0 likes · 7 min read
How I Built an AI Contract Review System for 60,000 RMB in One Month
Coder Trainee
Coder Trainee
Jun 13, 2026 · Artificial Intelligence

AI Agent Observability and Debugging: Building a Transparent Agent System

This article explains why AI agents behave like black boxes, introduces a three‑pillar observability framework (tracing, metrics, logging), demonstrates practical tracing with LangSmith and LangFuse, shows how to instrument agents with custom metrics, evaluate performance, and share best‑practice guidelines for production‑ready debugging.

AI AgentLangChainLangFuse
0 likes · 19 min read
AI Agent Observability and Debugging: Building a Transparent Agent System
Java Architect Handbook
Java Architect Handbook
Jun 13, 2026 · Artificial Intelligence

Why Fixed-Size Chunking Fails in RAG: Interview Insights

The article explains that fixed-size chunking in Retrieval‑Augmented Generation ignores semantic boundaries, causing broken sentences, scattered topics, redundant or missing information, and noisy retrieval, and it evaluates overlap as a partial fix while presenting better alternatives such as recursive, semantic, structural, and agentic chunking along with practical production tips and future trends.

AI interviewChunkingLangChain
0 likes · 12 min read
Why Fixed-Size Chunking Fails in RAG: Interview Insights
Coder Trainee
Coder Trainee
Jun 11, 2026 · Artificial Intelligence

Deep Dive into Function Calling for AI Agents: Enabling External Tool Integration

This article explains the concept of Function Calling in large language models, walks through defining function schemas, shows step‑by‑step API call flows, demonstrates multi‑tool orchestration, parallel execution, tool‑chain composition, and integrates Function Calling with LangChain, while providing best‑practice guidelines and code examples.

AI AgentsFunction CallingLangChain
0 likes · 16 min read
Deep Dive into Function Calling for AI Agents: Enabling External Tool Integration
Coder Trainee
Coder Trainee
Jun 9, 2026 · Backend Development

Building Java AI Agents with Spring AI: A Hands‑On Guide

This article walks Java developers through using Spring AI to build AI agents, comparing it with Python's LangChain, detailing architecture, environment setup, prompt templates, tool integration, RAG implementation, production‑grade features, and a side‑by‑side feature comparison.

AI AgentJavaLangChain
0 likes · 17 min read
Building Java AI Agents with Spring AI: A Hands‑On Guide
Fun with Large Models
Fun with Large Models
Jun 9, 2026 · Artificial Intelligence

Master AI Agents: 6 Essential GitHub Projects to Learn From

The article outlines a progressive learning path for AI agents, recommending six GitHub projects—from a beginner-friendly tutorial to production‑grade frameworks—detailing each project's purpose, difficulty, key takeaways, and suitable audience, helping programmers transition from users to builders.

AI AgentsAgent developmentGitHub
0 likes · 15 min read
Master AI Agents: 6 Essential GitHub Projects to Learn From
Coder Trainee
Coder Trainee
Jun 8, 2026 · Artificial Intelligence

Rapidly Build AI Agents with LangChain: A Hands‑On Tutorial

This article walks through why LangChain is the leading framework for AI agents, compares it with low‑level implementations, and provides step‑by‑step code examples for installation, prompt templates, LCEL pipelines, memory modules, RAG, custom tools, and a complete customer‑service agent, concluding with a concise feature comparison.

AI AgentsLLMLangChain
0 likes · 14 min read
Rapidly Build AI Agents with LangChain: A Hands‑On Tutorial
Coder Trainee
Coder Trainee
Jun 7, 2026 · Artificial Intelligence

AI Agent Deep Dive: Understanding Planning, Memory, Tools, and Action

This article revisits the AI Agent architecture and provides a detailed analysis of its four core components—Planning, Memory, Tools, and Action—covering mainstream planning strategies, memory types, tool specifications, and execution loops, accompanied by concrete LangChain code examples that demonstrate building a fully integrated multi‑component agent.

AI AgentLangChainPlanning
0 likes · 12 min read
AI Agent Deep Dive: Understanding Planning, Memory, Tools, and Action
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Jun 7, 2026 · Artificial Intelligence

Build an Enterprise RAG Vector Search System from Scratch with LangChain, Easysearch, and MiMo

This article walks through the complete end‑to‑end pipeline for building a production‑grade RAG system—including document chunking, embedding generation via MiMo, vector storage and kNN retrieval in Easysearch, hybrid search configuration, prompt engineering, answer generation, interactive chat, and a detailed list of common pitfalls and fixes.

EasysearchLangChainMiMo
0 likes · 17 min read
Build an Enterprise RAG Vector Search System from Scratch with LangChain, Easysearch, and MiMo
Coder Trainee
Coder Trainee
Jun 6, 2026 · Artificial Intelligence

What Is an AI Agent? From Large Language Models to Autonomous Agents

This article explains why large language models are powerful yet limited, defines AI agents as autonomous systems that combine a model, memory, tools, and actions, details the ReAct reasoning‑and‑acting loop, provides a 30‑line Python LangChain example and a Java Spring AI implementation, and outlines five practical use‑case scenarios and the roadmap for the series.

AI AgentJavaLangChain
0 likes · 10 min read
What Is an AI Agent? From Large Language Models to Autonomous Agents
AI Engineering
AI Engineering
Jun 1, 2026 · Artificial Intelligence

Why Do Most Agent Projects Fail Before Launch? LangChain’s Solution

The article explains why many AI Agent projects collapse before production due to non‑determinism, error propagation, and creative solutions, and presents LangChain’s Deep Agent evaluation framework—integrated with LangSmith, AWS Bedrock, and Pytest—to provide a reproducible, end‑to‑end testing and monitoring process.

AWS BedrockAgent evaluationDeep Agent
0 likes · 9 min read
Why Do Most Agent Projects Fail Before Launch? LangChain’s Solution
James' Growth Diary
James' Growth Diary
May 28, 2026 · Artificial Intelligence

Mastering Prompt Engineering: Few‑Shot, Chain‑of‑Thought, and Self‑Consistency Techniques

This article breaks down three core prompt‑engineering techniques—Few‑Shot prompting for output format stability, Chain‑of‑Thought for multi‑step reasoning, and Self‑Consistency for answer robustness—showing when to use each, how to combine them in LangChain, and providing concrete code examples, performance data, and common pitfalls.

Chain-of-ThoughtDynamic RoutingFew-shot
0 likes · 30 min read
Mastering Prompt Engineering: Few‑Shot, Chain‑of‑Thought, and Self‑Consistency Techniques
James' Growth Diary
James' Growth Diary
May 27, 2026 · Operations

Detecting Agent Silent Killers: Early Alerts for Latency Spikes, Token Explosions, and Infinite Loops

The article presents a three‑layer monitoring system—LangSmith tracing, Prometheus metrics, and Alertmanager alerts—together with concrete metric definitions, alert rules, and code examples to proactively detect latency spikes, token overuse, and dead‑loop cycles in production LLM agents, while also outlining common pitfalls and best‑practice recommendations.

AgentCostAlertLLM
0 likes · 18 min read
Detecting Agent Silent Killers: Early Alerts for Latency Spikes, Token Explosions, and Infinite Loops
James' Growth Diary
James' Growth Diary
May 25, 2026 · Artificial Intelligence

Practical Agent Performance Tuning: Slash Latency 75%, Cut Token Costs 71%, Boost Throughput 217%

The article walks through a systematic performance map of LangChain agents and demonstrates concrete latency, token‑usage, and concurrency optimizations—streaming responses, Redis caching, model routing, prompt trimming, context summarisation, dynamic tool selection, parallel graph nodes and batch processing—showing real‑world gains of up to 75% lower latency, 71% fewer tokens and a 217% throughput increase.

Agent OptimizationLangChainLangGraph
0 likes · 30 min read
Practical Agent Performance Tuning: Slash Latency 75%, Cut Token Costs 71%, Boost Throughput 217%
AI Engineer Programming
AI Engineer Programming
May 25, 2026 · Artificial Intelligence

From Demo to Production: Building a Reliable Agent Development Lifecycle

The article outlines a four‑stage agent development lifecycle—Build, Test, Deploy, Monitor—explaining how early, iterative delivery, systematic testing, controlled deployment, and continuous monitoring transform experimental agents into reliable production systems while addressing governance, cost, and scalability challenges.

AgentGovernanceLangChain
0 likes · 16 min read
From Demo to Production: Building a Reliable Agent Development Lifecycle
James' Growth Diary
James' Growth Diary
May 24, 2026 · Artificial Intelligence

End-to-End Observability with LangSmith: Trace Debugging and RAG Evaluation from Development to Production

This article walks through LangSmith’s three core capabilities—Trace, Evaluation, and Dataset management—showing how to integrate zero‑code tracing, quantify RAG performance with custom evaluators, run version‑comparison experiments, and set up production monitoring with sampling and feedback loops.

LangChainLangSmithObservability
0 likes · 23 min read
End-to-End Observability with LangSmith: Trace Debugging and RAG Evaluation from Development to Production
DeepHub IMBA
DeepHub IMBA
May 23, 2026 · Artificial Intelligence

Reason → Act → Observe: Building an Agentic Loop with LangChain and Python

This article explains what an agentic loop is, contrasts it with single‑pass chatbots, outlines its five stages, shows a visual architecture, walks through a concrete multi‑step example, provides Python pseudocode and a LangChain implementation, and discusses when to use or avoid such loops.

AI AgentsAgentic LoopLLM
0 likes · 8 min read
Reason → Act → Observe: Building an Agentic Loop with LangChain and Python
James' Growth Diary
James' Growth Diary
May 21, 2026 · Databases

Building a Neo4j Knowledge Graph: Entity Modeling, Cypher Queries, and LangChain Integration

This article walks through why graph databases excel at multi‑hop queries, compares Neo4j with relational and vector stores, explains core concepts of nodes, relationships and properties, shows Docker setup, demonstrates six common Cypher patterns, integrates LangChain for LLM‑generated queries, and shares production‑grade modeling tips and pitfalls.

CypherKnowledge GraphLangChain
0 likes · 19 min read
Building a Neo4j Knowledge Graph: Entity Modeling, Cypher Queries, and LangChain Integration
James' Growth Diary
James' Growth Diary
May 20, 2026 · Artificial Intelligence

Boosting RAG Retrieval Quality with Cohere Rerank and Cross‑Encoder

After achieving high recall with hybrid Elasticsearch and vector search, the article shows how inserting a reranker—either Cohere's cloud API or a local Cross‑Encoder—compresses the top‑20 candidates to the most relevant three to five, dramatically improving answer accuracy, cutting token costs, and detailing a dual‑track implementation for production and development environments.

CohereCross-EncoderLangChain
0 likes · 22 min read
Boosting RAG Retrieval Quality with Cohere Rerank and Cross‑Encoder
DeepHub IMBA
DeepHub IMBA
May 14, 2026 · Artificial Intelligence

How HyDE Transforms RAG Retrieval from Keyword Matching to Intent Understanding

The article explains how Hypothetical Document Embeddings (HyDE) improve Retrieval‑Augmented Generation by generating a synthetic answer before vector search, allowing the system to embed richer semantic intent rather than relying on shallow keyword similarity, and provides a step‑by‑step implementation using LangChain.

HyDELLMLangChain
0 likes · 6 min read
How HyDE Transforms RAG Retrieval from Keyword Matching to Intent Understanding
James' Growth Diary
James' Growth Diary
May 14, 2026 · Artificial Intelligence

LLM Semantic Routing Explained: Model‑Based Intent Classification and Three Keyword‑Matching Pitfalls

This article breaks down LLM semantic routing as a classifier, compares keyword, embedding, and LLM‑based routes, provides full TypeScript implementations, introduces hybrid routing for speed and accuracy, and covers production‑grade observability and dynamic configuration to avoid common pitfalls.

Hybrid RoutingLLMLangChain
0 likes · 33 min read
LLM Semantic Routing Explained: Model‑Based Intent Classification and Three Keyword‑Matching Pitfalls
James' Growth Diary
James' Growth Diary
May 13, 2026 · Artificial Intelligence

Multimodal RAG: A Complete Guide to Ingesting Images, Tables, and PDFs

This article examines the blind spot of pure‑text RAG for visual content, compares three multimodal ingestion strategies—CLIP embeddings, image‑to‑text captioning with a MultiVectorRetriever, and ColPali visual retrieval—covers table‑specific handling, presents end‑to‑end TypeScript implementations, and lists common pitfalls to avoid when deploying production‑grade multimodal RAG pipelines.

CLIPColPaliImage Captioning
0 likes · 22 min read
Multimodal RAG: A Complete Guide to Ingesting Images, Tables, and PDFs
James' Growth Diary
James' Growth Diary
May 12, 2026 · Artificial Intelligence

GraphRAG Deep Dive: Boost Multi‑Hop Reasoning Accuracy from 50% to 85% with Knowledge Graphs

This article explains why traditional vector RAG loses relational information, how GraphRAG reconstructs entity‑relationship triples into a knowledge graph, and provides step‑by‑step code, performance benchmarks, retrieval modes, and practical tips that raise multi‑hop reasoning accuracy from around 50% to 85%.

GraphRAGKnowledge GraphLangChain
0 likes · 14 min read
GraphRAG Deep Dive: Boost Multi‑Hop Reasoning Accuracy from 50% to 85% with Knowledge Graphs
James' Growth Diary
James' Growth Diary
May 11, 2026 · Artificial Intelligence

Mastering RAG Evaluation: Recall@K, MRR, NDCG, and RAGAS Explained

This article breaks down RAG evaluation into a two‑layer framework, explains the four core metrics—Recall@K, MRR, NDCG, and the four RAGAS scores—shows how to implement them with LangChain.js, highlights common pitfalls, and offers scenario‑specific metric combinations for reliable performance monitoring.

EvaluationLangChainMRR
0 likes · 20 min read
Mastering RAG Evaluation: Recall@K, MRR, NDCG, and RAGAS Explained
James' Growth Diary
James' Growth Diary
May 10, 2026 · Artificial Intelligence

Syncing Vectors with Changing Documents: Add, Update, Delete Made Simple

This article walks through why keeping a vector store consistent with a mutable knowledge base is challenging, explains the three failure points, introduces hash‑based incremental syncing, shows idempotent add, proper update and soft‑delete workflows, covers embedding model upgrades, and presents a production‑grade event‑driven architecture with common pitfalls and remedies.

Hash DeduplicationLangChainRAG
0 likes · 17 min read
Syncing Vectors with Changing Documents: Add, Update, Delete Made Simple
IT Services Circle
IT Services Circle
May 9, 2026 · Artificial Intelligence

How to Choose Between LangChain and LlamaIndex: Core Use‑Case Comparison for Agent Development

The article analyzes the design philosophies, key components, strengths, and weaknesses of LangChain and LlamaIndex, explains their distinct core scenarios—complex multi‑step agent orchestration versus private‑data RAG—and shows how they can be combined in real projects while outlining emerging ecosystem trends.

AgentLLMLangChain
0 likes · 13 min read
How to Choose Between LangChain and LlamaIndex: Core Use‑Case Comparison for Agent Development
James' Growth Diary
James' Growth Diary
May 8, 2026 · Artificial Intelligence

How to Test Multi‑Agent Systems? Mock LLM and Graph Replay Explained

The article analyzes why testing Multi‑Agent systems is difficult—due to LLM output randomness, cross‑node state propagation, and tool side‑effects—and presents a systematic solution using mock LLMs, MemorySaver checkpoints with graph replay, tool stubs, and a three‑layer testing pyramid while highlighting common pitfalls and best practices.

Graph ReplayLangChainMock LLM
0 likes · 14 min read
How to Test Multi‑Agent Systems? Mock LLM and Graph Replay Explained
Data Party THU
Data Party THU
May 7, 2026 · Artificial Intelligence

Step‑by‑Step Guide to Building a Multi‑Agent Trading System for End‑to‑End Intelligent Decisions

This article walks through constructing a multi‑agent trading platform—analysts, researchers, traders, risk managers, and a portfolio manager—using LangChain, LangGraph, and LLMs (gpt‑4o, gpt‑4o‑mini), with real‑time data tools, shared and long‑term memory, ReAct loops, structured debates, and a final executable trade proposal.

ChromaDBLLMLangChain
0 likes · 46 min read
Step‑by‑Step Guide to Building a Multi‑Agent Trading System for End‑to‑End Intelligent Decisions
James' Growth Diary
James' Growth Diary
May 7, 2026 · Artificial Intelligence

Three Design Patterns for Multi‑Agent Permission Isolation: Assigning Dedicated Toolsets

The article explains three architectural patterns—static binding, dynamic injection, and tool‑level guards—for isolating tool permissions in production‑grade multi‑agent LLM systems, compares their trade‑offs, shows concrete code examples, and highlights common pitfalls and best‑practice recommendations.

Dynamic InjectionLangChainPermission Isolation
0 likes · 16 min read
Three Design Patterns for Multi‑Agent Permission Isolation: Assigning Dedicated Toolsets
James' Growth Diary
James' Growth Diary
May 2, 2026 · Artificial Intelligence

How to Add Real‑Time Speech Recognition and Streaming TTS to Your AI Agent

This guide walks through choosing the right voice‑agent architecture, implementing streaming ASR with WebSocket, triggering sentence‑by‑sentence TTS, wiring the three layers together via async generators, optimizing latency to under a second, and avoiding common pitfalls such as missing VAD and checkpoint persistence.

LangChainText‑to‑SpeechWebSocket
0 likes · 19 min read
How to Add Real‑Time Speech Recognition and Streaming TTS to Your AI Agent
Data Party THU
Data Party THU
May 1, 2026 · Artificial Intelligence

LangChain vs LangGraph: Choosing Between a Toolkit and an Orchestration Layer

This article compares LangChain and LangGraph by implementing the same three‑stage code‑review pipeline with both frameworks, showing how LangChain offers a simple linear flow while LangGraph provides state‑machine orchestration for loops, conditional branches, and retries, and explains when each approach is preferable.

GeminiLLM workflowLangChain
0 likes · 8 min read
LangChain vs LangGraph: Choosing Between a Toolkit and an Orchestration Layer
AI Illustrated Series
AI Illustrated Series
Apr 28, 2026 · Artificial Intelligence

Comprehensive Interview Guide: LangChain & LangGraph Frameworks

This article provides a detailed, question‑and‑answer style walkthrough of LangChain and LangGraph, covering their core concepts, components, workflow patterns, memory mechanisms, LCEL syntax, graph construction, conditional edges, loops, multi‑agent collaboration, persistence, and a comparison with LlamaIndex, offering concrete code examples and practical insights for AI interview preparation.

AI FrameworkAgentLCEL
0 likes · 32 min read
Comprehensive Interview Guide: LangChain & LangGraph Frameworks
James' Growth Diary
James' Growth Diary
Apr 26, 2026 · Databases

Vector Database Fundamentals: Embedding, Similarity Search, and Index Structures Explained in One Go

This article walks through the complete workflow of turning split text into high‑dimensional vectors, choosing the right embedding model, selecting an appropriate similarity metric, comparing index structures such as Flat, IVF, HNSW and PQ, and finally picking a vector database and integrating it with LangChain.js for production‑grade RAG pipelines.

IndexingLangChainRAG
0 likes · 25 min read
Vector Database Fundamentals: Embedding, Similarity Search, and Index Structures Explained in One Go
AI Illustrated Series
AI Illustrated Series
Apr 26, 2026 · Artificial Intelligence

Build Your First LangChain Agent: A Hands‑On Framework Tutorial

This article walks through a practical, step‑by‑step construction of a LangChain agent—from basic concepts and a simple weather‑query agent to a more complex market‑research agent, adding memory and RAG capabilities, and finally comparing LangChain with LangGraph.

AI AgentLangChainPrompt Engineering
0 likes · 15 min read
Build Your First LangChain Agent: A Hands‑On Framework Tutorial
AI Architect Hub
AI Architect Hub
Apr 26, 2026 · Artificial Intelligence

Embedding Explained: How Vectorization Turns Text into Numbers for RAG

This article walks through why traditional keyword matching fails for RAG, explains the evolution from one‑hot encoding to Word2Vec and BERT, details sentence‑level embeddings and similarity metrics, compares leading Chinese and multilingual embedding models using the C‑MTEB benchmark, and provides practical LangChain code, deployment tips, and common pitfalls.

Chinese NLPEmbeddingLangChain
0 likes · 18 min read
Embedding Explained: How Vectorization Turns Text into Numbers for RAG
AI Architect Hub
AI Architect Hub
Apr 25, 2026 · Artificial Intelligence

How to Feed Massive Documents to an RAG System: Mastering the Art of Text Chunking

This article explains why proper text chunking is critical for Retrieval‑Augmented Generation, illustrates common pitfalls with real‑world examples, compares four chunking strategies (fixed length, recursive, structure‑aware, and code‑aware), and provides practical guidelines for chunk size, overlap, metadata handling, and a production‑ready pipeline.

AI RetrievalLangChainMetadata
0 likes · 21 min read
How to Feed Massive Documents to an RAG System: Mastering the Art of Text Chunking
James' Growth Diary
James' Growth Diary
Apr 25, 2026 · Artificial Intelligence

LangChain Memory Best Practices: Avoid Common Pitfalls and Choose the Right Module

This article dissects the most frequent LangChain Memory pitfalls—missing placeholders, wrong memory type, shared instances, and multi‑process issues—provides correct code patterns, compares the five built‑in memory classes, introduces the new RunnableWithMessageHistory approach, and offers a production‑ready checklist.

ChatOpenAIConversationSummaryBufferMemoryLLMChain
0 likes · 12 min read
LangChain Memory Best Practices: Avoid Common Pitfalls and Choose the Right Module
James' Growth Diary
James' Growth Diary
Apr 25, 2026 · Artificial Intelligence

Choosing the Right AI Memory: Truncation, Summarization, or Vector Retrieval

This article breaks down LangChain.js's three memory strategies—window truncation, summary compression, and vector‑store retrieval—explaining their inner workings, code setup, trade‑offs in token cost and information retention, and provides a decision guide for selecting the best approach in multi‑turn LLM conversations.

Conversation MemoryLLMLangChain
0 likes · 14 min read
Choosing the Right AI Memory: Truncation, Summarization, or Vector Retrieval
DeepHub IMBA
DeepHub IMBA
Apr 24, 2026 · Artificial Intelligence

LangChain vs LangGraph: Choosing a Toolkit or an Orchestrator

The article compares LangChain and LangGraph by implementing the same three‑stage code‑review pipeline with identical agents and Gemini 2.5 Flash calls, showing when a linear toolkit suffices and when a state‑machine orchestrator becomes necessary.

AgentLLM orchestrationLangChain
0 likes · 8 min read
LangChain vs LangGraph: Choosing a Toolkit or an Orchestrator
AI Architect Hub
AI Architect Hub
Apr 24, 2026 · Artificial Intelligence

RAG Level 1: Avoid Dirty Data Poisoning Your AI – A Data Cleaning Guide

This article explains why noisy documents cripple Retrieval‑Augmented Generation, enumerates common garbage data types, describes three typical data‑quality problems, warns against over‑cleaning, encoding, and regex pitfalls, and provides a configurable LangChain pipeline with deduplication and validation best practices.

AIDeduplicationEmbedding
0 likes · 21 min read
RAG Level 1: Avoid Dirty Data Poisoning Your AI – A Data Cleaning Guide
Fun with Large Models
Fun with Large Models
Apr 22, 2026 · Artificial Intelligence

How to Quickly Integrate Agent Skills in LangChain DeepAgents

This article provides a step‑by‑step guide to using Agent Skills in LangChain DeepAgents, covering the Skills directory structure, the four engineering steps (discovery, system‑prompt injection, progressive loading, execution), and two practical examples—a simple skill lookup and a complex docx‑processing skill—complete with code snippets and troubleshooting tips.

Agent SkillDeepAgentsFileSystemMiddleware
0 likes · 15 min read
How to Quickly Integrate Agent Skills in LangChain DeepAgents
Linyb Geek Road
Linyb Geek Road
Apr 22, 2026 · Artificial Intelligence

How to Build Short‑Term and Long‑Term Memory for LLM Agents Using Vector DBs and RAG

The article analyzes Agent memory design by comparing human short‑term and long‑term memory, explains context‑window management strategies, outlines persistent storage options such as vector databases, relational stores, knowledge graphs and fine‑tuning, and presents a three‑layer architecture with write, retrieval and forgetting mechanisms.

Agent MemoryLLMLangChain
0 likes · 15 min read
How to Build Short‑Term and Long‑Term Memory for LLM Agents Using Vector DBs and RAG
James' Growth Diary
James' Growth Diary
Apr 21, 2026 · Artificial Intelligence

Boosting RAG Performance with Milvus: Chunking, Hybrid Search, and Rerank Best Practices

This article analyzes why Retrieval‑Augmented Generation often underperforms, then walks through concrete engineering steps—optimal chunking, overlap settings, hybrid vector + BM25 retrieval, RRF fusion, and reranking—while providing code snippets, parameter tables, and a full pipeline diagram to turn a usable RAG system into a high‑quality one.

ChunkingHybrid SearchLangChain
0 likes · 18 min read
Boosting RAG Performance with Milvus: Chunking, Hybrid Search, and Rerank Best Practices
Tech Freedom Circle
Tech Freedom Circle
Apr 21, 2026 · Artificial Intelligence

Deep Dive into DeerFlow’s 14‑Layer Middleware: An Onion‑Style Chain Architecture Case Study

This article provides a detailed technical analysis of DeerFlow 2.0’s 14‑layer middleware stack, explaining how it extends LangChain’s runnable middleware with an onion‑style responsibility‑chain, compares the design to MyBatis interceptors, and breaks down each middleware’s purpose, implementation details, execution order, and engineering benefits for AI agent frameworks.

AI AgentsDeerFlowLangChain
0 likes · 36 min read
Deep Dive into DeerFlow’s 14‑Layer Middleware: An Onion‑Style Chain Architecture Case Study
James' Growth Diary
James' Growth Diary
Apr 19, 2026 · Artificial Intelligence

Vector Database Basics: Embeddings, Similarity Search, and Index Structures

This article explains how embeddings turn text into high‑dimensional vectors, compares commercial and open‑source embedding models, details cosine, Euclidean and inner‑product similarity metrics, reviews common index structures such as Flat, IVF, HNSW and PQ, and shows how to choose and use a vector database with LangChain.js while avoiding typical pitfalls.

IndexingLangChainRAG
0 likes · 25 min read
Vector Database Basics: Embeddings, Similarity Search, and Index Structures
SpringMeng
SpringMeng
Apr 19, 2026 · Artificial Intelligence

Build a LangChain AI Agent in 20 Minutes: Step‑by‑Step Guide

This tutorial walks through creating a LangChain‑based AI agent by covering model integration, tool definition with @tool, short‑ and long‑term memory handling via checkpointers and vector stores, and assembling everything with create_agent, middleware, and code examples for a functional travel assistant.

AI AgentLangChainLangGraph
0 likes · 16 min read
Build a LangChain AI Agent in 20 Minutes: Step‑by‑Step Guide
James' Growth Diary
James' Growth Diary
Apr 17, 2026 · Artificial Intelligence

How to Load and Split Documents for RAG: First Step to Building a Knowledge Base

This tutorial explains why document loading and splitting are critical for RAG pipelines, introduces LangChain's Document format, demonstrates loaders for various file types, details the RecursiveCharacterTextSplitter and alternative splitters, and provides practical tips on parameter tuning, metadata preservation, Chinese text handling, and common pitfalls.

AIChunkingDocument Loader
0 likes · 27 min read
How to Load and Split Documents for RAG: First Step to Building a Knowledge Base
PaperAgent
PaperAgent
Apr 15, 2026 · Artificial Intelligence

How Open‑Source Agent Harnesses Are Redefining LLM Deployments

The article analyzes the shift from proprietary Claude Managed Agents to open‑source frameworks like LangChain Deep Agents Deploy, detailing harness engineering, deployment steps, memory management, and the benefits of an open ecosystem for building production‑ready AI agents.

Harness EngineeringLangChaindeployment
0 likes · 8 min read
How Open‑Source Agent Harnesses Are Redefining LLM Deployments
Qborfy AI
Qborfy AI
Apr 14, 2026 · Artificial Intelligence

How to Give AI a Map, Not an Encyclopedia: Mastering Context Engineering

This article explains why AI can only act on information that appears in its context window, outlines the twin problems of missing data and overload, and presents a practical methodology—including AGENTS.md maps and LangChain's LocalContextMiddleware implementation—to engineer concise, searchable context for reliable agent behavior.

AILangChainOpenAI
0 likes · 29 min read
How to Give AI a Map, Not an Encyclopedia: Mastering Context Engineering
ShiZhen AI
ShiZhen AI
Apr 13, 2026 · Artificial Intelligence

Who Owns Your AI Memory? The Risks of Closed Agent Harnesses

The article explains that Agent Harnesses are essential for managing AI memory and context, argues that closed‑source harnesses give vendors control over user data, outlines three risk levels of memory lock‑in, and advocates open, user‑controlled harnesses such as OpenClaw and Deep Agents.

AI memoryAgent HarnessLangChain
0 likes · 14 min read
Who Owns Your AI Memory? The Risks of Closed Agent Harnesses
AI Architect Hub
AI Architect Hub
Apr 12, 2026 · Artificial Intelligence

Which AI Agent Framework Wins in 2026? LangChain, LlamaIndex, LangGraph, AutoGen

This article provides a practical selection guide for developers building AI agents in 2026, dissecting the design, core components, strengths, and limitations of four major frameworks—LangChain, LlamaIndex, LangGraph, and AutoGen—while offering use‑case recommendations, code examples, and a decision‑tree to help choose the most suitable tool.

AI AgentsAutoGenLangChain
0 likes · 23 min read
Which AI Agent Framework Wins in 2026? LangChain, LlamaIndex, LangGraph, AutoGen
James' Growth Diary
James' Growth Diary
Apr 12, 2026 · Artificial Intelligence

Build a Complete Private Knowledge Base with RAG: A Hands‑On Guide

This article walks through a complete, production‑ready Retrieval‑Augmented Generation pipeline that lets AI answer a company’s private documents, covering chunking strategies, embedding model choices, vector‑database selection, retrieval methods, full LangChain chain assembly, and common pitfalls to avoid.

EmbeddingLangChainPromptEngineering
0 likes · 18 min read
Build a Complete Private Knowledge Base with RAG: A Hands‑On Guide
Tech Freedom Circle
Tech Freedom Circle
Apr 12, 2026 · Artificial Intelligence

What Is Harness Agent? A Deep Dive into the New AI Engineering Framework

Harness Agent is an AI engineering framework that combines a large language model with a runtime control system—called the Harness—to provide task planning, sandboxed execution, tool integration, memory management, safety guardrails, and observability, turning raw model capabilities into reliable, production‑grade agents.

AI EngineeringDeerFlowHarness Agent
0 likes · 26 min read
What Is Harness Agent? A Deep Dive into the New AI Engineering Framework
Qborfy AI
Qborfy AI
Apr 11, 2026 · Industry Insights

Why AI Agents Need Harness Engineering: Insights from OpenAI, LangChain, and Anthropic

This article explains how AI agents often stall, repeat mistakes, or diverge on complex tasks, argues that the missing piece is a well‑designed harness, and demonstrates with real‑world case studies from OpenAI, LangChain, and Anthropic how a six‑component harness can boost performance by over 13 percentage points and enable million‑line code generation.

AI EngineeringAgent HarnessAnthropic
0 likes · 12 min read
Why AI Agents Need Harness Engineering: Insights from OpenAI, LangChain, and Anthropic
James' Growth Diary
James' Growth Diary
Apr 11, 2026 · Artificial Intelligence

Deep Dive into Tools: Function Calling Mechanics and LangChain Toolchain Design

This article explains how LLMs use Function Calling to output structured JSON for tool execution, walks through the full multi‑turn tool call loop, shows how LangChain standardizes disparate vendor APIs with BaseTool and bind_tools, and shares practical pitfalls, best‑practice guidelines, and security considerations for building robust agents.

AgentFunction CallingLLM
0 likes · 16 min read
Deep Dive into Tools: Function Calling Mechanics and LangChain Toolchain Design
James' Growth Diary
James' Growth Diary
Apr 10, 2026 · Artificial Intelligence

Designing Agent Memory Systems: Short‑Term, Long‑Term, and Knowledge Graph Layers

The article breaks down how to build a three‑layer memory architecture for AI agents—short‑term context windows with sliding‑window summarization, long‑term semantic retrieval via vector databases with selective storage and time decay, and a knowledge‑graph layer for relational reasoning—plus implementation tips and common pitfalls.

Agent MemoryKnowledge GraphLangChain
0 likes · 19 min read
Designing Agent Memory Systems: Short‑Term, Long‑Term, and Knowledge Graph Layers
James' Growth Diary
James' Growth Diary
Apr 10, 2026 · Artificial Intelligence

Build Your First Production‑Ready LCEL Chain with the Pipe Operator

This tutorial walks through LCEL’s pipe operator and its underlying RunnableSequence, then demonstrates sequential, parallel, and lambda‑based chains, shows how to preserve context with RunnablePassthrough/Assign, compares invoke/stream/batch execution modes, and provides a complete production‑grade RAG chain with common pitfalls and a self‑check checklist.

AILCELLangChain
0 likes · 12 min read
Build Your First Production‑Ready LCEL Chain with the Pipe Operator
PMTalk Product Manager Community
PMTalk Product Manager Community
Apr 10, 2026 · Artificial Intelligence

AI Handles 80% of a Medical Triage Agent, Product Managers Cover the Rest

The article walks through a medical triage AI Agent built with LangChain, LangGraph, and LangSmith, showing how the framework supplies core model and tool interfaces, how graph‑based orchestration manages complex branching, loops and human‑in‑the‑loop steps, and how tracing and evaluation prove reliability for product managers.

AI AgentLangChainLangGraph
0 likes · 23 min read
AI Handles 80% of a Medical Triage Agent, Product Managers Cover the Rest
James' Growth Diary
James' Growth Diary
Apr 9, 2026 · Artificial Intelligence

How ReAct Enables Agents to Think While Acting

This article explains the ReAct pattern—interleaving reasoning and acting for LLM agents—by defining its core loop, comparing it with plain tool‑calling, providing a step‑by‑step hand‑written implementation in JavaScript, showing the LangChain.js wrapper, streaming output, and detailing five common pitfalls and a pre‑deployment checklist.

JavaScriptLLMLangChain
0 likes · 16 min read
How ReAct Enables Agents to Think While Acting
Data STUDIO
Data STUDIO
Apr 9, 2026 · Artificial Intelligence

Two Weeks of RAG Troubles: How Bad PDF Parsing Made My LLM Look Stupid

After two weeks of failed RAG queries caused by fragmented tables, multi‑column layouts, and poor OCR, the author switched from open‑source PDF parsers to the commercial TextIn xParse engine, boosting retrieval accuracy from under 30% to over 95% and sharing practical integration tips.

AILangChainPDF parsing
0 likes · 12 min read
Two Weeks of RAG Troubles: How Bad PDF Parsing Made My LLM Look Stupid
James' Growth Diary
James' Growth Diary
Apr 8, 2026 · Artificial Intelligence

Practical Guide to Output Parsers: Ensuring Stable JSON from LLMs

The article explains why LLMs often produce malformed JSON, categorizes three common failure types, and walks through modern solutions—including withStructuredOutput + Zod, JsonOutputParser, and OutputFixingParser—plus a decision tree to choose the right approach for production use.

FunctionCallingLLMLangChain
0 likes · 14 min read
Practical Guide to Output Parsers: Ensuring Stable JSON from LLMs
James' Growth Diary
James' Growth Diary
Apr 7, 2026 · Artificial Intelligence

Parser vs withStructuredOutput: Choosing the Right Structured Output for LangChain

The article analyzes why LLMs often return unstructured text, compares LangChain's OutputParser and withStructuredOutput approaches, evaluates their stability, token usage, and model compatibility, and provides a decision guide and best‑practice recommendations for production‑grade structured output in 2025.

Function CallingLLMLangChain
0 likes · 10 min read
Parser vs withStructuredOutput: Choosing the Right Structured Output for LangChain
James' Growth Diary
James' Growth Diary
Apr 6, 2026 · Artificial Intelligence

10 Practical LangChain Performance Hacks to Speed Up and Cut Costs

This article presents ten concrete techniques—including in‑memory and Redis caching, semantic caching, parallel execution, batch processing, prompt compression, model routing, streaming output, and connection‑pool reuse—to dramatically reduce latency and token costs in production LangChain applications.

CachingLangChainNode.js
0 likes · 14 min read
10 Practical LangChain Performance Hacks to Speed Up and Cut Costs
Fun with Large Models
Fun with Large Models
Apr 3, 2026 · Artificial Intelligence

Fast Guide to LangChain DeepAgents: How SubAgents Work

This article explains DeepAgents SubAgent mechanisms, showing how context isolation and task division improve complex agent workflows, details two creation methods (dictionary‑based and compiled), demonstrates a search‑and‑report demo, and outlines suitable and unsuitable scenarios with practical code examples.

AI AgentsContext IsolationDeepAgents
0 likes · 15 min read
Fast Guide to LangChain DeepAgents: How SubAgents Work
Data STUDIO
Data STUDIO
Apr 2, 2026 · Artificial Intelligence

Building a Dual‑Stack Memory Agent: Situational + Semantic Memory for Long‑Term AI Understanding

This tutorial walks through designing and implementing a dual‑stack memory architecture for AI agents—combining episodic vector‑based situational memory with graph‑based semantic memory—using LangChain, FAISS, and Neo4j, and demonstrates a complete end‑to‑end workflow with code examples.

Agent MemoryFAISSKnowledge Graph
0 likes · 14 min read
Building a Dual‑Stack Memory Agent: Situational + Semantic Memory for Long‑Term AI Understanding
AI Waka
AI Waka
Mar 30, 2026 · Artificial Intelligence

Exploring Deep Agents: An Open‑Source Alternative to Claude Code for Coding AI Agents

Deep Agents, an open‑source framework built on LangChain and LangGraph, provides a ready‑to‑use agent harness with planning, file‑system tools, sandboxed shell access, sub‑agents, automatic context management, and built‑in observability for Python and TypeScript developers seeking a flexible replacement for Claude Code.

AI AutomationDeepAgentsLangChain
0 likes · 9 min read
Exploring Deep Agents: An Open‑Source Alternative to Claude Code for Coding AI Agents
Data STUDIO
Data STUDIO
Mar 30, 2026 · Artificial Intelligence

Why a Single AI Falls Short: Building a Multi‑Agent Expert Team for Superior Reports

The article demonstrates how a monolithic LLM struggles with multi‑dimensional market analysis and shows, through step‑by‑step code, how assembling specialized AI agents for news, technical and financial analysis yields clearer structure, deeper insight, and higher evaluation scores.

AI ArchitectureLLM evaluationLangChain
0 likes · 17 min read
Why a Single AI Falls Short: Building a Multi‑Agent Expert Team for Superior Reports
Data STUDIO
Data STUDIO
Mar 27, 2026 · Artificial Intelligence

Boost Agent Efficiency with Planning Architecture: A Hands‑On Comparison to ReAct

This article explains the planning architecture for AI agents, contrasts it with the ReAct approach, provides step‑by‑step Python code using LangChain and LangGraph, evaluates both methods on task completion and process efficiency, and discusses when each architecture is most suitable.

AI AgentsLangChainLangGraph
0 likes · 18 min read
Boost Agent Efficiency with Planning Architecture: A Hands‑On Comparison to ReAct
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Mar 26, 2026 · Artificial Intelligence

How to Build a Full‑Stack RAG Chatbot Using LangChain, FAISS & Langfuse

This guide walks through an end‑to‑end RAG implementation with LangChain, covering multi‑format document loading, recursive text splitting, embedding selection, FAISS vector storage, ConversationalRetrievalChain setup, prompt engineering, source citation, Langfuse observability, and best‑practice configuration management.

FAISSLLMOpsLangChain
0 likes · 13 min read
How to Build a Full‑Stack RAG Chatbot Using LangChain, FAISS & Langfuse
Fun with Large Models
Fun with Large Models
Mar 25, 2026 · Artificial Intelligence

Quick Guide to LangChain DeepAgents: Core Features and Fast Onboarding

This article introduces the background and key advantages of the DeepAgents framework, explains its four core capabilities—task planning, context management, sub‑agent generation, and long‑term memory—and provides a step‑by‑step code example that builds a complex AI agent with just a few lines of Python.

AI AgentsDeepAgentsLangChain
0 likes · 11 min read
Quick Guide to LangChain DeepAgents: Core Features and Fast Onboarding
Test Development Learning Exchange
Test Development Learning Exchange
Mar 24, 2026 · Artificial Intelligence

Build a Test‑Specific AI Agent to Auto‑Generate Pytest Cases and Analyze Allure Reports

This guide presents an end‑to‑end solution for creating a test‑focused AI agent that indexes project code and defect data, integrates a large language model via LangChain, generates compliant Pytest cases, parses Allure reports, and offers deployment tips for seamless PyCharm integration.

AI AgentAllureLangChain
0 likes · 13 min read
Build a Test‑Specific AI Agent to Auto‑Generate Pytest Cases and Analyze Allure Reports
Data STUDIO
Data STUDIO
Mar 24, 2026 · Artificial Intelligence

Turn LLMs into Real Assistants: Build a Tool‑Using Agent in Minutes

This article explains why large language models alone can hallucinate, introduces the tool‑using agent architecture, and provides a step‑by‑step Python tutorial using LangChain, LangGraph, and Tavily to create, run, and evaluate a real‑time web‑search capable AI assistant.

AgentLLMLangChain
0 likes · 16 min read
Turn LLMs into Real Assistants: Build a Tool‑Using Agent in Minutes
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 24, 2026 · Artificial Intelligence

Why LLMs Behave Unpredictably: From Uncertainty to Practical Agent Design

This article analyzes the sources of LLM output uncertainty, explores hardware and architectural constraints, demonstrates how to build robust AI agents with prompt engineering, tool orchestration, and memory management, and compares traditional micro‑service design with modern LLM‑centric workflows.

AI AgentHardwareLLM
0 likes · 64 min read
Why LLMs Behave Unpredictably: From Uncertainty to Practical Agent Design
DeepHub IMBA
DeepHub IMBA
Mar 18, 2026 · Artificial Intelligence

CRAG Architecture Explained: Fixing Erroneous Retrieval Results Before the Generator

The article analyzes how most RAG pipelines blindly feed retrieved documents to LLMs, introduces CRAG's lightweight evaluator with confidence thresholds, describes its sentence‑level decomposition, filtering, and dual‑knowledge routing, and provides a full implementation walkthrough with a real insurance query example.

CRAGFAISSLLM
0 likes · 13 min read
CRAG Architecture Explained: Fixing Erroneous Retrieval Results Before the Generator
JavaGuide
JavaGuide
Mar 18, 2026 · Artificial Intelligence

Why Build Your Own Claude Code Agent? A Step‑by‑Step Walkthrough

This article explores the Learn Claude Code website, breaking down the universal agent loop into twelve incremental versions, demonstrating language‑agnostic implementations in Python and Java, and detailing progressive capabilities—from basic tool integration to memory compression, concurrency, and multi‑agent collaboration.

AI AgentAgent LoopClaude
0 likes · 9 min read
Why Build Your Own Claude Code Agent? A Step‑by‑Step Walkthrough
AI Engineer Programming
AI Engineer Programming
Mar 16, 2026 · Artificial Intelligence

Why “Agent Development” Misleads: Framework vs. Harness in LLM Agents

The article explains that the term “Agent development” hides a fundamental split between Agent Frameworks, which give developers building blocks to assemble their own agents, and Agent Harnesses, which provide ready‑to‑run agents, and shows how this distinction affects decisions, maintenance, and troubleshooting.

AI EngineeringAgentClaude Code
0 likes · 10 min read
Why “Agent Development” Misleads: Framework vs. Harness in LLM Agents
Fun with Large Models
Fun with Large Models
Mar 15, 2026 · Artificial Intelligence

A Complete Guide to 2026’s Hottest Tech Concept: Agent Engineering

The article explains Agent Engineering—a systematic approach that turns nondeterministic large‑language‑model agents into reliable production‑grade applications through an iterative build‑test‑deploy‑observe‑improve loop, combining product, engineering, and data‑science thinking to address unpredictability and achieve continuous growth.

AI AgentData‑Driven OptimizationIterative Development
0 likes · 12 min read
A Complete Guide to 2026’s Hottest Tech Concept: Agent Engineering
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Mar 12, 2026 · Artificial Intelligence

How to Build Cross-Session Memory for RAG Chatbots: Short‑Term vs Long‑Term Strategies

This article explains the role of memory modules in Retrieval‑Augmented Generation systems, compares short‑term and long‑term memory techniques, outlines storage and retrieval methods, discusses management strategies like forgetting and deduplication, and compares LangChain and LlamaIndex implementations for practical deployment.

LLMLangChainLlamaIndex
0 likes · 11 min read
How to Build Cross-Session Memory for RAG Chatbots: Short‑Term vs Long‑Term Strategies
Fun with Large Models
Fun with Large Models
Mar 11, 2026 · Artificial Intelligence

LangChain DeepAgents Quick Guide – FileSystem Middleware Gives AI Agents System‑Level Memory Management

This article explains why AI agents need a memory‑management solution, introduces LangChain DeepAgents' FileSystem middleware, details its four backend options for short‑term, long‑term, disk‑based, and hybrid storage, and provides step‑by‑step Python examples for installing, configuring, and using the middleware in real‑world scenarios.

AI AgentDeepAgentsFileSystemMiddleware
0 likes · 16 min read
LangChain DeepAgents Quick Guide – FileSystem Middleware Gives AI Agents System‑Level Memory Management
AI Tech Publishing
AI Tech Publishing
Mar 10, 2026 · Artificial Intelligence

Agent Frameworks vs. Agent Harness: Understanding the Key Differences

The article explains how Agent Frameworks and Agent Harness occupy different points on an opinionated spectrum, detailing their abstractions, built‑in components, trade‑offs, and when to choose each, with examples like OpenClaw, LangChain, and Deep Agents.

Agent HarnessAgent frameworkLLM
0 likes · 5 min read
Agent Frameworks vs. Agent Harness: Understanding the Key Differences