Tagged articles
314 articles
Page 1 of 4
DeepHub IMBA
DeepHub IMBA
May 14, 2026 · Artificial Intelligence

How HyDE Transforms RAG Retrieval from Keyword Matching to Intent Understanding

The article explains how Hypothetical Document Embeddings (HyDE) improve Retrieval‑Augmented Generation by generating a synthetic answer before vector search, allowing the system to embed richer semantic intent rather than relying on shallow keyword similarity, and provides a step‑by‑step implementation using LangChain.

HyDELLMLangChain
0 likes · 6 min read
How HyDE Transforms RAG Retrieval from Keyword Matching to Intent Understanding
James' Growth Diary
James' Growth Diary
May 14, 2026 · Artificial Intelligence

LLM Semantic Routing Explained: Model‑Based Intent Classification and Three Keyword‑Matching Pitfalls

This article breaks down LLM semantic routing as a classifier, compares keyword, embedding, and LLM‑based routes, provides full TypeScript implementations, introduces hybrid routing for speed and accuracy, and covers production‑grade observability and dynamic configuration to avoid common pitfalls.

Hybrid RoutingLLMLangChain
0 likes · 33 min read
LLM Semantic Routing Explained: Model‑Based Intent Classification and Three Keyword‑Matching Pitfalls
James' Growth Diary
James' Growth Diary
May 13, 2026 · Artificial Intelligence

Multimodal RAG: A Complete Guide to Ingesting Images, Tables, and PDFs

This article examines the blind spot of pure‑text RAG for visual content, compares three multimodal ingestion strategies—CLIP embeddings, image‑to‑text captioning with a MultiVectorRetriever, and ColPali visual retrieval—covers table‑specific handling, presents end‑to‑end TypeScript implementations, and lists common pitfalls to avoid when deploying production‑grade multimodal RAG pipelines.

CLIPColPaliImage Captioning
0 likes · 22 min read
Multimodal RAG: A Complete Guide to Ingesting Images, Tables, and PDFs
James' Growth Diary
James' Growth Diary
May 12, 2026 · Artificial Intelligence

GraphRAG Deep Dive: Boost Multi‑Hop Reasoning Accuracy from 50% to 85% with Knowledge Graphs

This article explains why traditional vector RAG loses relational information, how GraphRAG reconstructs entity‑relationship triples into a knowledge graph, and provides step‑by‑step code, performance benchmarks, retrieval modes, and practical tips that raise multi‑hop reasoning accuracy from around 50% to 85%.

GraphRAGKnowledge GraphLangChain
0 likes · 14 min read
GraphRAG Deep Dive: Boost Multi‑Hop Reasoning Accuracy from 50% to 85% with Knowledge Graphs
James' Growth Diary
James' Growth Diary
May 11, 2026 · Artificial Intelligence

Mastering RAG Evaluation: Recall@K, MRR, NDCG, and RAGAS Explained

This article breaks down RAG evaluation into a two‑layer framework, explains the four core metrics—Recall@K, MRR, NDCG, and the four RAGAS scores—shows how to implement them with LangChain.js, highlights common pitfalls, and offers scenario‑specific metric combinations for reliable performance monitoring.

LangChainMRRNDCG
0 likes · 20 min read
Mastering RAG Evaluation: Recall@K, MRR, NDCG, and RAGAS Explained
James' Growth Diary
James' Growth Diary
May 10, 2026 · Artificial Intelligence

Syncing Vectors with Changing Documents: Add, Update, Delete Made Simple

This article walks through why keeping a vector store consistent with a mutable knowledge base is challenging, explains the three failure points, introduces hash‑based incremental syncing, shows idempotent add, proper update and soft‑delete workflows, covers embedding model upgrades, and presents a production‑grade event‑driven architecture with common pitfalls and remedies.

Hash DeduplicationIncremental SyncLangChain
0 likes · 17 min read
Syncing Vectors with Changing Documents: Add, Update, Delete Made Simple
IT Services Circle
IT Services Circle
May 9, 2026 · Artificial Intelligence

How to Choose Between LangChain and LlamaIndex: Core Use‑Case Comparison for Agent Development

The article analyzes the design philosophies, key components, strengths, and weaknesses of LangChain and LlamaIndex, explains their distinct core scenarios—complex multi‑step agent orchestration versus private‑data RAG—and shows how they can be combined in real projects while outlining emerging ecosystem trends.

AgentLLMLangChain
0 likes · 13 min read
How to Choose Between LangChain and LlamaIndex: Core Use‑Case Comparison for Agent Development
James' Growth Diary
James' Growth Diary
May 8, 2026 · Artificial Intelligence

How to Test Multi‑Agent Systems? Mock LLM and Graph Replay Explained

The article analyzes why testing Multi‑Agent systems is difficult—due to LLM output randomness, cross‑node state propagation, and tool side‑effects—and presents a systematic solution using mock LLMs, MemorySaver checkpoints with graph replay, tool stubs, and a three‑layer testing pyramid while highlighting common pitfalls and best practices.

Graph ReplayLangChainMock LLM
0 likes · 14 min read
How to Test Multi‑Agent Systems? Mock LLM and Graph Replay Explained
Data Party THU
Data Party THU
May 7, 2026 · Artificial Intelligence

Step‑by‑Step Guide to Building a Multi‑Agent Trading System for End‑to‑End Intelligent Decisions

This article walks through constructing a multi‑agent trading platform—analysts, researchers, traders, risk managers, and a portfolio manager—using LangChain, LangGraph, and LLMs (gpt‑4o, gpt‑4o‑mini), with real‑time data tools, shared and long‑term memory, ReAct loops, structured debates, and a final executable trade proposal.

ChromaDBFinancial AILLM
0 likes · 46 min read
Step‑by‑Step Guide to Building a Multi‑Agent Trading System for End‑to‑End Intelligent Decisions
James' Growth Diary
James' Growth Diary
May 7, 2026 · Artificial Intelligence

Three Design Patterns for Multi‑Agent Permission Isolation: Assigning Dedicated Toolsets

The article explains three architectural patterns—static binding, dynamic injection, and tool‑level guards—for isolating tool permissions in production‑grade multi‑agent LLM systems, compares their trade‑offs, shows concrete code examples, and highlights common pitfalls and best‑practice recommendations.

Dynamic InjectionLangChainMulti-Agent
0 likes · 16 min read
Three Design Patterns for Multi‑Agent Permission Isolation: Assigning Dedicated Toolsets
James' Growth Diary
James' Growth Diary
May 2, 2026 · Artificial Intelligence

How to Add Real‑Time Speech Recognition and Streaming TTS to Your AI Agent

This guide walks through choosing the right voice‑agent architecture, implementing streaming ASR with WebSocket, triggering sentence‑by‑sentence TTS, wiring the three layers together via async generators, optimizing latency to under a second, and avoiding common pitfalls such as missing VAD and checkpoint persistence.

LangChainWebSocketasync generators
0 likes · 19 min read
How to Add Real‑Time Speech Recognition and Streaming TTS to Your AI Agent
Data Party THU
Data Party THU
May 1, 2026 · Artificial Intelligence

LangChain vs LangGraph: Choosing Between a Toolkit and an Orchestration Layer

This article compares LangChain and LangGraph by implementing the same three‑stage code‑review pipeline with both frameworks, showing how LangChain offers a simple linear flow while LangGraph provides state‑machine orchestration for loops, conditional branches, and retries, and explains when each approach is preferable.

Agent orchestrationGeminiLLM workflow
0 likes · 8 min read
LangChain vs LangGraph: Choosing Between a Toolkit and an Orchestration Layer
AI Illustrated Series
AI Illustrated Series
Apr 28, 2026 · Artificial Intelligence

Comprehensive Interview Guide: LangChain & LangGraph Frameworks

This article provides a detailed, question‑and‑answer style walkthrough of LangChain and LangGraph, covering their core concepts, components, workflow patterns, memory mechanisms, LCEL syntax, graph construction, conditional edges, loops, multi‑agent collaboration, persistence, and a comparison with LlamaIndex, offering concrete code examples and practical insights for AI interview preparation.

AI FrameworkAgentLCEL
0 likes · 32 min read
Comprehensive Interview Guide: LangChain & LangGraph Frameworks
James' Growth Diary
James' Growth Diary
Apr 26, 2026 · Databases

Vector Database Fundamentals: Embedding, Similarity Search, and Index Structures Explained in One Go

This article walks through the complete workflow of turning split text into high‑dimensional vectors, choosing the right embedding model, selecting an appropriate similarity metric, comparing index structures such as Flat, IVF, HNSW and PQ, and finally picking a vector database and integrating it with LangChain.js for production‑grade RAG pipelines.

LangChainRAGembeddings
0 likes · 25 min read
Vector Database Fundamentals: Embedding, Similarity Search, and Index Structures Explained in One Go
AI Illustrated Series
AI Illustrated Series
Apr 26, 2026 · Artificial Intelligence

Build Your First LangChain Agent: A Hands‑On Framework Tutorial

This article walks through a practical, step‑by‑step construction of a LangChain agent—from basic concepts and a simple weather‑query agent to a more complex market‑research agent, adding memory and RAG capabilities, and finally comparing LangChain with LangGraph.

AI AgentLangChainMemory
0 likes · 15 min read
Build Your First LangChain Agent: A Hands‑On Framework Tutorial
AI Architect Hub
AI Architect Hub
Apr 26, 2026 · Artificial Intelligence

Embedding Explained: How Vectorization Turns Text into Numbers for RAG

This article walks through why traditional keyword matching fails for RAG, explains the evolution from one‑hot encoding to Word2Vec and BERT, details sentence‑level embeddings and similarity metrics, compares leading Chinese and multilingual embedding models using the C‑MTEB benchmark, and provides practical LangChain code, deployment tips, and common pitfalls.

Chinese NLPEmbeddingLangChain
0 likes · 18 min read
Embedding Explained: How Vectorization Turns Text into Numbers for RAG
AI Architect Hub
AI Architect Hub
Apr 25, 2026 · Artificial Intelligence

How to Feed Massive Documents to an RAG System: Mastering the Art of Text Chunking

This article explains why proper text chunking is critical for Retrieval‑Augmented Generation, illustrates common pitfalls with real‑world examples, compares four chunking strategies (fixed length, recursive, structure‑aware, and code‑aware), and provides practical guidelines for chunk size, overlap, metadata handling, and a production‑ready pipeline.

AI RetrievalLangChainRAG
0 likes · 21 min read
How to Feed Massive Documents to an RAG System: Mastering the Art of Text Chunking
James' Growth Diary
James' Growth Diary
Apr 25, 2026 · Artificial Intelligence

LangChain Memory Best Practices: Avoid Common Pitfalls and Choose the Right Module

This article dissects the most frequent LangChain Memory pitfalls—missing placeholders, wrong memory type, shared instances, and multi‑process issues—provides correct code patterns, compares the five built‑in memory classes, introduces the new RunnableWithMessageHistory approach, and offers a production‑ready checklist.

ChatOpenAIConversationSummaryBufferMemoryLLMChain
0 likes · 12 min read
LangChain Memory Best Practices: Avoid Common Pitfalls and Choose the Right Module
James' Growth Diary
James' Growth Diary
Apr 25, 2026 · Artificial Intelligence

Choosing the Right AI Memory: Truncation, Summarization, or Vector Retrieval

This article breaks down LangChain.js's three memory strategies—window truncation, summary compression, and vector‑store retrieval—explaining their inner workings, code setup, trade‑offs in token cost and information retention, and provides a decision guide for selecting the best approach in multi‑turn LLM conversations.

Conversation MemoryLLMLangChain
0 likes · 14 min read
Choosing the Right AI Memory: Truncation, Summarization, or Vector Retrieval
DeepHub IMBA
DeepHub IMBA
Apr 24, 2026 · Artificial Intelligence

LangChain vs LangGraph: Choosing a Toolkit or an Orchestrator

The article compares LangChain and LangGraph by implementing the same three‑stage code‑review pipeline with identical agents and Gemini 2.5 Flash calls, showing when a linear toolkit suffices and when a state‑machine orchestrator becomes necessary.

AgentLLM OrchestrationLangChain
0 likes · 8 min read
LangChain vs LangGraph: Choosing a Toolkit or an Orchestrator
AI Architect Hub
AI Architect Hub
Apr 24, 2026 · Artificial Intelligence

RAG Level 1: Avoid Dirty Data Poisoning Your AI – A Data Cleaning Guide

This article explains why noisy documents cripple Retrieval‑Augmented Generation, enumerates common garbage data types, describes three typical data‑quality problems, warns against over‑cleaning, encoding, and regex pitfalls, and provides a configurable LangChain pipeline with deduplication and validation best practices.

AIEmbeddingLangChain
0 likes · 21 min read
RAG Level 1: Avoid Dirty Data Poisoning Your AI – A Data Cleaning Guide
Fun with Large Models
Fun with Large Models
Apr 22, 2026 · Artificial Intelligence

How to Quickly Integrate Agent Skills in LangChain DeepAgents

This article provides a step‑by‑step guide to using Agent Skills in LangChain DeepAgents, covering the Skills directory structure, the four engineering steps (discovery, system‑prompt injection, progressive loading, execution), and two practical examples—a simple skill lookup and a complex docx‑processing skill—complete with code snippets and troubleshooting tips.

Agent SkillDeepAgentsFileSystemMiddleware
0 likes · 15 min read
How to Quickly Integrate Agent Skills in LangChain DeepAgents
James' Growth Diary
James' Growth Diary
Apr 21, 2026 · Artificial Intelligence

Boosting RAG Performance with Milvus: Chunking, Hybrid Search, and Rerank Best Practices

This article analyzes why Retrieval‑Augmented Generation often underperforms, then walks through concrete engineering steps—optimal chunking, overlap settings, hybrid vector + BM25 retrieval, RRF fusion, and reranking—while providing code snippets, parameter tables, and a full pipeline diagram to turn a usable RAG system into a high‑quality one.

Hybrid SearchLangChainMilvus
0 likes · 18 min read
Boosting RAG Performance with Milvus: Chunking, Hybrid Search, and Rerank Best Practices
Tech Freedom Circle
Tech Freedom Circle
Apr 21, 2026 · Artificial Intelligence

Deep Dive into DeerFlow’s 14‑Layer Middleware: An Onion‑Style Chain Architecture Case Study

This article provides a detailed technical analysis of DeerFlow 2.0’s 14‑layer middleware stack, explaining how it extends LangChain’s runnable middleware with an onion‑style responsibility‑chain, compares the design to MyBatis interceptors, and breaks down each middleware’s purpose, implementation details, execution order, and engineering benefits for AI agent frameworks.

AI agentsDeerFlowLangChain
0 likes · 36 min read
Deep Dive into DeerFlow’s 14‑Layer Middleware: An Onion‑Style Chain Architecture Case Study
James' Growth Diary
James' Growth Diary
Apr 19, 2026 · Artificial Intelligence

Vector Database Basics: Embeddings, Similarity Search, and Index Structures

This article explains how embeddings turn text into high‑dimensional vectors, compares commercial and open‑source embedding models, details cosine, Euclidean and inner‑product similarity metrics, reviews common index structures such as Flat, IVF, HNSW and PQ, and shows how to choose and use a vector database with LangChain.js while avoiding typical pitfalls.

LangChainRAGembeddings
0 likes · 25 min read
Vector Database Basics: Embeddings, Similarity Search, and Index Structures
SpringMeng
SpringMeng
Apr 19, 2026 · Artificial Intelligence

Build a LangChain AI Agent in 20 Minutes: Step‑by‑Step Guide

This tutorial walks through creating a LangChain‑based AI agent by covering model integration, tool definition with @tool, short‑ and long‑term memory handling via checkpointers and vector stores, and assembling everything with create_agent, middleware, and code examples for a functional travel assistant.

AI AgentLangChainLangGraph
0 likes · 16 min read
Build a LangChain AI Agent in 20 Minutes: Step‑by‑Step Guide
James' Growth Diary
James' Growth Diary
Apr 17, 2026 · Artificial Intelligence

How to Load and Split Documents for RAG: First Step to Building a Knowledge Base

This tutorial explains why document loading and splitting are critical for RAG pipelines, introduces LangChain's Document format, demonstrates loaders for various file types, details the RecursiveCharacterTextSplitter and alternative splitters, and provides practical tips on parameter tuning, metadata preservation, Chinese text handling, and common pitfalls.

AIDocument LoaderLangChain
0 likes · 27 min read
How to Load and Split Documents for RAG: First Step to Building a Knowledge Base
PaperAgent
PaperAgent
Apr 15, 2026 · Artificial Intelligence

How Open‑Source Agent Harnesses Are Redefining LLM Deployments

The article analyzes the shift from proprietary Claude Managed Agents to open‑source frameworks like LangChain Deep Agents Deploy, detailing harness engineering, deployment steps, memory management, and the benefits of an open ecosystem for building production‑ready AI agents.

DeploymentHarness EngineeringLangChain
0 likes · 8 min read
How Open‑Source Agent Harnesses Are Redefining LLM Deployments
Qborfy AI
Qborfy AI
Apr 14, 2026 · Artificial Intelligence

How to Give AI a Map, Not an Encyclopedia: Mastering Context Engineering

This article explains why AI can only act on information that appears in its context window, outlines the twin problems of missing data and overload, and presents a practical methodology—including AGENTS.md maps and LangChain's LocalContextMiddleware implementation—to engineer concise, searchable context for reliable agent behavior.

AIContext EngineeringLangChain
0 likes · 29 min read
How to Give AI a Map, Not an Encyclopedia: Mastering Context Engineering
ShiZhen AI
ShiZhen AI
Apr 13, 2026 · Artificial Intelligence

Who Owns Your AI Memory? The Risks of Closed Agent Harnesses

The article explains that Agent Harnesses are essential for managing AI memory and context, argues that closed‑source harnesses give vendors control over user data, outlines three risk levels of memory lock‑in, and advocates open, user‑controlled harnesses such as OpenClaw and Deep Agents.

AI memoryAgent HarnessLangChain
0 likes · 14 min read
Who Owns Your AI Memory? The Risks of Closed Agent Harnesses
AI Architect Hub
AI Architect Hub
Apr 12, 2026 · Artificial Intelligence

Which AI Agent Framework Wins in 2026? LangChain, LlamaIndex, LangGraph, AutoGen

This article provides a practical selection guide for developers building AI agents in 2026, dissecting the design, core components, strengths, and limitations of four major frameworks—LangChain, LlamaIndex, LangGraph, and AutoGen—while offering use‑case recommendations, code examples, and a decision‑tree to help choose the most suitable tool.

AI agentsAutoGenLangChain
0 likes · 23 min read
Which AI Agent Framework Wins in 2026? LangChain, LlamaIndex, LangGraph, AutoGen
James' Growth Diary
James' Growth Diary
Apr 12, 2026 · Artificial Intelligence

Build a Complete Private Knowledge Base with RAG: A Hands‑On Guide

This article walks through a complete, production‑ready Retrieval‑Augmented Generation pipeline that lets AI answer a company’s private documents, covering chunking strategies, embedding model choices, vector‑database selection, retrieval methods, full LangChain chain assembly, and common pitfalls to avoid.

EmbeddingLangChainPromptEngineering
0 likes · 18 min read
Build a Complete Private Knowledge Base with RAG: A Hands‑On Guide
Tech Freedom Circle
Tech Freedom Circle
Apr 12, 2026 · Artificial Intelligence

What Is Harness Agent? A Deep Dive into the New AI Engineering Framework

Harness Agent is an AI engineering framework that combines a large language model with a runtime control system—called the Harness—to provide task planning, sandboxed execution, tool integration, memory management, safety guardrails, and observability, turning raw model capabilities into reliable, production‑grade agents.

AI EngineeringAgent ArchitectureDeerFlow
0 likes · 26 min read
What Is Harness Agent? A Deep Dive into the New AI Engineering Framework
Qborfy AI
Qborfy AI
Apr 11, 2026 · Industry Insights

Why AI Agents Need Harness Engineering: Insights from OpenAI, LangChain, and Anthropic

This article explains how AI agents often stall, repeat mistakes, or diverge on complex tasks, argues that the missing piece is a well‑designed harness, and demonstrates with real‑world case studies from OpenAI, LangChain, and Anthropic how a six‑component harness can boost performance by over 13 percentage points and enable million‑line code generation.

AI EngineeringAgent HarnessAnthropic
0 likes · 12 min read
Why AI Agents Need Harness Engineering: Insights from OpenAI, LangChain, and Anthropic
James' Growth Diary
James' Growth Diary
Apr 11, 2026 · Artificial Intelligence

Deep Dive into Tools: Function Calling Mechanics and LangChain Toolchain Design

This article explains how LLMs use Function Calling to output structured JSON for tool execution, walks through the full multi‑turn tool call loop, shows how LangChain standardizes disparate vendor APIs with BaseTool and bind_tools, and shares practical pitfalls, best‑practice guidelines, and security considerations for building robust agents.

AgentFunction CallingLLM
0 likes · 16 min read
Deep Dive into Tools: Function Calling Mechanics and LangChain Toolchain Design
James' Growth Diary
James' Growth Diary
Apr 10, 2026 · Artificial Intelligence

Designing Agent Memory Systems: Short‑Term, Long‑Term, and Knowledge Graph Layers

The article breaks down how to build a three‑layer memory architecture for AI agents—short‑term context windows with sliding‑window summarization, long‑term semantic retrieval via vector databases with selective storage and time decay, and a knowledge‑graph layer for relational reasoning—plus implementation tips and common pitfalls.

Agent MemoryKnowledge GraphLangChain
0 likes · 19 min read
Designing Agent Memory Systems: Short‑Term, Long‑Term, and Knowledge Graph Layers
James' Growth Diary
James' Growth Diary
Apr 10, 2026 · Artificial Intelligence

Build Your First Production‑Ready LCEL Chain with the Pipe Operator

This tutorial walks through LCEL’s pipe operator and its underlying RunnableSequence, then demonstrates sequential, parallel, and lambda‑based chains, shows how to preserve context with RunnablePassthrough/Assign, compares invoke/stream/batch execution modes, and provides a complete production‑grade RAG chain with common pitfalls and a self‑check checklist.

AILCELLangChain
0 likes · 12 min read
Build Your First Production‑Ready LCEL Chain with the Pipe Operator
PMTalk Product Manager Community
PMTalk Product Manager Community
Apr 10, 2026 · Artificial Intelligence

AI Handles 80% of a Medical Triage Agent, Product Managers Cover the Rest

The article walks through a medical triage AI Agent built with LangChain, LangGraph, and LangSmith, showing how the framework supplies core model and tool interfaces, how graph‑based orchestration manages complex branching, loops and human‑in‑the‑loop steps, and how tracing and evaluation prove reliability for product managers.

AI AgentLangChainLangGraph
0 likes · 23 min read
AI Handles 80% of a Medical Triage Agent, Product Managers Cover the Rest
James' Growth Diary
James' Growth Diary
Apr 9, 2026 · Artificial Intelligence

How ReAct Enables Agents to Think While Acting

This article explains the ReAct pattern—interleaving reasoning and acting for LLM agents—by defining its core loop, comparing it with plain tool‑calling, providing a step‑by‑step hand‑written implementation in JavaScript, showing the LangChain.js wrapper, streaming output, and detailing five common pitfalls and a pre‑deployment checklist.

JavaScriptLLMLangChain
0 likes · 16 min read
How ReAct Enables Agents to Think While Acting
Data STUDIO
Data STUDIO
Apr 9, 2026 · Artificial Intelligence

Two Weeks of RAG Troubles: How Bad PDF Parsing Made My LLM Look Stupid

After two weeks of failed RAG queries caused by fragmented tables, multi‑column layouts, and poor OCR, the author switched from open‑source PDF parsers to the commercial TextIn xParse engine, boosting retrieval accuracy from under 30% to over 95% and sharing practical integration tips.

AILangChainPDF parsing
0 likes · 12 min read
Two Weeks of RAG Troubles: How Bad PDF Parsing Made My LLM Look Stupid
James' Growth Diary
James' Growth Diary
Apr 8, 2026 · Artificial Intelligence

Practical Guide to Output Parsers: Ensuring Stable JSON from LLMs

The article explains why LLMs often produce malformed JSON, categorizes three common failure types, and walks through modern solutions—including withStructuredOutput + Zod, JsonOutputParser, and OutputFixingParser—plus a decision tree to choose the right approach for production use.

FunctionCallingJSONLLM
0 likes · 14 min read
Practical Guide to Output Parsers: Ensuring Stable JSON from LLMs
James' Growth Diary
James' Growth Diary
Apr 7, 2026 · Artificial Intelligence

Parser vs withStructuredOutput: Choosing the Right Structured Output for LangChain

The article analyzes why LLMs often return unstructured text, compares LangChain's OutputParser and withStructuredOutput approaches, evaluates their stability, token usage, and model compatibility, and provides a decision guide and best‑practice recommendations for production‑grade structured output in 2025.

Function CallingLLMLangChain
0 likes · 10 min read
Parser vs withStructuredOutput: Choosing the Right Structured Output for LangChain
James' Growth Diary
James' Growth Diary
Apr 6, 2026 · Artificial Intelligence

10 Practical LangChain Performance Hacks to Speed Up and Cut Costs

This article presents ten concrete techniques—including in‑memory and Redis caching, semantic caching, parallel execution, batch processing, prompt compression, model routing, streaming output, and connection‑pool reuse—to dramatically reduce latency and token costs in production LangChain applications.

LangChainNode.jsPerformance Optimization
0 likes · 14 min read
10 Practical LangChain Performance Hacks to Speed Up and Cut Costs
Fun with Large Models
Fun with Large Models
Apr 3, 2026 · Artificial Intelligence

Fast Guide to LangChain DeepAgents: How SubAgents Work

This article explains DeepAgents SubAgent mechanisms, showing how context isolation and task division improve complex agent workflows, details two creation methods (dictionary‑based and compiled), demonstrates a search‑and‑report demo, and outlines suitable and unsuitable scenarios with practical code examples.

AI agentsDeepAgentsLangChain
0 likes · 15 min read
Fast Guide to LangChain DeepAgents: How SubAgents Work
Data STUDIO
Data STUDIO
Apr 2, 2026 · Artificial Intelligence

Building a Dual‑Stack Memory Agent: Situational + Semantic Memory for Long‑Term AI Understanding

This tutorial walks through designing and implementing a dual‑stack memory architecture for AI agents—combining episodic vector‑based situational memory with graph‑based semantic memory—using LangChain, FAISS, and Neo4j, and demonstrates a complete end‑to‑end workflow with code examples.

Agent MemoryFAISSKnowledge Graph
0 likes · 14 min read
Building a Dual‑Stack Memory Agent: Situational + Semantic Memory for Long‑Term AI Understanding
AI Waka
AI Waka
Mar 30, 2026 · Artificial Intelligence

Exploring Deep Agents: An Open‑Source Alternative to Claude Code for Coding AI Agents

Deep Agents, an open‑source framework built on LangChain and LangGraph, provides a ready‑to‑use agent harness with planning, file‑system tools, sandboxed shell access, sub‑agents, automatic context management, and built‑in observability for Python and TypeScript developers seeking a flexible replacement for Claude Code.

AI automationDeepAgentsLangChain
0 likes · 9 min read
Exploring Deep Agents: An Open‑Source Alternative to Claude Code for Coding AI Agents
Data STUDIO
Data STUDIO
Mar 30, 2026 · Artificial Intelligence

Why a Single AI Falls Short: Building a Multi‑Agent Expert Team for Superior Reports

The article demonstrates how a monolithic LLM struggles with multi‑dimensional market analysis and shows, through step‑by‑step code, how assembling specialized AI agents for news, technical and financial analysis yields clearer structure, deeper insight, and higher evaluation scores.

AI ArchitectureLLM evaluationLangChain
0 likes · 17 min read
Why a Single AI Falls Short: Building a Multi‑Agent Expert Team for Superior Reports
Data STUDIO
Data STUDIO
Mar 27, 2026 · Artificial Intelligence

Boost Agent Efficiency with Planning Architecture: A Hands‑On Comparison to ReAct

This article explains the planning architecture for AI agents, contrasts it with the ReAct approach, provides step‑by‑step Python code using LangChain and LangGraph, evaluates both methods on task completion and process efficiency, and discusses when each architecture is most suitable.

AI agentsLangChainLangGraph
0 likes · 18 min read
Boost Agent Efficiency with Planning Architecture: A Hands‑On Comparison to ReAct
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Mar 26, 2026 · Artificial Intelligence

How to Build a Full‑Stack RAG Chatbot Using LangChain, FAISS & Langfuse

This guide walks through an end‑to‑end RAG implementation with LangChain, covering multi‑format document loading, recursive text splitting, embedding selection, FAISS vector storage, ConversationalRetrievalChain setup, prompt engineering, source citation, Langfuse observability, and best‑practice configuration management.

FAISSLLMOpsLangChain
0 likes · 13 min read
How to Build a Full‑Stack RAG Chatbot Using LangChain, FAISS & Langfuse
Fun with Large Models
Fun with Large Models
Mar 25, 2026 · Artificial Intelligence

Quick Guide to LangChain DeepAgents: Core Features and Fast Onboarding

This article introduces the background and key advantages of the DeepAgents framework, explains its four core capabilities—task planning, context management, sub‑agent generation, and long‑term memory—and provides a step‑by‑step code example that builds a complex AI agent with just a few lines of Python.

AI agentsDeepAgentsLangChain
0 likes · 11 min read
Quick Guide to LangChain DeepAgents: Core Features and Fast Onboarding
Test Development Learning Exchange
Test Development Learning Exchange
Mar 24, 2026 · Artificial Intelligence

Build a Test‑Specific AI Agent to Auto‑Generate Pytest Cases and Analyze Allure Reports

This guide presents an end‑to‑end solution for creating a test‑focused AI agent that indexes project code and defect data, integrates a large language model via LangChain, generates compliant Pytest cases, parses Allure reports, and offers deployment tips for seamless PyCharm integration.

AI AgentAllureLangChain
0 likes · 13 min read
Build a Test‑Specific AI Agent to Auto‑Generate Pytest Cases and Analyze Allure Reports
Data STUDIO
Data STUDIO
Mar 24, 2026 · Artificial Intelligence

Turn LLMs into Real Assistants: Build a Tool‑Using Agent in Minutes

This article explains why large language models alone can hallucinate, introduces the tool‑using agent architecture, and provides a step‑by‑step Python tutorial using LangChain, LangGraph, and Tavily to create, run, and evaluate a real‑time web‑search capable AI assistant.

AgentLLMLangChain
0 likes · 16 min read
Turn LLMs into Real Assistants: Build a Tool‑Using Agent in Minutes
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 24, 2026 · Artificial Intelligence

Why LLMs Behave Unpredictably: From Uncertainty to Practical Agent Design

This article analyzes the sources of LLM output uncertainty, explores hardware and architectural constraints, demonstrates how to build robust AI agents with prompt engineering, tool orchestration, and memory management, and compares traditional micro‑service design with modern LLM‑centric workflows.

AI AgentHardwareLLM
0 likes · 64 min read
Why LLMs Behave Unpredictably: From Uncertainty to Practical Agent Design
DeepHub IMBA
DeepHub IMBA
Mar 18, 2026 · Artificial Intelligence

CRAG Architecture Explained: Fixing Erroneous Retrieval Results Before the Generator

The article analyzes how most RAG pipelines blindly feed retrieved documents to LLMs, introduces CRAG's lightweight evaluator with confidence thresholds, describes its sentence‑level decomposition, filtering, and dual‑knowledge routing, and provides a full implementation walkthrough with a real insurance query example.

CRAGFAISSLLM
0 likes · 13 min read
CRAG Architecture Explained: Fixing Erroneous Retrieval Results Before the Generator
JavaGuide
JavaGuide
Mar 18, 2026 · Artificial Intelligence

Why Build Your Own Claude Code Agent? A Step‑by‑Step Walkthrough

This article explores the Learn Claude Code website, breaking down the universal agent loop into twelve incremental versions, demonstrating language‑agnostic implementations in Python and Java, and detailing progressive capabilities—from basic tool integration to memory compression, concurrency, and multi‑agent collaboration.

AI AgentAgent LoopClaude
0 likes · 9 min read
Why Build Your Own Claude Code Agent? A Step‑by‑Step Walkthrough
AI Engineer Programming
AI Engineer Programming
Mar 16, 2026 · Artificial Intelligence

Why “Agent Development” Misleads: Framework vs. Harness in LLM Agents

The article explains that the term “Agent development” hides a fundamental split between Agent Frameworks, which give developers building blocks to assemble their own agents, and Agent Harnesses, which provide ready‑to‑run agents, and shows how this distinction affects decisions, maintenance, and troubleshooting.

AI EngineeringAgentClaude Code
0 likes · 10 min read
Why “Agent Development” Misleads: Framework vs. Harness in LLM Agents
Fun with Large Models
Fun with Large Models
Mar 15, 2026 · Artificial Intelligence

A Complete Guide to 2026’s Hottest Tech Concept: Agent Engineering

The article explains Agent Engineering—a systematic approach that turns nondeterministic large‑language‑model agents into reliable production‑grade applications through an iterative build‑test‑deploy‑observe‑improve loop, combining product, engineering, and data‑science thinking to address unpredictability and achieve continuous growth.

AI AgentData‑Driven OptimizationIterative Development
0 likes · 12 min read
A Complete Guide to 2026’s Hottest Tech Concept: Agent Engineering
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Mar 12, 2026 · Artificial Intelligence

How to Build Cross-Session Memory for RAG Chatbots: Short‑Term vs Long‑Term Strategies

This article explains the role of memory modules in Retrieval‑Augmented Generation systems, compares short‑term and long‑term memory techniques, outlines storage and retrieval methods, discusses management strategies like forgetting and deduplication, and compares LangChain and LlamaIndex implementations for practical deployment.

LLMLangChainLlamaIndex
0 likes · 11 min read
How to Build Cross-Session Memory for RAG Chatbots: Short‑Term vs Long‑Term Strategies
Fun with Large Models
Fun with Large Models
Mar 11, 2026 · Artificial Intelligence

LangChain DeepAgents Quick Guide – FileSystem Middleware Gives AI Agents System‑Level Memory Management

This article explains why AI agents need a memory‑management solution, introduces LangChain DeepAgents' FileSystem middleware, details its four backend options for short‑term, long‑term, disk‑based, and hybrid storage, and provides step‑by‑step Python examples for installing, configuring, and using the middleware in real‑world scenarios.

AI AgentDeepAgentsFileSystemMiddleware
0 likes · 16 min read
LangChain DeepAgents Quick Guide – FileSystem Middleware Gives AI Agents System‑Level Memory Management
AI Tech Publishing
AI Tech Publishing
Mar 10, 2026 · Artificial Intelligence

Agent Frameworks vs. Agent Harness: Understanding the Key Differences

The article explains how Agent Frameworks and Agent Harness occupy different points on an opinionated spectrum, detailing their abstractions, built‑in components, trade‑offs, and when to choose each, with examples like OpenClaw, LangChain, and Deep Agents.

Agent FrameworkAgent HarnessLLM
0 likes · 5 min read
Agent Frameworks vs. Agent Harness: Understanding the Key Differences
Fun with Large Models
Fun with Large Models
Feb 25, 2026 · Artificial Intelligence

Fast Guide to LangChain DeepAgents: Using Summarization Middleware to Optimize Agent Memory

This article explains how LangChain DeepAgents' Summarization middleware automatically compresses conversation history to overcome large‑model context window limits, detailing its core mechanism, applicable scenarios, configuration parameters (trigger, keep, model, summary_prompt), and step‑by‑step Python examples that illustrate its integration and internal message flow.

AI agentsContext WindowDeepAgents
0 likes · 23 min read
Fast Guide to LangChain DeepAgents: Using Summarization Middleware to Optimize Agent Memory
Fun with Large Models
Fun with Large Models
Feb 24, 2026 · Artificial Intelligence

DeepAgents Quickstart Guide: A Full Walkthrough of Core Features

This article introduces LangChain's DeepAgents framework, explains its design goals, compares it with LangChain and LangGraph, and provides a step‑by‑step code walkthrough that demonstrates task planning, sub‑agent delegation, tool usage, and result generation for building complex AI agents with just a few lines of code.

AI agentsAgent orchestrationDeepAgents
0 likes · 15 min read
DeepAgents Quickstart Guide: A Full Walkthrough of Core Features
AI Waka
AI Waka
Feb 23, 2026 · Artificial Intelligence

Essential Books to Master Generative AI: From NLP to Multimodal Apps

This guide outlines the key competencies for generative AI professionals and curates a focused reading list—covering NLP fundamentals, software engineering, LLM libraries, vector databases, and multimodal AI—to help readers build practical expertise and deploy impactful AI solutions.

AI learningBook RecommendationsLangChain
0 likes · 9 min read
Essential Books to Master Generative AI: From NLP to Multimodal Apps
Data STUDIO
Data STUDIO
Feb 22, 2026 · Artificial Intelligence

Building AI Agents with LangGraph: Implementing RAG and Long‑Term Memory

This tutorial walks through adding Retrieval‑Augmented Generation (RAG) and persistent long‑term memory to a LangGraph AI agent, covering concepts, step‑by‑step code for document loading, vector store creation, prompt engineering, memory management, and best‑practice pitfalls.

AI AgentEmbeddingLangChain
0 likes · 16 min read
Building AI Agents with LangGraph: Implementing RAG and Long‑Term Memory
Qborfy AI
Qborfy AI
Feb 18, 2026 · Artificial Intelligence

How Retrieval‑Augmented Generation (RAG) Supercharges LLM Answers – Complete Guide & Code

This article explains Retrieval‑Augmented Generation (RAG), detailing its offline knowledge‑base construction and online retrieval‑enhanced generation workflow, comparing it with traditional and fine‑tuned models, and providing step‑by‑step LangChain implementations, advanced techniques, and practical use‑case demos.

Hybrid SearchLangChainPrompt engineering
0 likes · 16 min read
How Retrieval‑Augmented Generation (RAG) Supercharges LLM Answers – Complete Guide & Code
Qborfy AI
Qborfy AI
Feb 11, 2026 · Artificial Intelligence

What Is an AI Agent? From Passive Models to Autonomous Digital Assistants

This article explains AI agents as autonomous systems that perceive environments, set goals, and act, contrasting them with traditional AI, detailing their core definition, architecture, key components, practical applications, implementation steps, classification, technology stack, case studies, emerging trends, challenges, and future directions.

AI AgentAgent ArchitectureAutoGPT
0 likes · 11 min read
What Is an AI Agent? From Passive Models to Autonomous Digital Assistants
AI Engineering
AI Engineering
Feb 11, 2026 · Artificial Intelligence

Harrison Chase Explains Two Sandbox Architectures for AI Agents

The article analyzes why AI agents need isolated sandboxes, outlines two architectural patterns—running the agent inside a sandbox or using the sandbox as an external tool—compares their advantages and challenges, and provides concrete implementation examples and community insights.

AI agentsAPIDocker
0 likes · 11 min read
Harrison Chase Explains Two Sandbox Architectures for AI Agents
Fun with Large Models
Fun with Large Models
Feb 10, 2026 · Artificial Intelligence

Building LangChain Agent Skills from Scratch to Cut Token Usage and Boost Tool Accuracy

The article presents a step‑by‑step design and implementation of a Claude‑style Skills mechanism for LangChain agents, using a double‑layer tool architecture, state‑driven dynamic filtering, and middleware interception to load only relevant tools, dramatically reducing token consumption and improving decision quality and response speed.

Agent SkillsDynamic LoadingLangChain
0 likes · 15 min read
Building LangChain Agent Skills from Scratch to Cut Token Usage and Boost Tool Accuracy
Data STUDIO
Data STUDIO
Jan 27, 2026 · Artificial Intelligence

How Python RAG Architectures Can Tame Large‑Model Hallucinations: A Complete Guide to 9 Designs

This article explains why large‑language‑model hallucinations are risky, introduces Retrieval‑Augmented Generation (RAG) as a remedy, and walks through nine Python‑based RAG architectures—standard, conversational, corrective, adaptive, fusion, HyDE, self‑RAG, agentic, and graph RAG—detailing their workflows, code examples, strengths, weaknesses, and a decision‑making map for selecting the right design.

AI hallucinationLangChainPython
0 likes · 29 min read
How Python RAG Architectures Can Tame Large‑Model Hallucinations: A Complete Guide to 9 Designs
Tech Verticals & Horizontals
Tech Verticals & Horizontals
Jan 23, 2026 · Artificial Intelligence

Comparing 9 Major Agent Development Frameworks: Choosing the Best Fit

This article provides an in‑depth comparison of nine mainstream AI agent development frameworks—Pydantic AI, SmolAgents, DeepAgents, LlamaIndex, CAMEL, AutoGen, CrewAI, LangGraph, and OpenAI Agents SDK—detailing their design principles, strengths, weaknesses, typical scenarios, and guidance for selecting or mixing them in production.

Agent FrameworksComparisonLLM
0 likes · 30 min read
Comparing 9 Major Agent Development Frameworks: Choosing the Best Fit
360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
Jan 13, 2026 · Databases

Turn PostgreSQL into a Graph Database with Apache AGE

This guide explains how Apache AGE extends PostgreSQL with OpenCypher‑compatible graph capabilities, covering architecture, installation, storage schema, Cypher‑SQL integration, common graph operations, and a LangChain example that turns natural‑language questions into executable graph queries.

Apache AGECypherLangChain
0 likes · 11 min read
Turn PostgreSQL into a Graph Database with Apache AGE
NetEase LeiHuo Testing Center
NetEase LeiHuo Testing Center
Jan 2, 2026 · Artificial Intelligence

From ChatGPT to LLM‑Native: Building Intelligent AI Agents and Workflows with LangChain

The article explains why traditional chat‑based AI tools are limited to advice, introduces next‑generation LLM‑native applications that can understand, plan, and act, and provides a step‑by‑step guide on designing AI workflows, autonomous agents, hybrid architectures, and the Model Context Protocol (MCP) using LangChain.

AI agentsLLMLangChain
0 likes · 36 min read
From ChatGPT to LLM‑Native: Building Intelligent AI Agents and Workflows with LangChain
360 Tech Engineering
360 Tech Engineering
Dec 25, 2025 · Artificial Intelligence

Why LangChain 1.0 Makes AI Agent Development Faster, Safer, and More Scalable

LangChain 1.0 replaces fragmented agent code with a production‑ready framework that unifies model outputs, simplifies tool integration, introduces content_blocks for consistent response handling, and adds a middleware system for privacy, summarization, and human‑in‑the‑loop safety, dramatically improving developer efficiency and reliability.

LLMLangChainPython
0 likes · 13 min read
Why LangChain 1.0 Makes AI Agent Development Faster, Safer, and More Scalable
Fun with Large Models
Fun with Large Models
Dec 24, 2025 · Artificial Intelligence

Building an Automatic Email‑Processing Agent with LangGraph 1.0 – A Hands‑On Guide

This tutorial walks through the complete development of an automatic email‑processing agent using LangGraph 1.0, covering scenario analysis, state design, node implementation, graph assembly, and testing with both high‑priority bug reports and routine greeting emails, while demonstrating state management, conditional routing, and human‑in‑the‑loop controls.

LangChainLangGraphState Management
0 likes · 14 min read
Building an Automatic Email‑Processing Agent with LangGraph 1.0 – A Hands‑On Guide
Fun with Large Models
Fun with Large Models
Dec 17, 2025 · Artificial Intelligence

Quick Guide to LangGraph 1.0: Core Concepts, Nodes, and Edges

This article introduces LangGraph 1.0 as a programming‑language‑style framework for AI agents, explains its core abstractions—State, Node, Edge, Reducer, and Human‑in‑the‑Loop—shows how to define state and node functions, builds simple and parallel graphs with static, conditional, and MapReduce edges, and demonstrates conflict‑resolution using built‑in and custom reducers.

AI agentsGraph WorkflowLangChain
0 likes · 17 min read
Quick Guide to LangGraph 1.0: Core Concepts, Nodes, and Edges
Bilibili Tech
Bilibili Tech
Dec 12, 2025 · Artificial Intelligence

Turning a Simple JS Function into a Cross‑Platform AI Tool with MCP

This article details how we built an AI‑tool ecosystem by evolving a basic online JS cloud‑function platform into a unified, reusable capability layer that integrates with Flowise, LangChain StructuredTool, and the Model Context Protocol (MCP) to provide secure, cross‑platform tool calls for agents.

AI toolsLangChainMCP
0 likes · 20 min read
Turning a Simple JS Function into a Cross‑Platform AI Tool with MCP
Fun with Large Models
Fun with Large Models
Dec 7, 2025 · Frontend Development

Building a Multimodal RAG Front‑End with Trae Solo: A Vibe‑Coding Guide

This article walks through a three‑step Vibe‑Coding workflow—structured prompt creation, prompt optimization with DeepSeek, and precise bug‑fix guidance—to automatically generate, refine, and extend a React + TypeScript front‑end for a multimodal RAG system using Trae Solo, covering architecture, streaming chat, and PDF citation features.

AI programmingLangChainRAG
0 likes · 22 min read
Building a Multimodal RAG Front‑End with Trae Solo: A Vibe‑Coding Guide
dbaplus Community
dbaplus Community
Dec 7, 2025 · Artificial Intelligence

How AI Agents Can Revolutionize Data Governance: A Step‑by‑Step Blueprint

This article explains how AI agents transform traditional data governance by introducing a four‑layer perception‑decision‑execution‑learning architecture, detailing the required technologies, tool integrations, code examples, deployment steps, team roles, security safeguards, and practical rollout strategies for enterprises seeking automated, intelligent data management.

AI AgentData GovernanceData Quality
0 likes · 10 min read
How AI Agents Can Revolutionize Data Governance: A Step‑by‑Step Blueprint
Fun with Large Models
Fun with Large Models
Nov 30, 2025 · Artificial Intelligence

Multimodal RAG with LangChain: PDF Parsing, Chunking, and Citation Guide

This article walks through building a LangChain‑based multimodal RAG system that parses PDFs (both native and scanned), splits them into semantic chunks, stores embeddings in a vector database, and generates answers with precise source citations, complete with code samples and API integration.

FastAPILangChainMultimodal AI
0 likes · 20 min read
Multimodal RAG with LangChain: PDF Parsing, Chunking, and Citation Guide
Data Party THU
Data Party THU
Nov 25, 2025 · Artificial Intelligence

What $47,000 Taught Us About Deploying Multi‑Agent AI Systems

After spending $47,000 running four LangChain agents in production, we reveal the hidden costs of A2A communication and Anthropic’s MCP, expose seven common deployment pitfalls, and argue that dedicated AI infrastructure is essential for scalable multi‑agent systems.

A2A communicationAI InfrastructureCost Optimization
0 likes · 13 min read
What $47,000 Taught Us About Deploying Multi‑Agent AI Systems
Architect's Guide
Architect's Guide
Nov 24, 2025 · Artificial Intelligence

Building Java LLM Applications with LangChain4j: A Hands‑On Guide

This tutorial walks through the fundamentals of large language models, prompt engineering, and word embeddings, then shows how to set up a LangChain‑based LLM stack in Java using LangChain4j, covering core modules, memory, retrieval, chains, agents, and complete code examples.

AI agentsJavaLLM
0 likes · 15 min read
Building Java LLM Applications with LangChain4j: A Hands‑On Guide