Understanding Dropout: Preventing Overfitting in Neural Networks
This article explains what overfitting is, introduces dropout as a regularization technique, describes how dropout randomly deactivates neurons during training and rescales outputs during inference, discusses its limitations, and outlines why large language models may use alternative strategies.
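The mechanics summarized above can be sketched in a few lines of NumPy. This is a minimal illustration of standard (non-inverted) dropout, not a reference implementation; the function name, signature, and the 0.5 drop rate are illustrative choices, not taken from the article.

```python
import numpy as np

def dropout_forward(x, p_drop=0.5, training=True, rng=None):
    """Illustrative standard dropout.

    Training: each activation is zeroed independently with probability p_drop.
    Inference: activations are rescaled by the keep probability (1 - p_drop)
    so their expected value matches what the network saw during training.
    """
    if training:
        rng = rng if rng is not None else np.random.default_rng()
        mask = rng.random(x.shape) >= p_drop  # keep each unit with prob 1 - p_drop
        return x * mask
    return x * (1.0 - p_drop)

# Training pass: roughly half the activations are zeroed.
train_out = dropout_forward(np.ones(8), p_drop=0.5, training=True)

# Inference pass: deterministic rescaling, no randomness.
infer_out = dropout_forward(np.ones(8), p_drop=0.5, training=False)
```

Modern frameworks typically use the "inverted" variant instead, scaling by 1/(1 - p_drop) at training time so that inference needs no rescaling at all; the expected activation is the same either way.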
