Author

AI Architect Hub

Discuss AI and architecture; a ten-year veteran of major tech companies now transitioning to AI and continuing the journey.

Articles

Likes

144

Views

Comments

Latest from AI Architect Hub

21 recent articles

AI Architect Hub

Jun 16, 2026 · Operations

How to Build a Loop Engineering System: A Ready‑to‑Deploy Checklist

This article provides a step‑by‑step checklist covering six modules—from pre‑planning and requirement standardization to deployment and ongoing ops—detailing templates, core components, sandbox isolation, scheduling architecture, monitoring, and acceptance criteria for implementing Loop Engineering in both quick‑start and enterprise‑grade scenarios.

AutomationCI/CDDevOps

0 likes · 14 min read

How to Build a Loop Engineering System: A Ready‑to‑Deploy Checklist

AI Architect Hub

May 27, 2026 · R&D Management

Hermes Kanban Deep Dive with a Real-World Public Account Matrix Management System

This article explains Hermes Kanban's multi‑agent orchestration features, core concepts, and a step‑by‑step case study that builds a public‑account matrix management system, demonstrating task decomposition, parallel execution, dependency handling, human intervention, and best‑practice guidelines.

Backend DevelopmentHermes KanbanPython

0 likes · 15 min read

Hermes Kanban Deep Dive with a Real-World Public Account Matrix Management System

AI Architect Hub

May 10, 2026 · Artificial Intelligence

RAG Series Recap: From Chunking to Prompt – A Complete Technical Roadmap

This article systematically reviews the nine‑stage RAG pipeline—from data cleaning and text chunking through embedding, vector indexing, retrieval, reranking, and finally prompt assembly—highlighting core concepts, practical code snippets, common pitfalls, and optimization tips for building production‑grade systems.

AIEmbeddingLLM

0 likes · 22 min read

RAG Series Recap: From Chunking to Prompt – A Complete Technical Roadmap

AI Architect Hub

May 3, 2026 · Artificial Intelligence

Choosing the Right Vector Database: Milvus, Chroma, Weaviate, Qdrant, FAISS Compared

This article compares five popular vector databases—Chroma, Milvus, Weaviate, Qdrant, and FAISS—detailing their positions, strengths, weaknesses, suitable scenarios, a selection‑dimension matrix, common pitfalls, code implementations for a unified RAG pipeline, best‑practice recommendations, and thought questions to guide engineers in choosing and migrating vector stores.

ChromaFAISSMilvus

0 likes · 23 min read

Choosing the Right Vector Database: Milvus, Chroma, Weaviate, Qdrant, FAISS Compared

AI Architect Hub

Apr 30, 2026 · Operations

Getting Started with Hermes: Connect WeChat Gateway and Create a Server Monitoring Skill

This tutorial walks through configuring an enterprise WeChat robot, linking it to Hermes' message gateway, building a skill that runs CPU and memory checks, scheduling periodic inspections, and automatically sending the results to the WeChat robot for hands‑free server performance monitoring.

AutomationHermesSkill

0 likes · 5 min read

Getting Started with Hermes: Connect WeChat Gateway and Create a Server Monitoring Skill

AI Architect Hub

Apr 30, 2026 · Artificial Intelligence

How AI Understands Your Queries: Core Techniques of Semantic Vector Search

The article explains why traditional keyword search often fails when user questions differ from knowledge‑base wording, introduces semantic search that matches queries and documents via vector similarity, details query understanding and rewriting techniques, lists common pitfalls, provides a full Python implementation, and shares best‑practice recommendations.

AIPythonRAG

0 likes · 16 min read

How AI Understands Your Queries: Core Techniques of Semantic Vector Search

AI Architect Hub

Apr 30, 2026 · Operations

Comprehensive Guide to Installing Hermes and Its Web UI

This tutorial walks through installing Hermes with a single curl command that checks Python 3.11+, installs Node v23+, sets up a virtual environment, configures channels like WeChat, QQ and Feishu, and then adds the open‑source Hermes Web UI via npm for full model configuration and chat capabilities.

ConfigurationHermesInstallation

0 likes · 3 min read

Comprehensive Guide to Installing Hermes and Its Web UI

AI Architect Hub

Apr 29, 2026 · Artificial Intelligence

Hermes Agent Guide: Install, Model Integration, Skills, MCP, Message Gateway

This comprehensive guide walks you through installing Hermes Agent, configuring over 20 model providers, setting up the Skill system, enabling MCP client/server modes, and connecting the agent to 17 messaging platforms, while also covering execution back‑ends, security settings, and troubleshooting tips.

AI AgentConfigurationHermes Agent

0 likes · 20 min read

Hermes Agent Guide: Install, Model Integration, Skills, MCP, Message Gateway

AI Architect Hub

Apr 27, 2026 · Artificial Intelligence

Why HNSW Can Speed Up Search 50× Compared to Brute‑Force? A Hands‑On Guide to Building Vector Indexes

The article explains why brute‑force vector search is painfully slow, introduces Flat, IVF, and HNSW index structures, compares their speed, memory and accuracy, shows common pitfalls, provides production‑grade Python code, and presents benchmark results that demonstrate HNSW’s superior speed‑accuracy trade‑off.

AIFAISSHNSW

0 likes · 12 min read

Why HNSW Can Speed Up Search 50× Compared to Brute‑Force? A Hands‑On Guide to Building Vector Indexes

AI Architect Hub

Apr 26, 2026 · Artificial Intelligence

Embedding Explained: How Vectorization Turns Text into Numbers for RAG

This article walks through why traditional keyword matching fails for RAG, explains the evolution from one‑hot encoding to Word2Vec and BERT, details sentence‑level embeddings and similarity metrics, compares leading Chinese and multilingual embedding models using the C‑MTEB benchmark, and provides practical LangChain code, deployment tips, and common pitfalls.

Chinese NLPEmbeddingLangChain

0 likes · 18 min read

Embedding Explained: How Vectorization Turns Text into Numbers for RAG