Author

Linyb Geek Road

Tech notes

Articles

Likes

Views

Comments

Latest from Linyb Geek Road

94 recent articles

Linyb Geek Road

May 10, 2026 · Artificial Intelligence

Designing Progressive Large‑Model Agents: Architecture, Frameworks, and Real‑World Practices

This article examines the evolution of large‑model agents, outlines four development stages, compares workflow, collaborative, and evolutionary frameworks, details core components such as perception, memory, planning, tools, and reflection, and explains how a progressive, loop‑based architecture can be applied across verticals like research, code generation, and complex workflow automation.

Agent architectureAlphaEvolveLLM agents

0 likes · 9 min read

Designing Progressive Large‑Model Agents: Architecture, Frameworks, and Real‑World Practices

Linyb Geek Road

May 9, 2026 · Artificial Intelligence

Why Overly Long Context Files Reduce AI Agent Success by 3% and Raise Token Cost 20%

The article shows that adding redundant context to AI agents like Claude harms efficiency: each extra 50 lines dilutes attention, lowers task success by about 3 % and inflates token usage by roughly 20 %, because the model’s instruction budget is capped at 150‑200 tokens, so context files must be concise and focused on non‑derivable information.

AGENTS.mdAI agentsCLAUDE.md

0 likes · 17 min read

Why Overly Long Context Files Reduce AI Agent Success by 3% and Raise Token Cost 20%

Linyb Geek Road

May 8, 2026 · Backend Development

How RocketMQ Guarantees Strict Message Ordering

The article explains RocketMQ's ordered messaging model, detailing why disorder occurs during production and consumption, and describing the three-phase mechanism—using MessageQueueSelector, sequential storage, and single‑threaded ordered consumption—to ensure that messages are processed in the exact order they were produced.

MessageListenerOrderlyMessageQueueSelectorRocketMQ

0 likes · 7 min read

How RocketMQ Guarantees Strict Message Ordering

Linyb Geek Road

May 7, 2026 · Backend Development

How to Ensure High Availability When Third‑Party Services Keep Failing – An Interview‑Ready Guide

The article explains how to design a defensive layer that abstracts third‑party calls, implements client‑side rate limiting, retries, circuit breaking, observability, and mock testing, and shows how to present these practices effectively during a system‑design interview.

High AvailabilityObservabilitycircuit breaker

0 likes · 21 min read

How to Ensure High Availability When Third‑Party Services Keep Failing – An Interview‑Ready Guide

Linyb Geek Road

May 7, 2026 · Operations

A Decade of E‑Commerce Ops: How to Prevent System Outages and Ensure High Availability

The article outlines why e‑commerce systems fail, presents a four‑layer high‑availability defense—including load balancing, service isolation, data protection, and fallback mechanisms—plus concrete monitoring, alerting, and emergency response practices illustrated with real‑world scenarios and code samples.

Disaster RecoveryHigh Availabilitydatabase backup

0 likes · 6 min read

A Decade of E‑Commerce Ops: How to Prevent System Outages and Ensure High Availability

Linyb Geek Road

May 6, 2026 · Artificial Intelligence

Ensuring High Availability and Robustness for LLM Agents: Key Strategies and Pitfalls

The article breaks down the unique hard and soft failure modes of LLM‑driven agents and proposes a four‑layer defense—LLM call handling, tool execution isolation, execution‑chain checkpointing, and semantic‑level safeguards—plus observability practices to keep production agents stable and reliable.

AgentCheckpointLLM

0 likes · 15 min read

Ensuring High Availability and Robustness for LLM Agents: Key Strategies and Pitfalls

Linyb Geek Road

May 5, 2026 · Artificial Intelligence

Optimizing Retrieval and Generation Latency in High‑Concurrency RAG Agents

The article dissects latency in high‑concurrency RAG Agent pipelines, showing how retrieval, re‑ranking, and LLM generation each contribute milliseconds of delay, and presents system‑level tactics—from ANN index tuning and partitioned search to vLLM PagedAttention, continuous batching, speculative decoding, model quantization, routing, semantic caching, and pipeline parallelism—to dramatically cut end‑to‑end response time.

ANNLLMRAG

0 likes · 15 min read

Optimizing Retrieval and Generation Latency in High‑Concurrency RAG Agents

Linyb Geek Road

May 5, 2026 · Artificial Intelligence

How to Fully Evaluate a RAG System – Metrics for Retrieval and Generation Stages

The article explains why RAG systems require stage‑wise evaluation, detailing retrieval metrics such as Precision, Recall, F1, MRR, NDCG and Context Relevance, and generation metrics like Faithfulness, Answer Relevance and Completeness, while discussing LLM‑as‑Judge automation and a three‑layer assessment framework.

EvaluationLLM-as-JudgeRAG

0 likes · 14 min read

How to Fully Evaluate a RAG System – Metrics for Retrieval and Generation Stages

Linyb Geek Road

May 4, 2026 · Artificial Intelligence

Agent Principles, Architecture, and Engineering Practices for Stable AI Systems

The article breaks down the core loop of AI agents, distinguishes agents from static workflows, and presents engineering practices—such as harness testing, context management, skill loading, tool design, memory handling, multi‑agent coordination, evaluation reliability, and security—that are essential for building robust, cost‑effective agents.

AI agentsAgent architectureMemory management

0 likes · 20 min read

Agent Principles, Architecture, and Engineering Practices for Stable AI Systems

Linyb Geek Road

May 2, 2026 · Operations

2026 Linux Production Ops Command Guide: From Beginner to Expert

This comprehensive guide collects the most essential Linux commands for 2026 production environments, covering system information, service management, file operations, process and network monitoring, user and security administration, system maintenance, advanced shell tricks, and best‑practice checklists for services like MySQL and Redis.

AutomationProduction OpsShell Commands

0 likes · 26 min read

2026 Linux Production Ops Command Guide: From Beginner to Expert