Linyb Geek Road
Author

Linyb Geek Road

Tech notes

94
Articles
0
Likes
59
Views
0
Comments
Recent Articles

Latest from Linyb Geek Road

94 recent articles
Linyb Geek Road
Linyb Geek Road
May 10, 2026 · Artificial Intelligence

Designing Progressive Large‑Model Agents: Architecture, Frameworks, and Real‑World Practices

This article examines the evolution of large‑model agents, outlines four development stages, compares workflow, collaborative, and evolutionary frameworks, details core components such as perception, memory, planning, tools, and reflection, and explains how a progressive, loop‑based architecture can be applied across verticals like research, code generation, and complex workflow automation.

Agent architectureAlphaEvolveLLM agents
0 likes · 9 min read
Designing Progressive Large‑Model Agents: Architecture, Frameworks, and Real‑World Practices
Linyb Geek Road
Linyb Geek Road
May 9, 2026 · Artificial Intelligence

Why Overly Long Context Files Reduce AI Agent Success by 3% and Raise Token Cost 20%

The article shows that adding redundant context to AI agents like Claude harms efficiency: each extra 50 lines dilutes attention, lowers task success by about 3 % and inflates token usage by roughly 20 %, because the model’s instruction budget is capped at 150‑200 tokens, so context files must be concise and focused on non‑derivable information.

AGENTS.mdAI agentsCLAUDE.md
0 likes · 17 min read
Why Overly Long Context Files Reduce AI Agent Success by 3% and Raise Token Cost 20%
Linyb Geek Road
Linyb Geek Road
May 8, 2026 · Backend Development

How RocketMQ Guarantees Strict Message Ordering

The article explains RocketMQ's ordered messaging model, detailing why disorder occurs during production and consumption, and describing the three-phase mechanism—using MessageQueueSelector, sequential storage, and single‑threaded ordered consumption—to ensure that messages are processed in the exact order they were produced.

MessageListenerOrderlyMessageQueueSelectorRocketMQ
0 likes · 7 min read
How RocketMQ Guarantees Strict Message Ordering
Linyb Geek Road
Linyb Geek Road
May 7, 2026 · Operations

A Decade of E‑Commerce Ops: How to Prevent System Outages and Ensure High Availability

The article outlines why e‑commerce systems fail, presents a four‑layer high‑availability defense—including load balancing, service isolation, data protection, and fallback mechanisms—plus concrete monitoring, alerting, and emergency response practices illustrated with real‑world scenarios and code samples.

Disaster RecoveryHigh Availabilitydatabase backup
0 likes · 6 min read
A Decade of E‑Commerce Ops: How to Prevent System Outages and Ensure High Availability
Linyb Geek Road
Linyb Geek Road
May 5, 2026 · Artificial Intelligence

Optimizing Retrieval and Generation Latency in High‑Concurrency RAG Agents

The article dissects latency in high‑concurrency RAG Agent pipelines, showing how retrieval, re‑ranking, and LLM generation each contribute milliseconds of delay, and presents system‑level tactics—from ANN index tuning and partitioned search to vLLM PagedAttention, continuous batching, speculative decoding, model quantization, routing, semantic caching, and pipeline parallelism—to dramatically cut end‑to‑end response time.

ANNLLMRAG
0 likes · 15 min read
Optimizing Retrieval and Generation Latency in High‑Concurrency RAG Agents
Linyb Geek Road
Linyb Geek Road
May 5, 2026 · Artificial Intelligence

How to Fully Evaluate a RAG System – Metrics for Retrieval and Generation Stages

The article explains why RAG systems require stage‑wise evaluation, detailing retrieval metrics such as Precision, Recall, F1, MRR, NDCG and Context Relevance, and generation metrics like Faithfulness, Answer Relevance and Completeness, while discussing LLM‑as‑Judge automation and a three‑layer assessment framework.

EvaluationLLM-as-JudgeRAG
0 likes · 14 min read
How to Fully Evaluate a RAG System – Metrics for Retrieval and Generation Stages
Linyb Geek Road
Linyb Geek Road
May 4, 2026 · Artificial Intelligence

Agent Principles, Architecture, and Engineering Practices for Stable AI Systems

The article breaks down the core loop of AI agents, distinguishes agents from static workflows, and presents engineering practices—such as harness testing, context management, skill loading, tool design, memory handling, multi‑agent coordination, evaluation reliability, and security—that are essential for building robust, cost‑effective agents.

AI agentsAgent architectureMemory management
0 likes · 20 min read
Agent Principles, Architecture, and Engineering Practices for Stable AI Systems
Linyb Geek Road
Linyb Geek Road
May 2, 2026 · Operations

2026 Linux Production Ops Command Guide: From Beginner to Expert

This comprehensive guide collects the most essential Linux commands for 2026 production environments, covering system information, service management, file operations, process and network monitoring, user and security administration, system maintenance, advanced shell tricks, and best‑practice checklists for services like MySQL and Redis.

AutomationProduction OpsShell Commands
0 likes · 26 min read
2026 Linux Production Ops Command Guide: From Beginner to Expert