Tagged articles
1130 articles
Page 2 of 12
Alibaba Cloud Native
Alibaba Cloud Native
Aug 21, 2025 · Cloud Native

How Higress AI Gateway Optimizes LLM Load Balancing with Global, Prefix, and GPU‑Aware Algorithms

This article explains why traditional load‑balancing methods fall short for large language model services and introduces Higress AI Gateway's three specialized algorithms—global minimum‑request, prefix‑matching, and GPU‑aware load balancing—detailing their design, Redis‑based implementation, deployment steps, and performance gains.

GPULLMload balancing
0 likes · 11 min read
How Higress AI Gateway Optimizes LLM Load Balancing with Global, Prefix, and GPU‑Aware Algorithms
MaGe Linux Operations
MaGe Linux Operations
Aug 18, 2025 · Operations

Which Nginx Load‑Balancing Strategy Wins in Production? A Real‑World Guide

An experienced ops engineer shares a production incident caused by mis‑chosen Nginx load‑balancing, compares Weighted Round‑Robin and IP‑Hash in depth, presents a week‑long stress test, offers best‑practice configurations, common pitfalls, performance‑tuning tips, and actionable recommendations for reliable traffic distribution.

IP HashNginxWeighted Round Robin
0 likes · 9 min read
Which Nginx Load‑Balancing Strategy Wins in Production? A Real‑World Guide
Open Source Linux
Open Source Linux
Aug 15, 2025 · Operations

Master Nginx Load Balancing: Algorithms, Reverse Proxy & Config Examples

This guide explains how Nginx functions as a load balancer and reverse proxy, covering its event‑driven architecture, worker processes, and core mechanisms, and details common balancing algorithms such as round‑robin, least connections, IP hash, weighted round‑robin and weighted least connections with full configuration examples and monitoring commands.

NginxOperationsload balancing
0 likes · 10 min read
Master Nginx Load Balancing: Algorithms, Reverse Proxy & Config Examples
Practical DevOps Architecture
Practical DevOps Architecture
Aug 12, 2025 · Backend Development

Mastering Nginx Proxy Header Transmission: Key Configurations and Common Pitfalls

This article explains how Nginx forwards HTTP request headers to backend servers, details the default headers passed, shows how to customize header transmission with directives like proxy_set_header, and addresses typical issues such as missing client IP, incorrect Host headers, and security concerns.

http-headersload balancingproxy_set_header
0 likes · 9 min read
Mastering Nginx Proxy Header Transmission: Key Configurations and Common Pitfalls
MaGe Linux Operations
MaGe Linux Operations
Jul 21, 2025 · Artificial Intelligence

Master Multi‑GPU Load Balancing for OLLAMA: From Zero to Production

This guide walks you through configuring OLLAMA for multi‑GPU load balancing, covering hardware checks, CUDA setup, native and Docker deployment methods, detailed parameter tuning, advanced sharding strategies, troubleshooting, performance optimization, and production‑grade monitoring to maximize throughput and stability of large language models.

AI deploymentCUDAOllama
0 likes · 16 min read
Master Multi‑GPU Load Balancing for OLLAMA: From Zero to Production
Architect's Guide
Architect's Guide
Jul 21, 2025 · Operations

How to Achieve Five Nines: Practical High‑Availability Strategies for Modern Web Systems

This article explains key high‑availability concepts such as availability metrics, microservice modularization, load balancing, rate limiting, circuit breaking, isolation, retry strategies, rollback plans, stress testing, monitoring, and on‑call processes, providing concrete design guidelines for building resilient internet services.

Circuit BreakingMicroserviceshigh availability
0 likes · 12 min read
How to Achieve Five Nines: Practical High‑Availability Strategies for Modern Web Systems
Lin is Dream
Lin is Dream
Jul 21, 2025 · Operations

Unlock Nginx Power: Load Balancing, Static Serving, Logging, Security & Gray Release Guide

This article walks you through practical Nginx configurations covering reverse‑proxy load balancing, static resource handling, cache control, version hiding, JSON‑formatted logging, rate‑limiting, IP restrictions, gray‑release traffic splitting, security headers and DNS anti‑spoofing, with ready‑to‑use code examples.

ConfigurationNginxSecurity
0 likes · 20 min read
Unlock Nginx Power: Load Balancing, Static Serving, Logging, Security & Gray Release Guide
Su San Talks Tech
Su San Talks Tech
Jul 19, 2025 · Operations

Mastering Load Balancing: Architecture, Algorithms, and Real-World Pitfalls

This article explores the four‑layer load‑balancing architecture, five common algorithms (including Round Robin, Weighted RR, Least Connections, Consistent Hashing, and AI‑driven adaptive load), high‑availability design, deep pitfalls, and a self‑built load balancer implementation, providing practical code examples and best‑practice guidelines.

Backend ArchitectureOperationsdistributed algorithms
0 likes · 10 min read
Mastering Load Balancing: Architecture, Algorithms, and Real-World Pitfalls
Cognitive Technology Team
Cognitive Technology Team
Jul 16, 2025 · Databases

Mastering Dynamic Multi‑DataSource Management in Spring Boot: A Deep Dive

Dynamic‑datasource 4.3.1 offers a powerful AOP‑based, annotation‑driven solution for seamless multi‑database routing, read‑write separation, and load‑balancing in Spring Boot applications, while addressing common pitfalls such as transaction boundaries, connection‑pool configuration, health‑check failures, and version compatibility.

Multi-DataSourcedynamic-datasourcehealth check
0 likes · 18 min read
Mastering Dynamic Multi‑DataSource Management in Spring Boot: A Deep Dive
Ops Community
Ops Community
Jul 15, 2025 · Operations

Why 90% of Ops Teams Choose the Wrong LVS Mode – A Deep Dive into Performance

This article examines the four Linux Virtual Server (LVS) clustering modes—NAT, Direct Routing, Tunneling, and FULLNAT—detailing their architectures, data flows, configuration steps, advantages, disadvantages, and ideal use cases, helping operations engineers select the most suitable load‑balancing solution for high‑performance, scalable web services.

LVSOperationsload balancing
0 likes · 18 min read
Why 90% of Ops Teams Choose the Wrong LVS Mode – A Deep Dive into Performance
Raymond Ops
Raymond Ops
Jul 11, 2025 · Operations

Mastering Keepalived: Complete Guide to High‑Availability Load Balancing

This tutorial explains Keepalived’s VRRP‑based failover, IPVS rule generation, health‑checking, script integration, installation methods, detailed configuration files, notification handling, logging, brain‑split prevention, and VRRP scripting for building robust high‑availability clusters on Linux.

IPVSVRRPhigh availability
0 likes · 26 min read
Mastering Keepalived: Complete Guide to High‑Availability Load Balancing
Raymond Ops
Raymond Ops
Jul 5, 2025 · Operations

Mastering Nginx proxy_pass: When to Use / and How It Affects URLs

This article explains the nuances of Nginx's proxy_pass directive, illustrating how trailing slashes and URI paths affect backend request URLs, compares configurations across http and stream modules, and provides practical load‑balancing examples with detailed parameter settings.

ConfigurationNginxload balancing
0 likes · 15 min read
Mastering Nginx proxy_pass: When to Use / and How It Affects URLs
Java One
Java One
Jul 4, 2025 · Cloud Native

Building Microservices with Nacos: Registration, Discovery, Load Balancing & Config

This tutorial walks through setting up Nacos as a cloud‑native service registry and configuration center, covering installation modes, service registration, discovery, load‑balancing with Spring Cloud, dynamic configuration loading, namespace isolation, and best‑practice patterns for managing microservice environments.

Configuration CenterNacosSpring Cloud
0 likes · 23 min read
Building Microservices with Nacos: Registration, Discovery, Load Balancing & Config
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Jul 3, 2025 · Backend Development

Master Nginx: Core Features, Use Cases, and Complete Configuration Guide

This article provides a comprehensive overview of Nginx, covering its high‑performance architecture, key features, common deployment scenarios such as web serving, reverse proxy, load balancing and caching, detailed explanations of load‑balancing algorithms, and step‑by‑step configuration examples with essential directives.

Backend DevelopmentConfigurationNginx
0 likes · 8 min read
Master Nginx: Core Features, Use Cases, and Complete Configuration Guide
Code Wrench
Code Wrench
Jun 30, 2025 · Cloud Native

How We Cut Search Latency by 80% with an AWS Cloud‑Native Architecture

This article details a step‑by‑step AWS cloud‑native solution that combines 10 Gbps internal networking, Elasticsearch distributed indexing, Windows Server tuning, SQL Server read‑write splitting, and intelligent load‑balancing to reduce search latency by 80 % and boost database throughput by 300 % under million‑level concurrency.

AWSCloud NativeElasticsearch
0 likes · 6 min read
How We Cut Search Latency by 80% with an AWS Cloud‑Native Architecture
Open Source Linux
Open Source Linux
Jun 30, 2025 · Backend Development

How to Split Nginx Architecture into a Scalable Cluster: Database, Web Nodes, and Session Sharing

This guide explains how to break a monolithic LNMP stack into separate components—dedicated MySQL servers, multiple Nginx/PHP web nodes, shared static storage, and session persistence using Redis—while providing step‑by‑step commands, configuration snippets, and best‑practice recommendations for building a reliable, high‑performance Nginx cluster.

ClusterNginxdatabase sharding
0 likes · 18 min read
How to Split Nginx Architecture into a Scalable Cluster: Database, Web Nodes, and Session Sharing
Selected Java Interview Questions
Selected Java Interview Questions
Jun 27, 2025 · Backend Development

How to Fix Uneven Data Distribution in Multi‑Threaded Backend Systems

This article analyzes a production‑line data‑distribution issue where parallel processing and rule‑based assignment cause some employees to receive no tasks, explains why simply increasing batch size fails, and presents a Redis‑counter solution that balances efficiency with fair workload distribution.

data distributionload balancingredis
0 likes · 5 min read
How to Fix Uneven Data Distribution in Multi‑Threaded Backend Systems
Open Source Linux
Open Source Linux
Jun 27, 2025 · Operations

Mastering Nginx: Essential Configuration, Performance Tuning, and Real‑World Use Cases

This comprehensive guide walks you through Nginx’s key features, core configuration syntax, built‑in variables, global settings, common directives, HTTPS setup, CORS handling, gzip compression, access control, caching strategies, and practical examples of rewrite rules, load balancing, proxying, and server name matching, all illustrated with clear code snippets and diagrams.

NginxWeb servercaching
0 likes · 41 min read
Mastering Nginx: Essential Configuration, Performance Tuning, and Real‑World Use Cases
Architect's Guide
Architect's Guide
Jun 25, 2025 · Backend Development

Mastering Version‑Based Routing and Gray Deployments for Microservices

This article explains how to design and implement custom routing strategies—including default branch routing, version‑specific routing, and service‑specific routing—to achieve isolated version testing and gray‑release deployments in distributed microservice systems.

Gray DeploymentMicroservicesVersioning
0 likes · 9 min read
Mastering Version‑Based Routing and Gray Deployments for Microservices
Cognitive Technology Team
Cognitive Technology Team
Jun 21, 2025 · Fundamentals

Understanding Faults, Failures, and Fault Tolerance in Distributed Systems

This tutorial explains the definitions of faults and failures in distributed systems, explores their types and root causes, and presents fault‑tolerance mechanisms such as replication, checkpointing, redundancy, error detection, load balancing, and consensus algorithms to build resilient architectures.

Distributed Systemsconsensus algorithmsdata replication
0 likes · 10 min read
Understanding Faults, Failures, and Fault Tolerance in Distributed Systems
MaGe Linux Operations
MaGe Linux Operations
Jun 17, 2025 · Operations

Master Nginx: Essential Configurations and Real‑World Examples

This comprehensive guide walks you through Nginx's core features, configuration syntax, and practical modules such as server blocks, load balancing, SSL, gzip compression, CORS handling, access control, and caching, complete with clear code snippets and real‑world deployment examples.

NginxSSLload balancing
0 likes · 32 min read
Master Nginx: Essential Configurations and Real‑World Examples
Java Captain
Java Captain
Jun 17, 2025 · Backend Development

Mastering Client‑Side Load Balancing with Spring Cloud Ribbon

This article explains how Spring Cloud Ribbon provides client‑side load balancing, outlines its classification, core mechanisms, common algorithms, and shows step‑by‑step code to replace the default balancing rule in a Spring Boot microservice.

JavaMicroservicesRibbon
0 likes · 5 min read
Mastering Client‑Side Load Balancing with Spring Cloud Ribbon
Raymond Ops
Raymond Ops
Jun 13, 2025 · Operations

Master HAProxy: Step-by-Step Deployment and Configuration Guide

This article provides a comprehensive, hands‑on guide to installing HAProxy, configuring global, defaults, listen, frontend, and backend sections, setting up ACL‑based load balancing, preparing backend web servers, testing the setup, and accessing the HAProxy statistics page.

ACLBackendConfiguration
0 likes · 16 min read
Master HAProxy: Step-by-Step Deployment and Configuration Guide
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Jun 10, 2025 · Operations

Mastering Load Balancing: From Single‑Layer to Billion‑Scale Architectures

This article explains the essential role of load balancing in modern distributed systems and walks through single‑layer, double‑layer, and billion‑scale architectures, highlighting their design principles, benefits, trade‑offs, and typical deployment scenarios for high‑availability and high‑performance applications.

LVSNginxScalability
0 likes · 6 min read
Mastering Load Balancing: From Single‑Layer to Billion‑Scale Architectures
Raymond Ops
Raymond Ops
Jun 3, 2025 · Operations

Mastering Nginx Load Balancing and Reverse Proxy Configuration

This article explains how to configure Nginx load balancing with upstream blocks, server weight, ip_hash, and logging variables, and details reverse‑proxy settings such as proxy_pass, proxy_method, header handling, redirects, and error‑fallback options, providing complete code examples for each feature.

ConfigurationNginxload balancing
0 likes · 12 min read
Mastering Nginx Load Balancing and Reverse Proxy Configuration
Liangxu Linux
Liangxu Linux
Jun 2, 2025 · Operations

10 Must‑Know Ops Tools to Transform Reactive Firefighting into Proactive Management

This guide presents ten essential operations tools—including Zabbix, Prometheus, MySQL, Redis, Ansible, Jenkins, Docker, Kubernetes, LVS, and Kafka—covering monitoring, databases, automation, containerization, and load balancing, to help engineers shift from reactive firefighting to proactive, efficient system management.

AutomationContainersMessaging
0 likes · 4 min read
10 Must‑Know Ops Tools to Transform Reactive Firefighting into Proactive Management
Linux Cloud Computing Practice
Linux Cloud Computing Practice
May 29, 2025 · Cloud Computing

2025 Linux Cloud Computing Interview Questions: 142 Essential Q&A

An extensive collection of 2025 Linux cloud computing interview questions—covering fundamentals, system architecture, security, shell scripting, networking, load balancing, and practical command‑line tasks—provides job seekers with a ready‑to‑study resource to master the knowledge most interviewers demand.

Securityinterview-questionsload balancing
0 likes · 8 min read
2025 Linux Cloud Computing Interview Questions: 142 Essential Q&A
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
May 27, 2025 · Operations

Understanding L4 and L7 Load Balancing Architectures

This article explains the fundamentals of Layer‑4 and Layer‑7 load balancing, compares their advantages and disadvantages, and describes how a hybrid approach can combine high‑performance traffic handling with flexible application‑level routing for large‑scale systems.

L4L7Operations
0 likes · 4 min read
Understanding L4 and L7 Load Balancing Architectures
php Courses
php Courses
May 26, 2025 · Backend Development

Implementing Load‑Balancer‑Like Auto‑Decision Logic in PHP Applications

This article explores how to embed load‑balancer concepts such as intelligent request distribution, health checks, automatic failover, and dynamic strategy adjustment directly into PHP applications using algorithms like weighted round‑robin, response‑time balancing, and circuit‑breaker patterns, providing code examples and practical deployment scenarios.

PHPfailoverhealth check
0 likes · 11 min read
Implementing Load‑Balancer‑Like Auto‑Decision Logic in PHP Applications
FunTester
FunTester
May 26, 2025 · Backend Development

Thread‑Safe Round Robin Load Balancer Using Java AtomicInteger

This article explains how to implement a thread‑safe round‑robin load balancer in Java using AtomicInteger, discusses its advantages and limitations, provides sample code for single‑thread and multithreaded scenarios, and suggests further optimizations such as health checks, weighted routing, and dynamic server updates.

AtomicIntegerBackendJava
0 likes · 11 min read
Thread‑Safe Round Robin Load Balancer Using Java AtomicInteger
ITPUB
ITPUB
May 18, 2025 · Backend Development

Why Tomcat Alone Isn’t Enough: Leveraging Nginx for Reliable Web Services

The article explains the practical limitations of using only Tomcat—such as DNS IP caps, scaling costs, deployment downtime, and lack of health checks—and demonstrates how adding Nginx as a reverse proxy and load balancer resolves these issues with concrete configuration examples.

DNSDeploymentNginx
0 likes · 10 min read
Why Tomcat Alone Isn’t Enough: Leveraging Nginx for Reliable Web Services
Full-Stack DevOps & Kubernetes
Full-Stack DevOps & Kubernetes
May 9, 2025 · Operations

Master Nginx Reverse Proxy & Load Balancing: Complete Configuration and Performance Tuning Guide

This article walks through Nginx's role as a high‑performance reverse proxy and load balancer, explains core concepts, provides step‑by‑step configuration examples—including upstream pools, server blocks, and location directives—and details performance optimizations such as gzip, connection reuse, worker settings, static‑file caching, and proxy caching.

NginxPerformance OptimizationServer Configuration
0 likes · 15 min read
Master Nginx Reverse Proxy & Load Balancing: Complete Configuration and Performance Tuning Guide
AI Algorithm Path
AI Algorithm Path
May 9, 2025 · Artificial Intelligence

A Visual Guide to Mixture of Experts (MoE) Architecture in Large Language Models

This article explains the Mixture of Experts (MoE) technique used in modern LLMs, detailing its core components—experts and router—comparing dense and sparse layers, describing load‑balancing, expert capacity, and routing strategies, and showcasing real‑world examples such as Switch Transformer, Vision‑MoE, and Mixtral 8x7B.

Expert CapacityLLMMixture of Experts
0 likes · 15 min read
A Visual Guide to Mixture of Experts (MoE) Architecture in Large Language Models
Bilibili Tech
Bilibili Tech
May 9, 2025 · Artificial Intelligence

How an AI Gateway Scales LLM Services: Architecture, Auth, Quotas, and Load Balancing

This article explains the design of an AI gateway that centralizes LLM access, detailing its background, overall architecture, authentication, quota management, multi‑model routing, load‑balancing strategies, multi‑tenant isolation, observability features, and the supported API protocols for enterprise integration.

AI gatewayAuthenticationLLM
0 likes · 17 min read
How an AI Gateway Scales LLM Services: Architecture, Auth, Quotas, and Load Balancing
Architect
Architect
May 3, 2025 · Backend Development

Why Rebuild a Job Scheduler? Inside a Lightweight Distributed Timing Framework

This article explains the motivation, design choices, and implementation details of a custom distributed job scheduling framework, covering its architecture, load‑balancing strategy, message‑queue handling, persistence mechanisms, and key code snippets, while comparing it to existing solutions like Quartz, XXL‑Job, and PowerJob.

Distributed SystemsJavaMessage Queue
0 likes · 16 min read
Why Rebuild a Job Scheduler? Inside a Lightweight Distributed Timing Framework
MaGe Linux Operations
MaGe Linux Operations
Apr 30, 2025 · Backend Development

Top 40 Nginx Interview Questions and Answers Explained

This comprehensive guide covers the most common Nginx interview questions, including its definition, advantages, architecture, request handling, high‑concurrency mechanisms, proxy types, configuration files, load‑balancing algorithms, rate limiting, health checks, compression, and many practical code examples for real‑world deployment.

load balancing
0 likes · 33 min read
Top 40 Nginx Interview Questions and Answers Explained
Network Intelligence Research Center (NIRC)
Network Intelligence Research Center (NIRC)
Apr 30, 2025 · Industry Insights

Network Load Balancing: Emerging Techniques and Innovative Insights

This article surveys current network load‑balancing approaches—including CONGA, Hula, DRILL, Hermes, MP‑RDMA, ConWeave, Proteus, and CAVER—detailing their granularity, information exchange, signaling methods, and the performance gains they achieve in modern data‑center environments.

RDMAdatacenter networkingin-network reordering
0 likes · 13 min read
Network Load Balancing: Emerging Techniques and Innovative Insights
Linux Cloud Computing Practice
Linux Cloud Computing Practice
Apr 27, 2025 · Backend Development

Why Is Nginx the World’s Most Popular Web Server? A Deep Dive

This article explains why Nginx has overtaken Apache to become the leading web server, covering its performance‑focused design, high stability, master‑worker architecture, core configuration, location directives, access control, reverse‑proxy features, and real‑world deployment examples from major Chinese internet companies.

Backend DevelopmentConfigurationNginx
0 likes · 5 min read
Why Is Nginx the World’s Most Popular Web Server? A Deep Dive
macrozheng
macrozheng
Apr 22, 2025 · Operations

How Nginx Turns a Simple HTML File into a High‑Performance Gateway

This article explains how Nginx works as an HTTP server and reverse proxy, detailing its modular gateway capabilities, single‑thread design, multi‑worker architecture, shared memory, proxy caching, master‑worker coordination, and how to address its single‑point‑of‑failure limitations.

Web servergatewayload balancing
0 likes · 12 min read
How Nginx Turns a Simple HTML File into a High‑Performance Gateway
MaGe Linux Operations
MaGe Linux Operations
Apr 12, 2025 · Backend Development

Master Nginx Basics: From Installation to Advanced Configuration

This comprehensive guide walks you through Nginx fundamentals, including its advantages, installation methods, core configuration hierarchy, key modules such as stub_status, random_index, sub_filter and gzip, log management, virtual host setup, HTTPS, rewrite rules, and smooth upgrade techniques, empowering you to deploy and optimize a high‑performance web server.

ConfigurationHTTPSload balancing
0 likes · 32 min read
Master Nginx Basics: From Installation to Advanced Configuration
IT Services Circle
IT Services Circle
Apr 11, 2025 · Operations

Understanding Nginx: HTTP Server, Reverse Proxy, and Modular Gateway Architecture

This article explains how Nginx transforms a simple local HTML file into a high‑performance HTTP server and reverse‑proxy gateway, detailing its modular capabilities, configuration via nginx.conf, single‑thread design, multi‑worker processes, shared memory, proxy caching, master‑worker coordination, and the challenges of single‑point failure.

HTTP serverNginxgateway
0 likes · 10 min read
Understanding Nginx: HTTP Server, Reverse Proxy, and Modular Gateway Architecture
Java Tech Enthusiast
Java Tech Enthusiast
Apr 10, 2025 · Backend Development

Understanding Nginx: HTTP Server, Reverse Proxy, and Scalable Architecture

The article shows how adding HTML tags turns a plain‑text file into a web page, explains the need for an HTTP service to serve it, introduces reverse proxies for load‑balancing and address hiding, and outlines Nginx’s event‑driven architecture with master and worker processes, shared memory, proxy cache, multi‑protocol support, configurable modules, and scaling options, while warning of a single‑instance failure and recommending cluster mode.

HTTP serverMaster ProcessNginx
0 likes · 9 min read
Understanding Nginx: HTTP Server, Reverse Proxy, and Scalable Architecture
Linux Cloud Computing Practice
Linux Cloud Computing Practice
Apr 10, 2025 · Backend Development

Why Is Nginx the Most Popular Web Server?

This article explains why Nginx has overtaken Apache to become the leading web server, highlighting its performance‑optimized design, stability, widespread adoption by major Chinese tech firms, core functions such as load balancing and caching, and detailed insights into its master‑worker architecture and configuration.

Backend DevelopmentConfigurationNginx
0 likes · 4 min read
Why Is Nginx the Most Popular Web Server?
Raymond Ops
Raymond Ops
Apr 5, 2025 · Operations

Master Nginx Load Balancing: Step‑by‑Step Configuration Guide

This article explains how to configure Nginx as a load balancer for web applications, covering upstream and proxy_pass definitions, the three built‑in balancing methods, weight and connection settings, fail‑over options, and practical code examples for both HTTP and HTTPS deployments.

ConfigurationNginxOperations
0 likes · 11 min read
Master Nginx Load Balancing: Step‑by‑Step Configuration Guide
IT Architects Alliance
IT Architects Alliance
Apr 2, 2025 · Backend Development

Designing High‑Concurrency Backend Architecture for E‑commerce Platforms

The article explains how to design a scalable, highly available backend system capable of handling millions of requests per second by defining key performance metrics, estimating traffic with the 2/8 rule, and applying architectural patterns such as load‑balanced clusters, vertical service splitting, distributed caching, and database master‑slave replication, illustrated with a Taobao case study.

Backend ArchitectureDatabase Replicationdistributed cache
0 likes · 14 min read
Designing High‑Concurrency Backend Architecture for E‑commerce Platforms
JD Tech Talk
JD Tech Talk
Mar 24, 2025 · Artificial Intelligence

MaRCA: Multi‑Agent Reinforcement Learning Computation Allocation for Full‑Chain Ad Serving

This article presents MaRCA, a multi‑agent reinforcement learning framework that allocates computation resources across the full ad‑serving chain by modeling user value, compute consumption, and action rewards, enabling fine‑grained power‑tilting toward high‑quality traffic and achieving significant business gains under strict latency constraints.

AI Optimizationad servingcomputation allocation
0 likes · 16 min read
MaRCA: Multi‑Agent Reinforcement Learning Computation Allocation for Full‑Chain Ad Serving
JD Cloud Developers
JD Cloud Developers
Mar 24, 2025 · Artificial Intelligence

How Multi-Agent Reinforcement Learning Boosts Ad Computation Allocation

This article presents MaRCA, a multi‑agent reinforcement‑learning framework that allocates computation resources across the full ad‑serving chain, modeling user value, compute cost, and action rewards to maximize ad revenue while keeping system load stable under fluctuating traffic.

AIMulti-Agentad optimization
0 likes · 16 min read
How Multi-Agent Reinforcement Learning Boosts Ad Computation Allocation
Architect's Guide
Architect's Guide
Mar 20, 2025 · Backend Development

Implementing a Lightweight Service Registry with Eureka for Task Distribution

This article describes how to build a lightweight service registry using Eureka Server that also acts as a client, enabling a task dispatcher to discover and load‑balance task executor instances without deploying a separate registry, including Maven dependencies, Spring annotations, Feign client definitions, and troubleshooting steps.

Backend DevelopmentMicroservicesSpring Cloud
0 likes · 11 min read
Implementing a Lightweight Service Registry with Eureka for Task Distribution
Linux Kernel Journey
Linux Kernel Journey
Mar 17, 2025 · Fundamentals

How Linux Multicore Scheduling Unlocks CPU Potential

The article explains how Linux’s multicore scheduler distributes tasks across CPU cores, describes the core CFS and real‑time algorithms, details load‑balancing mechanisms such as pull/push and active/passive strategies, and discusses power, thermal, and algorithmic optimizations for servers and embedded devices.

CFSDVFSLinux
0 likes · 24 min read
How Linux Multicore Scheduling Unlocks CPU Potential
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 14, 2025 · Artificial Intelligence

Solving Rate Limiting, Load Balancing, and Data Challenges in AI Inference with Tair

This article explains how AI inference services can tackle five core problems—rate limiting, load balancing, asynchronous processing, user data management, and index enhancement—by leveraging Tair's rich data structures, offering practical code examples, architectural diagrams, and a comparison with alternative solutions.

AI inferenceRAGTair
0 likes · 20 min read
Solving Rate Limiting, Load Balancing, and Data Challenges in AI Inference with Tair
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Mar 6, 2025 · Backend Development

ROFF: A High‑Performance Seven‑Layer Rust‑Based Gateway with TLS Offload, QUIC/HTTP3, and Dynamic Module System

ROFF is a Rust‑implemented, seven‑layer gateway that delivers high‑throughput load balancing with memory‑safe performance, TLS hardware offload, native QUIC/HTTP3 support, a hot‑reload/upgrade mechanism, and an extensible module system allowing over thirty built‑in filters and custom Rust macros.

HTTP/3QUICRust
0 likes · 28 min read
ROFF: A High‑Performance Seven‑Layer Rust‑Based Gateway with TLS Offload, QUIC/HTTP3, and Dynamic Module System
Ctrip Technology
Ctrip Technology
Mar 6, 2025 · Backend Development

Design and Implementation of Ctrip's Soft Load Balancer: Evolving from Nginx Reverse Proxy to an OpenResty‑Based API Gateway

This article details how Ctrip's Soft Load Balancer (SLB) transitioned from a simple Nginx reverse‑proxy to a multi‑datacenter, dynamic routing gateway using OpenResty, Lua scripts, and a three‑layer architecture that eliminates reloads and supports high‑frequency configuration updates.

LuaOpenRestyapi-gateway
0 likes · 14 min read
Design and Implementation of Ctrip's Soft Load Balancer: Evolving from Nginx Reverse Proxy to an OpenResty‑Based API Gateway
Pan Zhi's Tech Notes
Pan Zhi's Tech Notes
Mar 5, 2025 · Backend Development

Spring Cloud Ribbon: Hands‑On Guide to Client‑Side Load Balancing

This article walks through the background of Spring Cloud Ribbon, explains its client‑side load‑balancing role, provides step‑by‑step code to create multiple provider services and a consumer that uses Ribbon, and shows how to configure different balancing algorithms and retry mechanisms.

MicroservicesRibbonSpring Boot
0 likes · 12 min read
Spring Cloud Ribbon: Hands‑On Guide to Client‑Side Load Balancing
Architect
Architect
Mar 2, 2025 · Artificial Intelligence

Demystifying Mixture of Experts: How MoE Boosts LLMs and Vision Models

This article explains the Mixture of Experts (MoE) architecture, detailing experts, routers, dense vs. sparse layers, load‑balancing strategies such as KeepTopK, auxiliary loss, capacity constraints, the Switch Transformer simplification, and how MoE is applied to both language and vision models, illustrated with concrete examples and parameter counts.

Mixture of ExpertsMoESparse Models
0 likes · 17 min read
Demystifying Mixture of Experts: How MoE Boosts LLMs and Vision Models
AI Product Manager Community
AI Product Manager Community
Feb 28, 2025 · Artificial Intelligence

What’s Inside DeepSeek’s Open‑Source Week? DualPipe, EPLB, 3FS and More Explained

DeepSeek’s recent Open‑Source Week unveiled a suite of AI‑focused tools—including the DualPipe pipeline parallelism algorithm, the EPLB expert load balancer, detailed training‑inference framework data, the high‑performance 3FS parallel file system, and the Smallpond data‑processing framework—each with GitHub links and performance highlights.

AIDistributed Trainingfile system
0 likes · 7 min read
What’s Inside DeepSeek’s Open‑Source Week? DualPipe, EPLB, 3FS and More Explained
Architect's Must-Have
Architect's Must-Have
Feb 26, 2025 · Backend Development

How to Implement Version-Based Routing and Gray Deployment in Microservices

This article explains how to design custom routing strategies for versioned microservices, including default master branch routing, unified version routing, and service‑specific routing, and shows how to apply these rules to achieve gray deployments with load‑balancing and seamless version isolation.

BackendGray DeploymentMicroservices
0 likes · 10 min read
How to Implement Version-Based Routing and Gray Deployment in Microservices
MaGe Linux Operations
MaGe Linux Operations
Feb 25, 2025 · Backend Development

Mastering Nginx: From Basics to Advanced Load Balancing and High‑Concurrency Techniques

This comprehensive guide explains what Nginx is, its advantages, typical use cases, request processing, high‑concurrency architecture, forward and reverse proxy concepts, directory layout, configuration directives, cookie vs session, load‑balancing algorithms, rate limiting, static‑dynamic separation, CDN integration, health checks, compression, and worker process tuning, providing practical code examples for each topic.

Configurationload balancingrate limiting
0 likes · 31 min read
Mastering Nginx: From Basics to Advanced Load Balancing and High‑Concurrency Techniques
Radish, Keep Going!
Radish, Keep Going!
Feb 16, 2025 · Fundamentals

Master Consistent Hashing: Principles, Virtual Nodes, and Go Implementation

Consistent hashing, a cornerstone of distributed systems, balances load, enhances scalability, and minimizes data migration; this article explains its fundamentals, the drawbacks of basic implementations, the role of virtual nodes, and provides a complete Go-language example with code for adding, removing, and locating nodes.

Distributed Systemsconsistent hashingload balancing
0 likes · 11 min read
Master Consistent Hashing: Principles, Virtual Nodes, and Go Implementation
Architect
Architect
Feb 10, 2025 · Artificial Intelligence

Evolution of DeepSeek Mixture‑of‑Experts (MoE) Architecture from V1 to V3

This article reviews the development of DeepSeek's Mixture-of-Experts (MoE) models, tracing their evolution from the original DeepSeekMoE V1 through V2 to V3, detailing architectural innovations such as fine‑grained expert segmentation, shared‑expert isolation, load‑balancing losses, device‑limited routing, and the shift from softmax to sigmoid gating.

DeepSeekLLMMixture of Experts
0 likes · 21 min read
Evolution of DeepSeek Mixture‑of‑Experts (MoE) Architecture from V1 to V3
Raymond Ops
Raymond Ops
Feb 5, 2025 · Operations

Mastering IPVS: Build High‑Performance Load Balancers with LVS

This article explains the concept of IPVS (IP Virtual Server) as a layer‑4 load balancer, compares ipvs with iptables, details LVS scheduling algorithms, and provides step‑by‑step commands for configuring ipvsadm on both load‑balancer and real‑server nodes, including VIP setup and client testing.

IPVSLVSLinux networking
0 likes · 9 min read
Mastering IPVS: Build High‑Performance Load Balancers with LVS
dbaplus Community
dbaplus Community
Feb 3, 2025 · Databases

How to Diagnose and Fix Extreme ClickHouse Load Spikes in Production

A production ClickHouse cluster suddenly showed blacked‑out dashboards due to CPU load soaring above 2,700%, and this guide walks through step‑by‑step diagnostics using system tables, a simple query to spot heavy SQL, and practical remediation actions to restore normal load levels.

ClickHouseDatabase PerformanceSQL Optimization
0 likes · 7 min read
How to Diagnose and Fix Extreme ClickHouse Load Spikes in Production
Linux Cloud Computing Practice
Linux Cloud Computing Practice
Jan 23, 2025 · Backend Development

Why Nginx Is the Go‑to Choice for High‑Performance Web and Proxy Services

This article explains Nginx’s key advantages—high‑concurrency handling, efficient load balancing, versatile mail proxying, simple installation and zero‑downtime upgrades—illustrates its architecture with master/worker processes, and outlines essential configuration topics from basic setup to advanced HTTPS and reverse‑proxy techniques.

ConfigurationHTTPSNginx
0 likes · 7 min read
Why Nginx Is the Go‑to Choice for High‑Performance Web and Proxy Services
dbaplus Community
dbaplus Community
Jan 12, 2025 · Backend Development

How Redis Stream Replaced MQ in High‑Throughput Traffic Processing

This article explains why the traffic team switched from traditional MQ to Redis Stream, covering the underlying concepts, design choices, implementation details, load‑balancing strategies, cross‑datacenter handling, monitoring metrics, performance benchmarks, and practical lessons learned.

Backend DevelopmentCRedis Stream
0 likes · 13 min read
How Redis Stream Replaced MQ in High‑Throughput Traffic Processing
Raymond Ops
Raymond Ops
Jan 11, 2025 · Operations

How to Build a Highly Available Load Balancer with LVS and Keepalived

This tutorial explains how to design and deploy a high‑availability web cluster using Linux Virtual Server (LVS) and Keepalived, covering terminology, test environment setup, detailed configuration steps, HA testing procedures, and a concise summary of the solution.

LVSLinuxhigh availability
0 likes · 11 min read
How to Build a Highly Available Load Balancer with LVS and Keepalived
Java Tech Enthusiast
Java Tech Enthusiast
Jan 10, 2025 · Backend Development

RocketMQ Consumer Scaling and Load Balancing Strategies

In RocketMQ, adding consumers speeds consumption only when they are fewer than MessageQueues, while pull delays arise from ProcessQueue thresholds or ordered‑lock timeouts; slow processing often stems from heavy business logic or external calls, and load can be balanced using average, round‑robin, custom, machine‑room, nearby‑room, or consistent‑hash allocation strategies.

ConsumerMessage QueueRocketMQ
0 likes · 9 min read
RocketMQ Consumer Scaling and Load Balancing Strategies
IT Architects Alliance
IT Architects Alliance
Jan 9, 2025 · Operations

Load Balancing Strategies for High Availability in Distributed Systems

This article explores the challenges and opportunities of distributed architectures and explains how various static and dynamic load‑balancing strategies, hardware and software balancers, redundancy, health checks, and failover mechanisms together ensure high availability, illustrated with real‑world e‑commerce and live‑streaming case studies and future trends.

OperationsSystem Architecturehigh availability
0 likes · 20 min read
Load Balancing Strategies for High Availability in Distributed Systems
IT Architects Alliance
IT Architects Alliance
Jan 8, 2025 · Backend Development

Mastering High Concurrency in Distributed Systems: Strategies & Real-World Cases

This article explores the challenges of handling massive simultaneous requests in distributed architectures and presents practical solutions such as load balancing, distributed caching, asynchronous processing, and sharding, illustrated with case studies from major e‑commerce and social platforms.

Backend ArchitectureDistributed Systemsasynchronous processing
0 likes · 20 min read
Mastering High Concurrency in Distributed Systems: Strategies & Real-World Cases
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Jan 8, 2025 · Backend Development

Eight High‑Performance Architecture Solutions for Large‑Scale Systems

This article outlines eight essential high‑performance architecture techniques—including load balancing, asynchronous processing, database optimization, caching, distributed clusters, CDN, microservices, and rate‑limiting/circuit‑breaking—to improve scalability, availability, and responsiveness of large‑scale backend systems.

Database Optimizationasynchronous processingcaching
0 likes · 9 min read
Eight High‑Performance Architecture Solutions for Large‑Scale Systems
IT Architects Alliance
IT Architects Alliance
Jan 1, 2025 · Backend Development

Comprehensive Guide to IT Architecture Optimization for System Performance

This article explores practical techniques for boosting system performance through IT architecture optimization, covering caching strategies, database query and connection‑pool tuning, load‑balancing, asynchronous messaging, code‑level refinements, memory pooling, network tricks, and real‑world case studies.

AsynchronousCode Optimizationcaching
0 likes · 18 min read
Comprehensive Guide to IT Architecture Optimization for System Performance
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Dec 30, 2024 · Backend Development

How to Define and Tackle High Concurrency: Strategies and Code Samples

This article explains what constitutes high concurrency, categorizes load levels, and presents practical solutions such as load balancing, database sharding, query optimization, caching, message queues, and rate‑limiting, complete with code examples for implementing these techniques in backend systems.

database shardinghigh concurrencyload balancing
0 likes · 7 min read
How to Define and Tackle High Concurrency: Strategies and Code Samples
IT Architects Alliance
IT Architects Alliance
Dec 29, 2024 · Operations

Design Principles and Key Technologies for High‑Availability Systems

The article explains why 24/7 high‑availability systems are essential for modern enterprises and details core design principles, layered architecture, and critical technologies such as redundancy, load balancing, caching, elastic scaling, monitoring, and fault‑tolerance to ensure continuous, reliable service.

System Designcloud computinghigh availability
0 likes · 23 min read
Design Principles and Key Technologies for High‑Availability Systems
Linux Cloud Computing Practice
Linux Cloud Computing Practice
Dec 25, 2024 · Backend Development

Why Nginx Dominates as the World’s Most Popular Web Server

This article explains how Nginx’s performance‑oriented design, high stability, mature codebase, and versatile features such as load balancing, reverse proxy, and caching have made it the leading web server worldwide, with widespread adoption by major Chinese internet companies and detailed configuration guidance.

Backend DevelopmentNginxServer Configuration
0 likes · 5 min read
Why Nginx Dominates as the World’s Most Popular Web Server
Selected Java Interview Questions
Selected Java Interview Questions
Dec 24, 2024 · Backend Development

Design and Implementation of a Custom Distributed Job Scheduling Framework (k‑job)

This article introduces the motivation, architecture, technology choices, and key implementation details of a lightweight, highly extensible distributed job scheduling framework built on gRPC, Protobuf, a custom name‑server, and a bespoke message‑queue, addressing limitations of existing solutions like Quartz, XXL‑Job, and PowerJob.

Distributed SystemsJavaJob Scheduling
0 likes · 14 min read
Design and Implementation of a Custom Distributed Job Scheduling Framework (k‑job)
360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
Dec 20, 2024 · Backend Development

Go‑Zero Load Balancing Explained: Stateless vs Stateful Algorithms and gRPC Integration

This article explores the fundamentals of load balancing in microservice architectures, detailing both stateless and stateful strategies such as round‑robin, random, hash, and weighted algorithms, and demonstrates how go‑zero and gRPC implement these mechanisms through custom balancers, service discovery, and request scheduling.

gRPCgo-zeroload balancing
0 likes · 11 min read
Go‑Zero Load Balancing Explained: Stateless vs Stateful Algorithms and gRPC Integration
MaGe Linux Operations
MaGe Linux Operations
Dec 17, 2024 · Operations

Mastering Keepalived: Complete Guide to High‑Availability Load Balancing with VRRP

This article provides a comprehensive walkthrough of Keepalived, covering its VRRP‑based address failover, architecture components, installation methods, detailed configuration files, notification scripts, logging, unicast/multicast settings, brain‑split prevention, and practical LVS‑Keepalived high‑availability deployment steps.

LVSLinuxNetworking
0 likes · 25 min read
Mastering Keepalived: Complete Guide to High‑Availability Load Balancing with VRRP
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Dec 4, 2024 · Backend Development

How Nginx Achieves Million-Connection Concurrency: Architecture and Optimization Techniques

This article explains how Nginx uses an event‑driven, asynchronous non‑blocking I/O model together with epoll/kqueue and various optimization strategies such as keep‑alive, caching, efficient data structures and load balancing to handle millions of concurrent connections in large‑scale internet architectures.

Event-drivenasynchronous I/Obackend optimization
0 likes · 5 min read
How Nginx Achieves Million-Connection Concurrency: Architecture and Optimization Techniques
macrozheng
macrozheng
Dec 3, 2024 · Backend Development

Simplify Feign Calls in Local Development with Custom URL Routing

This article explains how to streamline OpenFeign usage during local development by configuring custom URL routing, detailing the underlying Feign mechanism, creating a custom ImportBeanDefinitionRegistrar, and providing step‑by‑step code examples, testing procedures, and best‑practice recommendations.

Custom Bean RegistrationLocal DevelopmentMicroservices
0 likes · 15 min read
Simplify Feign Calls in Local Development with Custom URL Routing
Code Mala Tang
Code Mala Tang
Nov 20, 2024 · Backend Development

Can Node.js Power Millions of Users? Scaling Strategies Revealed

This article explores whether Node.js can handle millions of concurrent users, explains the core non‑blocking architecture, outlines challenges such as the single‑thread model and memory leaks, and provides practical scaling tactics like clustering, load balancing, caching, and database optimization.

BackendNode.jsScalability
0 likes · 10 min read
Can Node.js Power Millions of Users? Scaling Strategies Revealed