Topic: load balancing

Collection size: 708 articles
HelloTech
Nov 15, 2022 · Backend Development

Investigation of Elasticsearch RestClient Load‑Balancing and Traffic Skew Issues

The investigation found that the static IP list given to the Elasticsearch RestClient mistakenly included data‑node addresses that had failed. The client's built‑in round‑robin selection and dead‑node blacklisting redirected the traffic meant for those addresses onto a single client node, causing severe load imbalance and timeouts. Both disappeared once the IP list was corrected.

Backend Development · Elasticsearch · Java Client
17 min read
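The traffic skew described in this summary can be reproduced with a toy model. The sketch below is not the RestClient implementation: it assumes a simplified selector that picks a host by request counter and, when that host is dead, fails over to the next live host in list order. The host names and the list layout are hypothetical.

```python
from collections import Counter


def simulate(hosts, dead, requests):
    """Pick hosts[i % n] for request i; if that host is dead, fail over
    to the next live host in list order. Returns requests served per host."""
    served = Counter()
    n = len(hosts)
    for i in range(requests):
        for step in range(n):
            host = hosts[(i + step) % n]
            if host not in dead:
                served[host] += 1
                break
        else:
            raise RuntimeError("no live hosts")
    return served


# Hypothetical static list: three data-node addresses added by mistake
# (all unreachable) followed by three healthy client nodes.
hosts = ["data-1", "data-2", "data-3", "client-1", "client-2", "client-3"]
dead = {"data-1", "data-2", "data-3"}

skewed = simulate(hosts, dead, requests=600)
fixed = simulate(["client-1", "client-2", "client-3"], set(), requests=600)

print("with bad addresses:", dict(skewed))
print("after correction:  ", dict(fixed))
```

In this model the first live host after the dead block absorbs every request aimed at a dead address, so it serves four times the load of its peers. With the bad addresses removed, the rotation is even again.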
JD Tech Talk
Mar 24, 2025 · Artificial Intelligence

MaRCA: Multi‑Agent Reinforcement Learning Computation Allocation for Full‑Chain Ad Serving

This article presents MaRCA, a multi‑agent reinforcement learning framework that allocates computation resources across the full ad‑serving chain. It models user value, compute consumption, and action rewards so that compute can be tilted toward high‑quality traffic at fine granularity, and it reports significant business gains under strict latency constraints.

AI optimization · Multi-Agent · ad serving
16 min read
360 Smart Cloud
Jul 14, 2022 · Cloud Computing

Auto Scaling (AS) in Cloud Services: Architecture, Use Cases, and Optimization Strategies

This article explains the concept of elastic auto scaling in cloud services, describes typical scenarios such as high‑elastic web apps and compute‑intensive workloads, details the four‑layer architecture and workflow, and outlines functional features, stability improvements, and future optimization directions.

auto-scaling · cloud-computing · elasticity
12 min read
Baidu Intelligent Testing
Mar 22, 2017 · Operations

Load Balancing: Concepts, Mechanisms, and Enterprise Practices

This article explains the principles of load balancing, distinguishes stateless service and stateful data balancing, describes DNS, hardware and software solutions such as F5, HAProxy, LVS, and GSLB, and illustrates real‑world implementations at Alibaba and Tencent while offering practical guidance on sharding, caching, and fault tolerance.

DNS · GSLB · LVS
18 min read
IT Architects Alliance
Jan 9, 2025 · Operations

Load Balancing Strategies for High Availability in Distributed Systems

This article explores the challenges and opportunities of distributed architectures and explains how various static and dynamic load‑balancing strategies, hardware and software balancers, redundancy, health checks, and failover mechanisms together ensure high availability, illustrated with real‑world e‑commerce and live‑streaming case studies and future trends.

distributed systems · high availability · load balancing
20 min read
Architect
Feb 10, 2025 · Artificial Intelligence

Evolution of DeepSeek Mixture‑of‑Experts (MoE) Architecture from V1 to V3

This article reviews the development of DeepSeek's Mixture-of-Experts (MoE) models, tracing their evolution from the original DeepSeekMoE V1 through V2 to V3, detailing architectural innovations such as fine‑grained expert segmentation, shared‑expert isolation, load‑balancing losses, device‑limited routing, and the shift from softmax to sigmoid gating.

DeepSeek · LLM · Mixture of Experts
21 min read
Top Architect
Aug 13, 2024 · Backend Development

Spring Cloud Alibaba and Nacos Service Governance: Architecture, Installation, and Practical Usage

This article explains the evolution from monolithic to microservice architectures, introduces Spring Cloud Alibaba and Nacos for service discovery and governance, provides step‑by‑step installation and configuration instructions, and demonstrates load balancing with Ribbon, including essential code snippets for quick adoption.

Nacos · Spring Cloud · backend
11 min read
Mike Chen's Internet Architecture
May 8, 2025 · Operations

Understanding Nginx Forward and Reverse Proxy: Differences and Use Cases

This article explains the concepts of Nginx forward and reverse proxy, compares their roles, client and server perceptions, typical usage scenarios, and highlights why reverse proxy is a core Nginx feature while forward proxy is rarely used without third‑party modules.

Web Server · forward proxy · load balancing
5 min read
Mike Chen's Internet Architecture
Dec 4, 2024 · Backend Development

How Nginx Achieves Million-Connection Concurrency: Architecture and Optimization Techniques

This article explains how Nginx uses an event‑driven, asynchronous non‑blocking I/O model together with epoll/kqueue and various optimization strategies such as keep‑alive, caching, efficient data structures and load balancing to handle millions of concurrent connections in large‑scale internet architectures.

asynchronous I/O · backend optimization · epoll
5 min read
Architect's Guide
Mar 20, 2025 · Backend Development

Implementing a Lightweight Service Registry with Eureka for Task Distribution

This article describes how to build a lightweight service registry using Eureka Server that also acts as a client, enabling a task dispatcher to discover and load‑balance task executor instances without deploying a separate registry, including Maven dependencies, Spring annotations, Feign client definitions, and troubleshooting steps.

Backend Development · Eureka · Spring Cloud
11 min read
DeWu Technology
Nov 13, 2024 · Backend Development

Evolution of Rainbow Bridge Architecture: Building a Self‑Managed Metadata Center and SDK Enhancements

The new Rainbow Bridge architecture replaces the SLB‑based load‑balancing model with a self‑managed, multi‑AZ metadata center and enhanced SDK that aggregates node health, provides zone‑aware weighted routing, supports rapid failover and manual overrides, and delivers faster recovery and scalable traffic handling.

distributed systems · high availability · load balancing
11 min read
Java Tech Enthusiast
Apr 10, 2025 · Backend Development

Understanding Nginx: HTTP Server, Reverse Proxy, and Scalable Architecture

The article starts from a plain‑text file that becomes a web page once HTML tags are added, then explains why an HTTP service is needed to serve it and how a reverse proxy adds load balancing and hides back‑end addresses. It outlines Nginx's event‑driven architecture (master and worker processes, shared memory, proxy cache, multi‑protocol support, configurable modules, and scaling options), warns that a single instance is a single point of failure, and recommends cluster mode.

Master Process · http server · load balancing
9 min read
Java Tech Enthusiast
Jan 10, 2025 · Backend Development

RocketMQ Consumer Scaling and Load Balancing Strategies

In RocketMQ, adding consumers speeds consumption only when they are fewer than MessageQueues, while pull delays arise from ProcessQueue thresholds or ordered‑lock timeouts; slow processing often stems from heavy business logic or external calls, and load can be balanced using average, round‑robin, custom, machine‑room, nearby‑room, or consistent‑hash allocation strategies.

Consumer · Message Queue · RocketMQ
9 min read
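The first claim in this summary, that extra consumers only help while there are fewer consumers than MessageQueues, follows from how an average allocation divides queues. Below is a minimal sketch of that idea. It is a simplified illustration, not RocketMQ's source, and the function name, queue IDs, and consumer IDs are hypothetical.

```python
def allocate_averagely(queues, consumers, me):
    """Return the contiguous slice of `queues` assigned to consumer `me`.

    Queues are split as evenly as possible; when the division is not
    exact, the first consumers in the list take one extra queue each.
    """
    index = consumers.index(me)
    base, extra = divmod(len(queues), len(consumers))
    size = base + (1 if index < extra else 0)
    start = index * base + min(index, extra)
    return queues[start:start + size]


queues = [0, 1, 2, 3]  # four MessageQueues of one topic

# Three consumers: every consumer gets work.
three = ["c1", "c2", "c3"]
plan_three = {c: allocate_averagely(queues, three, c) for c in three}

# Six consumers: only four can own a queue, the other two sit idle.
six = ["c1", "c2", "c3", "c4", "c5", "c6"]
plan_six = {c: allocate_averagely(queues, six, c) for c in six}

print(plan_three)
print(plan_six)
```

Because each queue is owned by exactly one consumer in the group, any consumer beyond the queue count receives an empty assignment and adds no throughput.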
Java Tech Enthusiast
May 18, 2022 · Operations

Nginx Reverse Proxy, Load Balancing, and URL Rewrite Guide

This article explains how Nginx can act as a reverse proxy, distribute traffic with various load‑balancing methods, serve static assets separately from dynamic back‑ends, and rewrite friendly URLs, providing clear configuration snippets for each technique.

load balancing · nginx · reverse-proxy
10 min read
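Weighted round‑robin is one of the load‑balancing methods such a guide typically covers. The sketch below shows the selection logic of the smooth weighted variant in plain Python rather than as an Nginx configuration. The server names and weights are hypothetical, and the code illustrates the technique only; it does not reproduce Nginx internals.

```python
from itertools import islice


def smooth_weighted_round_robin(weights):
    """Yield server names forever, in proportion to their weights.

    Each round every server's running score grows by its weight, the
    highest score wins, and the winner's score drops by the total weight.
    """
    total = sum(weights.values())
    current = {name: 0 for name in weights}
    while True:
        for name, weight in weights.items():
            current[name] += weight
        chosen = max(current, key=current.get)
        current[chosen] -= total
        yield chosen


picker = smooth_weighted_round_robin({"app-a": 5, "app-b": 1, "app-c": 1})
first_cycle = list(islice(picker, 7))
print(first_cycle)
```

Over one cycle of seven picks, `app-a` is chosen five times and the two lighter servers once each, with the lighter servers spread through the cycle instead of queued at the end.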
DaTaobao Tech
Aug 9, 2024 · Cloud Native

Deep Dive into RocketMQ: Storage Architecture and Consumer Processing Flow

RocketMQ stores messages in a sequential CommitLog while using a ConsumerQueue index for fast random access, and employs two looping threads—RebalanceService for load‑balancing queue assignments and PullMessageService for continuously pulling messages and dispatching them to user callbacks via a thread‑pooled consumer pipeline.

Consumer · Java · Message Queue
12 min read
Tencent Music Tech Team
Oct 31, 2023 · Cloud Native

Advanced Istio Best Practices – Locality Routing and Service Mesh Optimization

The article by delphisfang offers a concise, step‑by‑step guide to mastering Istio’s locality‑aware routing, explaining the three‑evidence learning method, the priority algorithm, required DestinationRule and outlier detection settings, how Envoy discovers locality, and tips for simplifying the Pilot‑Envoy mesh architecture.

Envoy · Kubernetes · Locality Routing
17 min read
Didi Tech
Jan 9, 2024 · Big Data

Introducing Apache Pulsar: Technical Benefits and Solutions for Didi Big Data Messaging System

Apache Pulsar, a cloud‑native distributed messaging platform, addresses the DKafka bottlenecks in Didi's big‑data messaging system by separating compute from storage and by using sequential log writes, heterogeneous disks, multi‑level caching, bundle‑based load balancing, and automatic scaling. The result is markedly better stability, at the cost of more complex monitoring.

Apache Pulsar · DKafka · big data
17 min read
Xiaohongshu Tech REDtech
Mar 6, 2025 · Backend Development

ROFF: A High‑Performance Seven‑Layer Rust‑Based Gateway with TLS Offload, QUIC/HTTP3, and Dynamic Module System

ROFF is a layer‑7 gateway implemented in Rust that provides high‑throughput load balancing with memory safety, TLS hardware offload, native QUIC/HTTP3 support, a hot‑reload and upgrade mechanism, and an extensible module system with more than thirty built‑in filters and custom extensions via Rust macros.

HTTP/3 · QUIC · Rust
28 min read
iQIYI Technical Product Team
Jul 29, 2022 · Cloud Native

Design and Implementation of Graceful Service Deployment in Spring Cloud Microservices

The article presents a Spring Cloud micro‑service solution that combines pluggable warm‑up modules and delayed deregistration to ensure graceful startup and shutdown, preventing premature object destruction, late offline calls, and early registration, thereby achieving over 99.99 % deployment success and stable request latency.

Graceful Deployment · Spring Cloud · Warm-up
8 min read
iQIYI Technical Product Team
Nov 26, 2021 · Backend Development

Analysis and Solutions for Load‑Balancing Issues in QLB‑4 Based TFServing Service Calls

The investigation of QLB‑4‑based TFServing calls revealed uneven traffic, stale routing after scaling, and idle servers due to layer‑4 hash routing, leading the team to replace QLB‑4 with a Consul‑driven client‑side load‑balancer that dynamically pools servers, eliminates restarts, and cuts GPU waste.

Consul · QLB-4 · TFServing
11 min read
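The idle‑server symptom in this summary can be illustrated with a small model. The sketch below assumes that a layer‑4 balancer pins each long‑lived connection to one server by hashing its source address, while a client‑side balancer rotates per request over the current pool. The addresses, server names, and the CRC32 hash are hypothetical stand‑ins, not details of QLB‑4 or of the team's Consul‑based client.

```python
import zlib
from collections import Counter


def l4_hash_route(connections, servers):
    """Pin each connection to one server by hashing its source address."""
    return {
        conn: servers[zlib.crc32(conn.encode()) % len(servers)]
        for conn in connections
    }


servers = [f"tfserving-{i}" for i in range(8)]
# Assumed: three callers, each holding one long-lived connection.
connections = ["10.0.0.1:40001", "10.0.0.2:40002", "10.0.0.3:40003"]
requests = 2400

# Layer-4 hashing: every request on a connection lands on its pinned server.
pinned = l4_hash_route(connections, servers)
l4_load = Counter(pinned[connections[i % len(connections)]] for i in range(requests))

# Client-side balancing: each request rotates over the whole pool.
client_load = Counter(servers[i % len(servers)] for i in range(requests))

idle = [s for s in servers if l4_load[s] == 0]
print("idle under L4 hashing:", len(idle), "of", len(servers))
print("client-side load:", dict(client_load))
```

With three pinned connections and eight servers, at least five servers receive no traffic whatever the hash values are, and servers added later stay idle until connections are re‑established. Per‑request rotation on the client avoids both effects.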