load balancing | BestHub

Collection size

708 articles

Page 1 of 36

HelloTech

Nov 15, 2022 · Backend Development

Investigation of Elasticsearch RestClient Load‑Balancing and Traffic Skew Issues

The investigation revealed that Elasticsearch RestClient’s built‑in round‑robin and dead‑node blacklisting redirected traffic from failed data‑node addresses—mistakenly included in the static IP list—onto a single client node, causing severe load imbalance and timeouts, which were eliminated after correcting the IP list.

Backend DevelopmentElasticsearchJava Client

0 likes · 17 min read

Investigation of Elasticsearch RestClient Load‑Balancing and Traffic Skew Issues

JD Tech Talk

Mar 24, 2025 · Artificial Intelligence

MaRCA: Multi‑Agent Reinforcement Learning Computation Allocation for Full‑Chain Ad Serving

This article presents MaRCA, a multi‑agent reinforcement learning framework that allocates computation resources across the full ad‑serving chain by modeling user value, compute consumption, and action rewards, enabling fine‑grained power‑tilting toward high‑quality traffic and achieving significant business gains under strict latency constraints.

AI optimizationMulti-Agentad serving

0 likes · 16 min read

MaRCA: Multi‑Agent Reinforcement Learning Computation Allocation for Full‑Chain Ad Serving

360 Smart Cloud

Jul 14, 2022 · Cloud Computing

Auto Scaling (AS) in Cloud Services: Architecture, Use Cases, and Optimization Strategies

This article explains the concept of elastic auto scaling in cloud services, describes typical scenarios such as high‑elastic web apps and compute‑intensive workloads, details the four‑layer architecture and workflow, and outlines functional features, stability improvements, and future optimization directions.

auto-scalingcloud-computingelasticity

0 likes · 12 min read

Auto Scaling (AS) in Cloud Services: Architecture, Use Cases, and Optimization Strategies

Baidu Intelligent Testing

Mar 22, 2017 · Operations

Load Balancing: Concepts, Mechanisms, and Enterprise Practices

This article explains the principles of load balancing, distinguishes stateless service and stateful data balancing, describes DNS, hardware and software solutions such as F5, HAProxy, LVS, and GSLB, and illustrates real‑world implementations at Alibaba and Tencent while offering practical guidance on sharding, caching, and fault tolerance.

DNSGSLBLVS

0 likes · 18 min read

Load Balancing: Concepts, Mechanisms, and Enterprise Practices

IT Architects Alliance

Jan 9, 2025 · Operations

Load Balancing Strategies for High Availability in Distributed Systems

This article explores the challenges and opportunities of distributed architectures and explains how various static and dynamic load‑balancing strategies, hardware and software balancers, redundancy, health checks, and failover mechanisms together ensure high availability, illustrated with real‑world e‑commerce and live‑streaming case studies and future trends.

distributed systemshigh availabilityload balancing

0 likes · 20 min read

Load Balancing Strategies for High Availability in Distributed Systems

Architect

Feb 10, 2025 · Artificial Intelligence

Evolution of DeepSeek Mixture‑of‑Experts (MoE) Architecture from V1 to V3

This article reviews the development of DeepSeek's Mixture-of-Experts (MoE) models, tracing their evolution from the original DeepSeekMoE V1 through V2 to V3, detailing architectural innovations such as fine‑grained expert segmentation, shared‑expert isolation, load‑balancing losses, device‑limited routing, and the shift from softmax to sigmoid gating.

DeepSeekLLMMixture of Experts

0 likes · 21 min read

Evolution of DeepSeek Mixture‑of‑Experts (MoE) Architecture from V1 to V3

Top Architect

Aug 13, 2024 · Backend Development

Spring Cloud Alibaba and Nacos Service Governance: Architecture, Installation, and Practical Usage

This article explains the evolution from monolithic to microservice architectures, introduces Spring Cloud Alibaba and Nacos for service discovery and governance, provides step‑by‑step installation and configuration instructions, and demonstrates load balancing with Ribbon, including essential code snippets for quick adoption.

NacosSpring Cloudbackend

0 likes · 11 min read

Spring Cloud Alibaba and Nacos Service Governance: Architecture, Installation, and Practical Usage

Mike Chen's Internet Architecture

May 8, 2025 · Operations

Understanding Nginx Forward and Reverse Proxy: Differences and Use Cases

This article explains the concepts of Nginx forward and reverse proxy, compares their roles, client and server perceptions, typical usage scenarios, and highlights why reverse proxy is a core Nginx feature while forward proxy is rarely used without third‑party modules.

Web Serverforward proxyload balancing

0 likes · 5 min read

Understanding Nginx Forward and Reverse Proxy: Differences and Use Cases

Mike Chen's Internet Architecture

Dec 4, 2024 · Backend Development

How Nginx Achieves Million-Connection Concurrency: Architecture and Optimization Techniques

This article explains how Nginx uses an event‑driven, asynchronous non‑blocking I/O model together with epoll/kqueue and various optimization strategies such as keep‑alive, caching, efficient data structures and load balancing to handle millions of concurrent connections in large‑scale internet architectures.

asynchronous I/Obackend optimizationepoll

0 likes · 5 min read

How Nginx Achieves Million-Connection Concurrency: Architecture and Optimization Techniques

Architect's Guide

Mar 20, 2025 · Backend Development

Implementing a Lightweight Service Registry with Eureka for Task Distribution

This article describes how to build a lightweight service registry using Eureka Server that also acts as a client, enabling a task dispatcher to discover and load‑balance task executor instances without deploying a separate registry, including Maven dependencies, Spring annotations, Feign client definitions, and troubleshooting steps.

Backend DevelopmentEurekaSpring Cloud

0 likes · 11 min read

Implementing a Lightweight Service Registry with Eureka for Task Distribution

DeWu Technology

Nov 13, 2024 · Backend Development

Evolution of Rainbow Bridge Architecture: Building a Self‑Managed Metadata Center and SDK Enhancements

The new Rainbow Bridge architecture replaces the SLB‑based load‑balancing model with a self‑managed, multi‑AZ metadata center and enhanced SDK that aggregates node health, provides zone‑aware weighted routing, supports rapid failover and manual overrides, and delivers faster recovery and scalable traffic handling.

distributed systemshigh availabilityload balancing

0 likes · 11 min read

Evolution of Rainbow Bridge Architecture: Building a Self‑Managed Metadata Center and SDK Enhancements

Java Tech Enthusiast

Apr 10, 2025 · Backend Development

Understanding Nginx: HTTP Server, Reverse Proxy, and Scalable Architecture

The article shows how adding HTML tags turns a plain‑text file into a web page, explains the need for an HTTP service to serve it, introduces reverse proxies for load‑balancing and address hiding, and outlines Nginx’s event‑driven architecture with master and worker processes, shared memory, proxy cache, multi‑protocol support, configurable modules, and scaling options, while warning of a single‑instance failure and recommending cluster mode.

Master Processhttp serverload balancing

0 likes · 9 min read

Understanding Nginx: HTTP Server, Reverse Proxy, and Scalable Architecture

Java Tech Enthusiast

Jan 10, 2025 · Backend Development

RocketMQ Consumer Scaling and Load Balancing Strategies

In RocketMQ, adding consumers speeds consumption only when they are fewer than MessageQueues, while pull delays arise from ProcessQueue thresholds or ordered‑lock timeouts; slow processing often stems from heavy business logic or external calls, and load can be balanced using average, round‑robin, custom, machine‑room, nearby‑room, or consistent‑hash allocation strategies.

ConsumerMessage QueueRocketMQ

0 likes · 9 min read

RocketMQ Consumer Scaling and Load Balancing Strategies

Java Tech Enthusiast

May 18, 2022 · Operations

Nginx Reverse Proxy, Load Balancing, and URL Rewrite Guide

This article explains how Nginx can act as a reverse proxy, distribute traffic with various load‑balancing methods, serve static assets separately from dynamic back‑ends, and rewrite friendly URLs, providing clear configuration snippets for each technique.

load balancingnginxreverse-proxy

0 likes · 10 min read

Nginx Reverse Proxy, Load Balancing, and URL Rewrite Guide

DaTaobao Tech

Aug 9, 2024 · Cloud Native

Deep Dive into RocketMQ: Storage Architecture and Consumer Processing Flow

RocketMQ stores messages in a sequential CommitLog while using a ConsumerQueue index for fast random access, and employs two looping threads—RebalanceService for load‑balancing queue assignments and PullMessageService for continuously pulling messages and dispatching them to user callbacks via a thread‑pooled consumer pipeline.

ConsumerJavaMessage Queue

0 likes · 12 min read

Deep Dive into RocketMQ: Storage Architecture and Consumer Processing Flow

Tencent Music Tech Team

Oct 31, 2023 · Cloud Native

Advanced Istio Best Practices – Locality Routing and Service Mesh Optimization

The article by delphisfang offers a concise, step‑by‑step guide to mastering Istio’s locality‑aware routing, explaining the three‑evidence learning method, the priority algorithm, required DestinationRule and outlier detection settings, how Envoy discovers locality, and tips for simplifying the Pilot‑Envoy mesh architecture.

EnvoyKubernetesLocality Routing

0 likes · 17 min read

Advanced Istio Best Practices – Locality Routing and Service Mesh Optimization

Didi Tech

Jan 9, 2024 · Big Data

Introducing Apache Pulsar: Technical Benefits and Solutions for Didi Big Data Messaging System

Apache Pulsar, a cloud‑native distributed messaging platform, solves Didi Big Data’s DKafka bottlenecks by separating compute and storage, using sequential log writes, heterogeneous disks, multi‑level caching, bundle‑based load balancing and automatic scaling, dramatically improving stability while introducing richer monitoring complexity.

Apache PulsarDKafkabig data

0 likes · 17 min read

Introducing Apache Pulsar: Technical Benefits and Solutions for Didi Big Data Messaging System

Xiaohongshu Tech REDtech

Mar 6, 2025 · Backend Development

ROFF: A High‑Performance Seven‑Layer Rust‑Based Gateway with TLS Offload, QUIC/HTTP3, and Dynamic Module System

ROFF is a Rust‑implemented, seven‑layer gateway that delivers high‑throughput load balancing with memory‑safe performance, TLS hardware offload, native QUIC/HTTP3 support, a hot‑reload/upgrade mechanism, and an extensible module system allowing over thirty built‑in filters and custom Rust macros.

HTTP/3QUICRust

0 likes · 28 min read

ROFF: A High‑Performance Seven‑Layer Rust‑Based Gateway with TLS Offload, QUIC/HTTP3, and Dynamic Module System

iQIYI Technical Product Team

Jul 29, 2022 · Cloud Native

Design and Implementation of Graceful Service Deployment in Spring Cloud Microservices

The article presents a Spring Cloud micro‑service solution that combines pluggable warm‑up modules and delayed deregistration to ensure graceful startup and shutdown, preventing premature object destruction, late offline calls, and early registration, thereby achieving over 99.99 % deployment success and stable request latency.

Graceful DeploymentSpring CloudWarm-up

0 likes · 8 min read

Design and Implementation of Graceful Service Deployment in Spring Cloud Microservices

iQIYI Technical Product Team

Nov 26, 2021 · Backend Development

Analysis and Solutions for Load‑Balancing Issues in QLB‑4 Based TFServing Service Calls

The investigation of QLB‑4‑based TFServing calls revealed uneven traffic, stale routing after scaling, and idle servers due to layer‑4 hash routing, leading the team to replace QLB‑4 with a Consul‑driven client‑side load‑balancer that dynamically pools servers, eliminates restarts, and cuts GPU waste.

ConsulQLB-4TFServing

0 likes · 11 min read

Analysis and Solutions for Load‑Balancing Issues in QLB‑4 Based TFServing Service Calls