Operations 7 min read

Common Load Balancing Algorithms and Their Characteristics

This article explains various load‑balancing scheduling algorithms—including Round Robin, Weighted Round Robin, Least Connection, Weighted Least Connection, Adaptive Agent‑Based balancing, Fixed Weighted, Weighted Response, and Source IP Hash—detailing how each distributes traffic and handles server capacity differences.

Practical DevOps Architecture

Mar 20, 2015

Common Load Balancing Algorithms and Their Characteristics

Load balancers can provide many load‑balancing methods, also known as scheduling methods or algorithms:

Round Robin

This method distributes incoming requests sequentially across all servers in the pool. It works best when servers have similar capacity and load, but can overload weaker servers if they receive requests they cannot handle.

Weighted Round Robin

This algorithm improves simple round robin by assigning a weight to each server based on its processing capability; servers with higher weights receive more consecutive requests, allowing stronger servers to handle a larger share of traffic.

Least Connection

Requests are sent to the server with the fewest active connections at the moment, preventing a server that handles long‑lasting connections from becoming overloaded even if it receives fewer total requests.

Least Connection Slow Start Time

When a new server joins the pool, a slow‑start period limits the number of connections it can receive, allowing the server to ramp up gradually without being overwhelmed.

Weighted Least Connection

This method combines connection count with server‑specific weights, offering a fair distribution that accounts for both current load and predefined capacity differences.

Agent Based Adaptive Balancing

The load balancer periodically collects a load metric (0‑99) from each server via an HTTP GET request. It adjusts weights dynamically based on these metrics, falling back to static weighted round robin when traffic is low or metrics are unreliable.

Fixed Weighted

Only the server with the highest weight is used unless it becomes unavailable, in which case the next highest‑weight server takes over; each server’s weight reflects its priority.

Weighted Response

Weights are calculated from the response times measured during health checks; the total response time of all servers is used to recompute individual weights roughly every 15 seconds.

Source IP Hash

This technique hashes the client’s source IP address to consistently map the client to the same backend server, eliminating the need to store client‑IP state but potentially causing uneven load distribution.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

Operations load balancing Round Robin Scheduling Algorithms least connection weighted algorithms

Written by

Practical DevOps Architecture

Hands‑on DevOps operations using Docker, K8s, Jenkins, and Ansible—empowering ops professionals to grow together through sharing, discussion, knowledge consolidation, and continuous improvement.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.