Mastering Load Balancing: Algorithms, Nginx Setup, and Real‑World Use Cases
This article explains load balancing fundamentals, shows how to configure Nginx for a Tomcat server pool, compares common balancing algorithms, describes OSI‑layer classifications, and outlines typical scenarios such as web farms, application clusters, databases, CDN, and cloud environments.
What Is Load Balancing?
Load balancing is a technique that distributes network or application traffic across multiple servers to improve performance, reliability, and scalability.
Simple Nginx Example
```nginx
upstream tomcat_servers {
    server tomcat1.example.com:8080;
    server tomcat2.example.com:8080;
    server tomcat3.example.com:8080;
}
```

Benefits of Load Balancing
Performance: distributes traffic across servers to reduce the load on each one, improving response time and throughput.
Reliability: detects failed nodes and reroutes traffic around them, increasing fault tolerance.
Scalability: allows servers to be added or removed dynamically to handle changing demand.
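The reliability benefit is visible in Nginx's passive health checks: the `max_fails` and `fail_timeout` parameters on a `server` line take a backend out of rotation after repeated failures and retry it later. A minimal sketch, with illustrative values:

```nginx
upstream tomcat_servers {
    # After 3 failed attempts within 30s, remove the server from
    # rotation for 30s, then try it again (passive health check).
    server tomcat1.example.com:8080 max_fails=3 fail_timeout=30s;
    server tomcat2.example.com:8080 max_fails=3 fail_timeout=30s;

    # Last-resort server: receives traffic only when all the
    # others are marked unavailable.
    server tomcat3.example.com:8080 backup;
}
```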
How Load Balancers Work
Load balancers act as reverse proxies, hiding backend details from clients. They receive incoming requests and forward them to backend servers based on a selected algorithm.
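In Nginx, this reverse-proxy role is a `server` block that forwards requests to an upstream group such as the `tomcat_servers` pool defined above. A minimal sketch (the listen port and headers are illustrative choices):

```nginx
server {
    listen 80;

    location / {
        # Forward each request to one member of the upstream group;
        # Nginx picks the server according to the chosen algorithm.
        proxy_pass http://tomcat_servers;

        # Pass the original host and client address to the backend.
        proxy_set_header Host $host;
        proxy_set_header X-Real-IP $remote_addr;
    }
}
```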
Common Load‑Balancing Algorithms
Round Robin – sequentially assigns each request to the next server. Simple and fair but ignores actual load.
Least Connections – sends traffic to the server with the fewest active connections, balancing load dynamically.
Least Response Time – chooses the server with the shortest measured response time, improving overall latency.
Hashing – computes a hash from request attributes (e.g., client IP) and routes consistently to the same server, preserving session affinity.
Weighted Round Robin – assigns a weight to each server based on capacity and distributes requests proportionally.
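Several of these algorithms map directly onto Nginx upstream directives: round robin is the default, and `least_conn`, `ip_hash`, and per-server `weight` are available in open-source Nginx (least response time, via `least_time`, requires the commercial NGINX Plus). A sketch of the open-source options:

```nginx
# Least Connections: prefer the server with the fewest active connections.
upstream app_least_conn {
    least_conn;
    server tomcat1.example.com:8080;
    server tomcat2.example.com:8080;
}

# Hashing on client IP: the same client lands on the same server,
# preserving session affinity.
upstream app_ip_hash {
    ip_hash;
    server tomcat1.example.com:8080;
    server tomcat2.example.com:8080;
}

# Weighted Round Robin: tomcat1 receives roughly 3 of every 4 requests.
upstream app_weighted {
    server tomcat1.example.com:8080 weight=3;
    server tomcat2.example.com:8080 weight=1;
}
```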
Layer‑Based Classification
Load balancers can operate at different OSI layers:
Layer 2 (Data Link) – MAC‑address based distribution, typically within a LAN.
Layer 3 (Network) – IP‑address routing, useful across subnets or data centers.
Layer 4 (Transport) – TCP/UDP port‑based routing, handling generic traffic.
Layer 7 (Application) – HTTP/HTTPS‑level routing, allowing content‑based decisions such as URL or header inspection.
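Nginx itself illustrates the Layer 4 versus Layer 7 split: the `stream` module balances raw TCP/UDP connections without inspecting their contents, while the `http` block can route on URLs or headers. A sketch, assuming illustrative ports and upstream names:

```nginx
# Layer 4: TCP pass-through; Nginx never parses the protocol.
stream {
    upstream mysql_replicas {
        server db1.example.com:3306;
        server db2.example.com:3306;
    }
    server {
        listen 3306;
        proxy_pass mysql_replicas;
    }
}

# Layer 7: content-based routing on the request path
# (tomcat_servers and static_servers are assumed upstream groups).
http {
    server {
        listen 80;
        location /api/ { proxy_pass http://tomcat_servers; }
        location /     { proxy_pass http://static_servers; }
    }
}
```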
Typical Application Scenarios
Web server farms – spread HTTP traffic across multiple web nodes.
Application server clusters – balance requests for complex business logic.
Database clusters – distribute query load among replica databases.
Content delivery and streaming – manage traffic for CDN and media services.
Cloud and virtualized environments – route traffic among VM instances or containers.
This article has been distilled and summarized from source material and republished for learning and reference. If you believe it infringes your rights, please contact us and we will review it promptly.
Architect Chen
Sharing over a decade of architecture experience from Baidu, Alibaba, and Tencent.
