Backend Development 10 min read

Scaling a Backend: From Single Server to Reverse Proxy, Load Balancing, Microservices, Caching, and Partitioning

This article explains how to evolve a simple single‑node backend by adding a reverse proxy, introducing load balancers, scaling databases, adopting micro‑services, leveraging caches and CDNs, using message queues, and applying partitioning techniques to handle massive traffic while maintaining consistency and reliability.

360 Quality & Efficiency

Jan 7, 2020

Scaling a Backend: From Single Server to Reverse Proxy, Load Balancing, Microservices, Caching, and Partitioning

1. Single‑node server + database

This is the most basic backend setup: one server runs the business logic and a database stores persistent data. It works for low traffic, but scaling requires a more powerful server.

2. Add a reverse proxy

A reverse proxy acts like a hotel front desk, intercepting incoming requests, performing health checks, routing to the correct endpoint, handling authentication, and enforcing firewall rules before they reach the actual server.

Health checks to ensure the real server is running

Routing requests to the correct endpoint

Authentication to allow only authorized users

Firewall to restrict access to permitted network segments

3. Introduce a load balancer

Many reverse proxies can also function as load balancers, distributing incoming requests across multiple servers (e.g., two payment servers handling 100 requests per minute, each capable of 50). The load balancer splits traffic so that more servers can be added as demand grows.

4. Scale the database

While adding servers is easy, scaling the database is harder due to consistency requirements. A common approach is a primary‑replica (master‑slave) setup where writes go to a single primary instance and reads are served by replicas, preserving consistency but still limiting write scalability.

5. Adopt microservices

Instead of a monolithic server handling all functions, break the system into independent services, each with its own resources and possibly its own database. Benefits include independent scaling, isolated development teams, and reduced coupling.

6. Cache and CDN

Static assets (images, JavaScript, CSS) can be cached to avoid recomputation on every request. A Content Delivery Network (CDN) distributes cached copies worldwide, delivering content from locations close to users.

7. Message queue

A message queue decouples producers and consumers, allowing tasks to be queued and processed asynchronously by multiple workers, improving responsiveness and enabling horizontal scaling of processing capacity.

Decouples tasks from processors, handling variable load gracefully

Allows scaling of workers on demand without blocking user requests

8. Partitioning

Partitioning splits the application stack into multiple shards, each responsible for a subset of keys or namespaces (e.g., users whose names start with A go to partition A). This enables parallelism across servers, databases, or any component.

9. DNS‑level load balancing

Beyond a single load balancer, DNS can map a domain name to multiple IP addresses, effectively distributing traffic across multiple load balancers for higher capacity and redundancy.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

architecture Microservices caching Message Queue scaling load-balancing

Written by

360 Quality & Efficiency

360 Quality & Efficiency focuses on seamlessly integrating quality and efficiency in R&D, sharing 360’s internal best practices with industry peers to foster collaboration among Chinese enterprises and drive greater efficiency value.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.