Backend Development 11 min read

How to Build Scalable Web Services: Cloning, Databases, Caching, and Async Strategies

This guide explains how to design horizontally scalable web services by using load balancing, shared code deployment, centralized session stores, database choices, in‑memory caching patterns, and asynchronous processing to achieve high performance and reliability.

21CTO

Apr 28, 2016

How to Build Scalable Web Services: Cloning, Databases, Caching, and Async Strategies

Scalable web services rely on a load‑balancing layer that distributes user requests across a pool of application servers. Each server must run the same codebase and store no user‑specific data locally.

User sessions should be kept in a centralized store accessible to all servers, such as an external database or a persistent cache (e.g., Redis). The store must reside outside the application servers.

Deployments can be synchronized with tools like Capistrano, ensuring every server runs the latest code. After sharing the codebase and externalizing sessions, create a master image (e.g., an AWS AMI) to clone new instances quickly.

Chapter 2: Database

Even with horizontal scaling, a single MySQL database can become a bottleneck. Two approaches are common:

Persist with MySQL, add a DBA, enable master‑slave replication, increase memory, and consider sharding or denormalization—though this can become costly.

Denormalize early and move to a NoSQL solution (MongoDB, CouchDB) or use MySQL without joins, handling joins in application code. Eventually a cache will still be needed.

Chapter 3: Caching

Introduce an in‑memory cache (Memcached or Redis) to alleviate database load. Never use file‑based caches for scalable systems.

A cache acts as a key‑value layer between the application and the data store: read from cache first, fall back to the primary source only on a miss. This dramatically speeds up reads and writes.

Two caching patterns:

Query result caching – store full result sets; requires careful invalidation when underlying data changes.

Object caching – store assembled objects or full class instances; simplifies invalidation and enables asynchronous processing.

Object caching is especially powerful for session data, fully rendered articles, activity streams, and user‑friend relationships. Redis offers persistence and rich data structures, while Memcached provides pure scalability.

Chapter 4: Asynchronous Processing

Long‑running tasks should be handled asynchronously. One pattern is to pre‑compute results (e.g., generate static HTML) and serve them instantly. Another is to enqueue high‑cost jobs (via RabbitMQ, ActiveMQ, or a Redis list) and notify the front‑end when they complete.

Using asynchronous workers allows the back‑end to scale virtually without limit while keeping the front‑end responsive, delivering a superior user experience.

If a task is time‑consuming, implement it asynchronously.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

Backend deployment asynchronous Caching Web services

Written by

21CTO

21CTO (21CTO.com) offers developers community, training, and services, making it your go‑to learning and service platform.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.