Operations 11 min read

System Capacity Design and Evaluation: Concepts, Metrics, and Practical Steps

This article explains how to design and evaluate system capacity by defining key concepts such as design capacity, TPS, QPS, and concurrency, outlining when capacity assessment is needed, and providing a step‑by‑step methodology with real‑world examples and calculations for accurate performance planning.

Top Architect
Top Architect
Top Architect
System Capacity Design and Evaluation: Concepts, Metrics, and Practical Steps

Background

The article starts with a real‑world scenario of a company sports event where the number of participants and limited time caused capacity problems, illustrating the need for proper capacity design.

Concept

Design capacity is defined as the process of estimating system resources (data volume, concurrency, bandwidth, CPU, memory, storage, etc.) required to meet business needs.

Analysis Process

Key performance indicators are introduced:

TPS (Transactions Per Second) : number of transactions processed each second.

QPS (Queries Per Second) : number of requests handled each second, a common measure of throughput.

Concurrency : number of simultaneous requests the system can handle.

Formulas are provided, e.g., QPS = (Total PV * 80%) / (Seconds per day * 20%) and the relationship QPS = Concurrency / AvgResponseTime.

When to Evaluate System Capacity

Temporary traffic spikes (e.g., promotional events like 618, Double‑11).

Initial system capacity assessment before launch.

Changes in capacity baseline as features and data grow.

Evaluation Steps

Analyze total daily visits (PV/UV) from product, operations, and monitoring data.

Calculate average daily QPS.

Estimate peak‑interval QPS using traffic curves or the 80/20 rule.

Conduct performance stress testing (e.g., using nGrinder or JMeter) to find the maximum QPS a single instance can sustain.

Adjust based on redundancy and desired response time, then determine the required number of instances.

Case Study

A library reservation system example shows how to compute QPS, peak QPS, and required concurrency using the 80/20 rule and a pessimism/optimism factor, resulting in a recommended concurrency of about 200.

Conclusion

The article summarizes the timing, steps, and practical considerations for system capacity design, emphasizing that early and accurate capacity planning prevents performance disasters and ensures smooth operation during traffic surges.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

System DesignPerformance Testingcapacity planningQPS
Top Architect
Written by

Top Architect

Top Architect focuses on sharing practical architecture knowledge, covering enterprise, system, website, large‑scale distributed, and high‑availability architectures, plus architecture adjustments using internet technologies. We welcome idea‑driven, sharing‑oriented architects to exchange and learn together.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.