Backend Development 6 min read

High Concurrency Traffic Control and Rate Limiting Techniques

This article discusses practical approaches to handling massive traffic spikes—defining high‑traffic scenarios, common mitigation methods such as caching, degradation, and various rate‑limiting algorithms (counter, sliding window, leaky bucket, token bucket), including Guava's RateLimiter and brief notes on distributed implementations.

Java Captain

Jun 27, 2018

High Concurrency Traffic Control and Rate Limiting Techniques

In real projects the author has experienced peak traffic of over 50,000 QPS and stress‑tested up to 100,000 QPS, prompting a discussion on handling high‑concurrency traffic.

High traffic is defined not by a fixed number but by any load that stresses the system and degrades performance.

Common mitigation techniques include caching (bringing data closer to the application to reduce DB hits), degradation (downgrading non‑core services during spikes), and rate limiting (controlling request volume within a time window to protect system stability).

Rate‑limiting methods covered are:

Counter : a simple algorithm that counts requests in a fixed interval and resets the counter at the interval boundary.

Sliding Window : divides time into small slices that move forward, avoiding the counter’s boundary‑spike problem.

Leaky Bucket : models a bucket with constant outflow; excess requests overflow when the bucket is full.

Token Bucket : improves on the leaky bucket by generating tokens at a steady rate, allowing bursts while still limiting overall throughput.

Illustrative images for each algorithm are included:

Guava’s RateLimiter implements the token‑bucket algorithm; by specifying a desired QPS, it automatically generates tokens and applications acquire permits via tryAcquire().

While the article focuses on single‑machine rate limiting, it notes that distributed scenarios often combine technologies such as Nginx+Lua or Redis+Lua, but these are not detailed here.

In summary, effective traffic control combines caching, degradation, and appropriate rate‑limiting algorithms to protect system stability under massive request loads.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

Java high concurrency traffic control Guava rate limiting Backend Performance

Written by

Java Captain

Focused on Java technologies: SSM, the Spring ecosystem, microservices, MySQL, MyCat, clustering, distributed systems, middleware, Linux, networking, multithreading; occasionally covers DevOps tools like Jenkins, Nexus, Docker, ELK; shares practical tech insights and is dedicated to full‑stack Java development.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.