Tagged articles

high QPS

15 articles · Page 1 of 1

Dec 22, 2025 · Databases

Scaling MySQL to Tens of Millions QPS: Practical Optimization Steps

This guide walks through practical techniques to push MySQL performance from a few thousand to tens of millions of queries per second, covering single‑node tuning, read/write splitting, caching layers, and horizontal/vertical sharding with concrete configuration tips and architecture diagrams.

CachingRead‑Write SplittingSharding

0 likes · 4 min read

Scaling MySQL to Tens of Millions QPS: Practical Optimization Steps

Su San Talks Tech

Jun 12, 2025 · Information Security

Defending Against Million‑QPS Attacks: Rate Limiting, Fingerprinting, and Dynamic Rules

This article explains why a million‑QPS flood can cripple systems, outlines attackers' tactics, and presents a three‑layer defense strategy—including gateway rate limiting with Nginx + Lua, distributed circuit breaking via Sentinel, device fingerprinting, behavior analysis, and a dynamic rule engine—to protect high‑traffic services.

DDoSbehavior analysishigh QPS

0 likes · 14 min read

Defending Against Million‑QPS Attacks: Rate Limiting, Fingerprinting, and Dynamic Rules

Alimama Tech

May 12, 2025 · Artificial Intelligence

Universal Recommendation Model (URM): A General Large‑Model Recall System for Advertising

The article presents the Universal Recommendation Model (URM), a large‑language‑model‑based recall framework that integrates world knowledge and e‑commerce expertise through knowledge injection and prompt‑driven alignment, achieving significant offline recall gains and a 3.1% increase in ad consumption while meeting high‑QPS, low‑latency production constraints.

AdvertisingLarge Language ModelMultimodal

0 likes · 17 min read

Universal Recommendation Model (URM): A General Large‑Model Recall System for Advertising

Xiaohongshu Tech REDtech

Nov 7, 2024 · Artificial Intelligence

RTAMS-GANNS: A Real-Time Adaptive Multi-Stream GPU System for Online Approximate Nearest Neighbor Search

RTAMS‑GANNS, the award‑winning real‑time adaptive multi‑stream GPU system for online approximate nearest neighbor search, eliminates costly memory allocations and serial execution by using a dynamic memory‑block insertion algorithm and separate CUDA streams, cutting latency by 40‑80% and reliably serving over 100 million daily users in production.

GPUVector Insertionapproximate nearest neighbor

0 likes · 19 min read

RTAMS-GANNS: A Real-Time Adaptive Multi-Stream GPU System for Online Approximate Nearest Neighbor Search

Architect's Guide

Aug 24, 2022 · Backend Development

Optimizing Long‑Connection Services with Netty: From Millions of Connections to High QPS

This article summarizes the challenges and optimization techniques for building a high‑performance long‑connection service with Netty, covering non‑blocking I/O, Linux kernel tuning, client‑side testing, VM‑based scaling, data‑structure tweaks, CPU and GC bottlenecks, and the final results of achieving hundreds of thousands of connections and tens of thousands of QPS on a single server.

GC TuningJava NIOLinux Tuning

0 likes · 14 min read

Optimizing Long‑Connection Services with Netty: From Millions of Connections to High QPS

Xiao Lou's Tech Notes

May 25, 2022 · Backend Development

Why Caching Timestamps Can Slash CPU Usage in High‑QPS Java and Go Services

This article explores how naive timestamp retrieval can become a CPU bottleneck under high concurrency, demonstrates cache‑based optimizations used in Alibaba's Cobar and Sentinel projects, presents benchmark results, and proposes an adaptive algorithm to enable or disable caching based on real‑time QPS.

Javahigh QPStimestamp caching

0 likes · 11 min read

Why Caching Timestamps Can Slash CPU Usage in High‑QPS Java and Go Services

Xianyu Technology

Apr 13, 2022 · Big Data

Real-time Multi-system Data Aggregation for Fan Tag System

The Xianyu fan‑tag system solves the challenge of displaying full‑history purchase counts with real‑time updates and low‑latency, high‑throughput queries by daily exporting multi‑system data to a LevelDB‑based KV store, converting schemas, and applying real‑time compensation from transaction and follow‑change messages, merging offline and live data to produce sorted fan lists at ~10 k QPS.

KV storageReal-time Processingdata aggregation

0 likes · 6 min read

Real-time Multi-system Data Aggregation for Fan Tag System

ByteFE

Apr 11, 2022 · Backend Development

ByteDance Wallet Asset Middle Platform Design for 2022 Spring Festival High‑Traffic Reward Distribution

This article details ByteDance's wallet asset middle platform designed for the 2022 Spring Festival, covering eight‑app reward interoperability, high‑QPS challenges, token‑based asynchronous入账, budget control, stability measures, and fund‑safety guarantees, and includes practical solutions for hot‑key handling, budget throttling, and multi‑stage activity isolation.

ByteDanceFund SafetyStability

0 likes · 22 min read

ByteDance Wallet Asset Middle Platform Design for 2022 Spring Festival High‑Traffic Reward Distribution

IT Architects Alliance

Mar 27, 2022 · Backend Development

Simulating a 10‑Billion Red‑Envelope System with Go: From 3K to 6K QPS

This article details a step‑by‑step engineering experiment that reproduces a high‑throughput "red‑envelope" service, outlining the required hardware, software stack, load‑generation logic, monitoring setup, and performance results for handling up to 6 000 QPS on a 100‑million‑user scale.

GoMonitoringbackend

0 likes · 21 min read

Simulating a 10‑Billion Red‑Envelope System with Go: From 3K to 6K QPS

IT Architects Alliance

Jan 8, 2022 · Backend Development

Simulating 10 Billion Red Packet Requests: A Go‑Based High‑QPS Backend Blueprint

This article details a step‑by‑step engineering effort to model and benchmark a high‑throughput red‑packet service capable of handling 10 billion requests, covering target metrics, hardware setup, Go implementation, load generation, monitoring, and performance analysis.

GoLoad Simulationbackend-architecture

0 likes · 22 min read

Simulating 10 Billion Red Packet Requests: A Go‑Based High‑QPS Backend Blueprint

IT Xianyu

Sep 14, 2021 · Backend Development

Design and Implementation of a High‑Throughput 10‑Billion Red‑Envelope System Simulation

This article describes how to design, implement, and evaluate a scalable backend that can simulate 10 billion WeChat red‑envelope requests by supporting up to 1 million concurrent users and handling 30 k–60 k QPS per server using Go, Linux tools, and custom monitoring.

GoSimulationbackend-architecture

0 likes · 18 min read

Design and Implementation of a High‑Throughput 10‑Billion Red‑Envelope System Simulation

High Availability Architecture

Jul 6, 2021 · Backend Development

Tuning a Go Service to Reach 200k QPS: GC Adjustment and UDP Optimizations

The article describes how a Go‑based high‑throughput service was tuned from 80k to over 200k QPS by enlarging the GC heap, reusing UDP connections with sync.Pool, reducing system‑call overhead, and applying several lightweight logging and discovery optimizations.

GC TuningGoUDP

0 likes · 8 min read

Tuning a Go Service to Reach 200k QPS: GC Adjustment and UDP Optimizations

FunTester

Apr 19, 2021 · Operations

How to Add a Soft‑Start Mechanism for High‑QPS Performance Testing in Java

This article explains the concept of soft‑start in performance testing, presents Java implementations for both fixed‑thread and fixed‑QPS models, discusses error‑impact considerations, and provides practical code snippets to gradually ramp up load and improve measurement accuracy for high‑throughput services.

Javaconcurrencyhigh QPS

0 likes · 8 min read

How to Add a Soft‑Start Mechanism for High‑QPS Performance Testing in Java

Ctrip Technology

Feb 25, 2021 · Backend Development

Design and Implementation of a Cache Access Component and Update Platform for High‑QPS Scenarios

This article describes a backend architecture for a high‑traffic e‑commerce project, detailing a cache access component and a cache update platform that use asynchronous messaging, hotspot‑key handling, versioned cache entries, and Redis to achieve low latency, high QPS support and strong data consistency.

Cachingbackenddistributed-systems

0 likes · 18 min read

Design and Implementation of a Cache Access Component and Update Platform for High‑QPS Scenarios

Meituan Technology Team

Dec 20, 2018 · Backend Development

Design and Performance Optimization of LruCache in Meituan DSP System

Meituan’s DSP system boosted high‑QPS ad serving performance by layering an LRU cache in front of Redis, then adding time‑based eviction, sharding the cache into HashLruCache instances to cut lock contention, and employing a zero‑copy, reference‑counted design, ultimately cutting average latency to about 20 % of the original and similarly reducing 99.9th‑percentile delays.

Cache OptimizationHashLruCacheLRUCache

0 likes · 15 min read

Design and Performance Optimization of LruCache in Meituan DSP System