Tagged articles
15 articles
Page 1 of 1
Su San Talks Tech
Su San Talks Tech
Jun 12, 2025 · Information Security

Defending Against Million‑QPS Attacks: Rate Limiting, Fingerprinting, and Dynamic Rules

This article explains why a million‑QPS flood can cripple systems, outlines attackers' tactics, and presents a three‑layer defense strategy—including gateway rate limiting with Nginx + Lua, distributed circuit breaking via Sentinel, device fingerprinting, behavior analysis, and a dynamic rule engine—to protect high‑traffic services.

DDoSbehavior analysishigh QPS
0 likes · 14 min read
Defending Against Million‑QPS Attacks: Rate Limiting, Fingerprinting, and Dynamic Rules
Alimama Tech
Alimama Tech
May 12, 2025 · Artificial Intelligence

Universal Recommendation Model (URM): A General Large‑Model Recall System for Advertising

The article presents the Universal Recommendation Model (URM), a large‑language‑model‑based recall framework that integrates world knowledge and e‑commerce expertise through knowledge injection and prompt‑driven alignment, achieving significant offline recall gains and a 3.1% increase in ad consumption while meeting high‑QPS, low‑latency production constraints.

AdvertisingPrompt engineeringhigh QPS
0 likes · 17 min read
Universal Recommendation Model (URM): A General Large‑Model Recall System for Advertising
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Nov 7, 2024 · Artificial Intelligence

RTAMS-GANNS: A Real-Time Adaptive Multi-Stream GPU System for Online Approximate Nearest Neighbor Search

RTAMS‑GANNS, the award‑winning real‑time adaptive multi‑stream GPU system for online approximate nearest neighbor search, eliminates costly memory allocations and serial execution by using a dynamic memory‑block insertion algorithm and separate CUDA streams, cutting latency by 40‑80% and reliably serving over 100 million daily users in production.

GPUPerformance EvaluationVector Insertion
0 likes · 19 min read
RTAMS-GANNS: A Real-Time Adaptive Multi-Stream GPU System for Online Approximate Nearest Neighbor Search
Architect's Guide
Architect's Guide
Aug 24, 2022 · Backend Development

Optimizing Long‑Connection Services with Netty: From Millions of Connections to High QPS

This article summarizes the challenges and optimization techniques for building a high‑performance long‑connection service with Netty, covering non‑blocking I/O, Linux kernel tuning, client‑side testing, VM‑based scaling, data‑structure tweaks, CPU and GC bottlenecks, and the final results of achieving hundreds of thousands of connections and tens of thousands of QPS on a single server.

GC tuningJava NIOLinux Tuning
0 likes · 14 min read
Optimizing Long‑Connection Services with Netty: From Millions of Connections to High QPS
Xiao Lou's Tech Notes
Xiao Lou's Tech Notes
May 25, 2022 · Backend Development

Why Caching Timestamps Can Slash CPU Usage in High‑QPS Java and Go Services

This article explores how naive timestamp retrieval can become a CPU bottleneck under high concurrency, demonstrates cache‑based optimizations used in Alibaba's Cobar and Sentinel projects, presents benchmark results, and proposes an adaptive algorithm to enable or disable caching based on real‑time QPS.

Javahigh QPStimestamp caching
0 likes · 11 min read
Why Caching Timestamps Can Slash CPU Usage in High‑QPS Java and Go Services
Xianyu Technology
Xianyu Technology
Apr 13, 2022 · Big Data

Real-time Multi-system Data Aggregation for Fan Tag System

The Xianyu fan‑tag system solves the challenge of displaying full‑history purchase counts with real‑time updates and low‑latency, high‑throughput queries by daily exporting multi‑system data to a LevelDB‑based KV store, converting schemas, and applying real‑time compensation from transaction and follow‑change messages, merging offline and live data to produce sorted fan lists at ~10 k QPS.

KV storageReal-time Processingdata aggregation
0 likes · 6 min read
Real-time Multi-system Data Aggregation for Fan Tag System
ByteFE
ByteFE
Apr 11, 2022 · Backend Development

ByteDance Wallet Asset Middle Platform Design for 2022 Spring Festival High‑Traffic Reward Distribution

This article details ByteDance's wallet asset middle platform designed for the 2022 Spring Festival, covering eight‑app reward interoperability, high‑QPS challenges, token‑based asynchronous入账, budget control, stability measures, and fund‑safety guarantees, and includes practical solutions for hot‑key handling, budget throttling, and multi‑stage activity isolation.

Budget ControlByteDanceFund Safety
0 likes · 22 min read
ByteDance Wallet Asset Middle Platform Design for 2022 Spring Festival High‑Traffic Reward Distribution
IT Architects Alliance
IT Architects Alliance
Mar 27, 2022 · Backend Development

Simulating a 10‑Billion Red‑Envelope System with Go: From 3K to 6K QPS

This article details a step‑by‑step engineering experiment that reproduces a high‑throughput "red‑envelope" service, outlining the required hardware, software stack, load‑generation logic, monitoring setup, and performance results for handling up to 6 000 QPS on a 100‑million‑user scale.

BackendDistributed SystemsGo
0 likes · 21 min read
Simulating a 10‑Billion Red‑Envelope System with Go: From 3K to 6K QPS
FunTester
FunTester
Apr 19, 2021 · Operations

How to Add a Soft‑Start Mechanism for High‑QPS Performance Testing in Java

This article explains the concept of soft‑start in performance testing, presents Java implementations for both fixed‑thread and fixed‑QPS models, discusses error‑impact considerations, and provides practical code snippets to gradually ramp up load and improve measurement accuracy for high‑throughput services.

JavaLoad TestingPerformance Testing
0 likes · 8 min read
How to Add a Soft‑Start Mechanism for High‑QPS Performance Testing in Java
Ctrip Technology
Ctrip Technology
Feb 25, 2021 · Backend Development

Design and Implementation of a Cache Access Component and Update Platform for High‑QPS Scenarios

This article describes a backend architecture for a high‑traffic e‑commerce project, detailing a cache access component and a cache update platform that use asynchronous messaging, hotspot‑key handling, versioned cache entries, and Redis to achieve low latency, high QPS support and strong data consistency.

Backendcachingdistributed-systems
0 likes · 18 min read
Design and Implementation of a Cache Access Component and Update Platform for High‑QPS Scenarios
Meituan Technology Team
Meituan Technology Team
Dec 20, 2018 · Backend Development

Design and Performance Optimization of LruCache in Meituan DSP System

Meituan’s DSP system boosted high‑QPS ad serving performance by layering an LRU cache in front of Redis, then adding time‑based eviction, sharding the cache into HashLruCache instances to cut lock contention, and employing a zero‑copy, reference‑counted design, ultimately cutting average latency to about 20 % of the original and similarly reducing 99.9th‑percentile delays.

HashLruCacheLRUCacheMeituan DSP
0 likes · 15 min read
Design and Performance Optimization of LruCache in Meituan DSP System