Tagged articles
56 articles
Page 1 of 1
ITPUB
ITPUB
Apr 29, 2026 · Backend Development

How to Rescue a System When QPS Jumps 100× in 10 Minutes

When a service experiences a sudden 100‑fold QPS surge, this guide walks through immediate emergency measures such as rate limiting, circuit breaking, and traffic shedding, followed by systematic analysis of traffic origins, robust architectural redesign including horizontal scaling, micro‑service decomposition, sharding, pooling, caching, and asynchronous processing, and finally stress testing to ensure resilience.

Circuit BreakingMicroservicescaching
0 likes · 13 min read
How to Rescue a System When QPS Jumps 100× in 10 Minutes
Bighead's Algorithm Notes
Bighead's Algorithm Notes
Apr 23, 2026 · Artificial Intelligence

Paper Review: TradeTrap – Evaluating the Reliability and Faithfulness of LLM‑Based Trading Agents

The article introduces TradeTrap, a unified framework that systematically stress‑tests large‑language‑model‑based autonomous trading agents by injecting component‑level perturbations—such as data falsification, prompt injection, and state tampering—into a historical US‑stock back‑test, revealing how small disturbances can cascade into extreme risk exposure, portfolio drawdown, and performance collapse.

Financial AILLMRobustness
0 likes · 18 min read
Paper Review: TradeTrap – Evaluating the Reliability and Faithfulness of LLM‑Based Trading Agents
Woodpecker Software Testing
Woodpecker Software Testing
Feb 12, 2026 · Operations

How to Build a Full‑Chain JMeter Load Test for an E‑Commerce Mega‑Sale

This article walks through designing and implementing a complete JMeter load‑testing solution for an e‑commerce platform's big‑sale scenario, covering business‑flow mapping, request correlation, multi‑stage stress testing, real‑time monitoring with InfluxDB + Grafana, bottleneck identification, and practical optimization tips.

GrafanaInfluxDBJMeter
0 likes · 7 min read
How to Build a Full‑Chain JMeter Load Test for an E‑Commerce Mega‑Sale
Woodpecker Software Testing
Woodpecker Software Testing
Feb 10, 2026 · Industry Insights

Building a Comprehensive Financial Stress‑Test Scenario Generator

This article explains the principles, architecture, Monte Carlo algorithms, Python implementations, risk‑metric calculations, and practical applications of a financial stress‑test scenario generator, while also discussing future trends, AI integration, and challenges such as data quality and computational cost.

Monte Carlo simulationPythonfinancial modeling
0 likes · 18 min read
Building a Comprehensive Financial Stress‑Test Scenario Generator
FunTester
FunTester
Feb 10, 2026 · Operations

Why Performance Testing Matters and How to Get Started: A Step‑by‑Step Guide

This article explains what performance testing is, why it’s essential for preventing system crashes under load, and provides a practical, step‑by‑step roadmap—including goal definition, test types, tool selection, metric interpretation, protection mechanisms, and result recording—to help developers and ops teams reliably assess and improve application performance.

Load TestingPerformance Testingmonitoring
0 likes · 13 min read
Why Performance Testing Matters and How to Get Started: A Step‑by‑Step Guide
Woodpecker Software Testing
Woodpecker Software Testing
Feb 1, 2026 · Mobile Development

Hidden Mobile Testing Points You Probably Miss

This article outlines a comprehensive checklist of mobile testing items—including functional, conflict, performance, signal, low‑battery, compatibility, stress, field, stability, regular, user‑experience, and hardware tests—to help engineers ensure thorough coverage of a phone's quality and reliability.

Performance TestingUser experiencecompatibility testing
0 likes · 4 min read
Hidden Mobile Testing Points You Probably Miss
FunTester
FunTester
Sep 1, 2025 · Operations

Why Load Testing Is Critical for High‑Traffic Apps and How to Do It Right

This article explains why load testing is essential for modern applications that must serve millions of users, outlines various test types and best‑practice steps, recommends tools and frameworks, and shows how continuous testing integrated into CI/CD pipelines ensures scalability, reliability, and optimal performance under unpredictable traffic spikes.

Load TestingPerformance MonitoringScalability
0 likes · 11 min read
Why Load Testing Is Critical for High‑Traffic Apps and How to Do It Right
Nightwalker Tech
Nightwalker Tech
Aug 13, 2025 · Operations

Mastering Stress Testing: From Basics to Go-Based Load Tools

This comprehensive guide explains what stress testing is, why it matters, key terminology, calculation methods, traditional tools, and introduces a lightweight Go-based load testing utility with detailed usage examples, parameters, and best‑practice recommendations for accurate performance evaluation.

BenchmarkLoad TestingQPS
0 likes · 25 min read
Mastering Stress Testing: From Basics to Go-Based Load Tools
Test Development Learning Exchange
Test Development Learning Exchange
Oct 11, 2024 · Fundamentals

Fundamentals of Performance Testing: Concepts, Metrics, Tools, and Best Practices

This article provides a comprehensive overview of performance testing fundamentals, covering core concepts, key metrics, common testing tools, test design, load generation, result analysis, bottleneck identification, optimization techniques, cloud and micro‑service testing, monitoring, reporting, challenges, and cost‑benefit considerations.

BenchmarkingLoad TestingPerformance Testing
0 likes · 12 min read
Fundamentals of Performance Testing: Concepts, Metrics, Tools, and Best Practices
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Aug 4, 2023 · Backend Development

Designing a High-Concurrency Flash Sale System: Architecture, Rate Limiting, Caching, and Monitoring

This article outlines the end‑to‑end design of a high‑availability flash‑sale system, covering traffic spikes, overload protection, inventory consistency, multi‑level caching, token‑bucket rate limiting, distributed queue processing, service monitoring, and stress‑testing strategies to ensure reliable million‑level transaction handling.

cachingdistributed-systemshigh-concurrency
0 likes · 15 min read
Designing a High-Concurrency Flash Sale System: Architecture, Rate Limiting, Caching, and Monitoring
NetEase LeiHuo Testing Center
NetEase LeiHuo Testing Center
Mar 3, 2023 · Operations

Comprehensive Stress Test Scope Evaluation: Adding and Subtracting Approaches

This article presents a systematic method for evaluating and prioritizing stress‑test coverage in game projects, combining comprehensive “addition” techniques—such as QA requirement gathering, case‑library reference, test‑case review, and server‑architecture analysis—with precise “subtraction” methods to refine the final test set.

Game DevelopmentPerformance TestingServer Architecture
0 likes · 13 min read
Comprehensive Stress Test Scope Evaluation: Adding and Subtracting Approaches
Advanced AI Application Practice
Advanced AI Application Practice
Feb 20, 2023 · Operations

Three Key Pressure‑Testing Questions: Defining Test Points, Getting Expected Metrics, and Crafting Strategies

The article outlines three common pressure‑testing challenges—how to extract test points, how to obtain realistic target metrics, and how to design test strategies—then demonstrates single‑scenario load, mixed‑business load, and long‑duration stability tests using JMeter with stepwise thread ramp‑up and metric analysis.

JMeterLoad TestingPerformance Testing
0 likes · 4 min read
Three Key Pressure‑Testing Questions: Defining Test Points, Getting Expected Metrics, and Crafting Strategies
Alibaba Cloud Native
Alibaba Cloud Native
Dec 16, 2022 · Operations

Mastering Performance Testing: Tools, Techniques, and Real-World Case Studies

This comprehensive guide explains what performance testing (stress testing) is, why it matters, various test types, popular tools like ApacheBench, JMeter, LoadRunner and PTS, and provides detailed step-by-step methodologies and real-world case analyses for diagnosing memory, CPU, and latency issues in cloud‑native Java applications.

CPU optimizationJVM profilingload testing tools
0 likes · 24 min read
Mastering Performance Testing: Tools, Techniques, and Real-World Case Studies
FunTester
FunTester
Nov 27, 2022 · Fundamentals

Why Performance Testing Matters: Key Metrics, Types, and Best Practices

This guide explains what performance testing is, why it’s essential, the key metrics such as throughput, response time, and bandwidth, outlines a step‑by‑step testing process, compares load, stress, endurance and capacity testing types, and reviews popular tools like JMeter, LoadRunner and NeoLoad.

Load TestingPerformance TestingSoftware Testing
0 likes · 10 min read
Why Performance Testing Matters: Key Metrics, Types, and Best Practices
DeWu Technology
DeWu Technology
Sep 26, 2022 · Cloud Native

DeWu's High‑Availability Architecture Evolution

DeWu’s tech team describes how their e‑commerce platform grew from a simple PHP monolith to a containerized active‑active, multi‑region system with hot‑standby failover, comprehensive governance, full‑link stress testing, and detailed big‑sale preparation, illustrating a systematic, evolving high‑availability architecture that balances scalability, disaster recovery, and business continuity.

MicroservicesSystem Architecturedisaster recovery
0 likes · 21 min read
DeWu's High‑Availability Architecture Evolution
DataFunSummit
DataFunSummit
Aug 21, 2022 · Big Data

Alluxio Stress Testing Methods and Practices

This article explains the purpose, sources, and manifestations of pressure in Alluxio, describes its built‑in stress testing framework, outlines how to run and configure stress tools, and provides guidance on result calculation, reporting, common issues, and debugging for effective performance evaluation.

AlluxioBig DataPerformance Evaluation
0 likes · 11 min read
Alluxio Stress Testing Methods and Practices
Snowball Engineer Team
Snowball Engineer Team
Aug 1, 2022 · Fundamentals

Performance Testing: Concepts, Scenarios, Tools, and Best Practices

This article explains what performance testing is, when to conduct it, typical scenarios, step‑by‑step execution—including requirements, test design, tool selection, script examples with Locust, result analysis, and key metrics such as throughput, response time, P90, and optimal concurrency.

JMeterLoad TestingLocust
0 likes · 27 min read
Performance Testing: Concepts, Scenarios, Tools, and Best Practices
Software Development Quality
Software Development Quality
Jul 14, 2022 · Operations

Mastering Full‑Link Stress Testing and Stability Assurance for Large‑Scale Promotions

This guide details a comprehensive approach to stability assurance and test innovation, covering full‑link stress testing, functional pre‑runs, loss‑prevention, fault drills, efficiency innovation from zero to one, and systematic quality assurance thinking for large‑scale promotional events.

Operationsefficiency innovationfault drills
0 likes · 14 min read
Mastering Full‑Link Stress Testing and Stability Assurance for Large‑Scale Promotions
JD Retail Technology
JD Retail Technology
Jul 12, 2022 · Operations

Post‑Mortem of JD International Tech R&D’s 2022 618 Promotion: Preparation, Execution, and Lessons Learned

The report details JD International Technology R&D’s first comprehensive preparation for the 2022 618 sales event, describing multi‑regional challenges, the establishment of rapid response mechanisms, stress‑testing activities, operational highlights, identified shortcomings, and actionable recommendations for future large‑scale promotions.

Cross-Team CollaborationSOPe‑commerce
0 likes · 9 min read
Post‑Mortem of JD International Tech R&D’s 2022 618 Promotion: Preparation, Execution, and Lessons Learned
Bilibili Tech
Bilibili Tech
Jun 28, 2022 · Mobile Development

Live Broadcast Stress Testing Platform and Performance Optimization for Mobile Apps

Bilibili’s senior test engineer Lu Kai built a platformized live broadcast stress‑testing system with five modules—metadata collection, task configuration, pressure generation, performance data collection, and reporting—that reuses thousands of cases, simulates high‑QPS traffic, detects hundreds of bugs and memory leaks, and improves iOS and Android app performance for major events.

Performance TestingSystem Architecturebroadcast
0 likes · 13 min read
Live Broadcast Stress Testing Platform and Performance Optimization for Mobile Apps
ITPUB
ITPUB
Jun 27, 2022 · Big Data

How Kuaishou Guarantees Real‑Time Data Warehouse Performance at Billion‑Scale Events

This article details Kuaishou's real‑time data warehouse architecture, the business challenges of massive traffic and diverse requirements, and the forward‑ and reverse‑assurance strategies—including lifecycle standards, monitoring, fault‑injection testing, and a Spring Festival case study—that together ensure high stability, low latency, and sub‑0.5% accuracy for billion‑scale streaming workloads.

Fault InjectionFlink streamingKuaishou
0 likes · 22 min read
How Kuaishou Guarantees Real‑Time Data Warehouse Performance at Billion‑Scale Events
Alibaba Cloud Native
Alibaba Cloud Native
Jun 9, 2022 · Operations

Mastering Load Testing: Types, Tools, and Real‑World Case Studies

This article explains what load testing is, why it matters, the main testing types, essential terminology, compares popular tools, offers step‑by‑step guidance for selecting a tool, and presents detailed real‑world Java performance problem case studies with commands and analysis techniques.

JMeterLoad TestingPerformance Testing
0 likes · 26 min read
Mastering Load Testing: Types, Tools, and Real‑World Case Studies
Bilibili Tech
Bilibili Tech
Mar 4, 2022 · Operations

Stability Engineering Practices for Large-Scale Live Streaming: Bilibili's S11 World Championship Case Study

To deliver a flawless live broadcast of the 2021 League of Legends S11 World Championship to over 100 million viewers, Bilibili mobilized hundreds of engineers for four months, establishing strict standards, modeling dozens of user scenarios, estimating traffic, conducting layered stress and chaos tests, implementing automated and manual degradation, detailed SOPs, rate‑limiting safeguards, and on‑site monitoring, which together ensured system stability throughout the event.

degradationrate limitingstress testing
0 likes · 14 min read
Stability Engineering Practices for Large-Scale Live Streaming: Bilibili's S11 World Championship Case Study
Architecture Digest
Architecture Digest
Mar 1, 2022 · Backend Development

Implementing and Testing a High‑Throughput WeChat Red‑Packet System: 1M Connections and Up to 60k QPS

This article details a practical reproduction of a large‑scale WeChat red‑packet service, describing the design goals, hardware and software setup, concurrency architecture, monitoring tools, and performance results that demonstrate a single‑machine handling one million connections and up to sixty thousand queries per second.

Distributed SystemsGoQPS
0 likes · 17 min read
Implementing and Testing a High‑Throughput WeChat Red‑Packet System: 1M Connections and Up to 60k QPS
NetEase LeiHuo Testing Center
NetEase LeiHuo Testing Center
Jan 21, 2022 · Operations

Game Stress Testing: Types, Metrics, and Robot Tools

This article explains the theory and practice of game stress testing, covering four test types (server, client, third‑party services, cloud gaming), key performance indicators such as CPU, memory, network and log metrics, and the use of server and client robots to simulate real‑world load.

AutomationGame performanceLoad Testing
0 likes · 9 min read
Game Stress Testing: Types, Metrics, and Robot Tools
FunTester
FunTester
Jan 18, 2021 · Operations

Stress vs. Load Testing: Fixed Threads vs. Fixed QPS Explained

This article clarifies the distinction between stress testing and load testing, describing their respective models—fixed‑thread and fixed‑QPS—along with key metrics, formulas, and practical benefits for evaluating system performance under varying workloads.

Load TestingOperationsPerformance Testing
0 likes · 5 min read
Stress vs. Load Testing: Fixed Threads vs. Fixed QPS Explained
Code Ape Tech Column
Code Ape Tech Column
Jan 4, 2021 · Databases

Using MySQL’s mysqlslap and Sysbench for Database Stress Testing

This article explains how to perform MySQL performance benchmarking with the built‑in mysqlslap tool and the third‑party sysbench utility, covering configuration changes, command‑line options, data preparation, execution steps, and interpretation of the resulting metrics.

BenchmarkSysbenchmysql
0 likes · 15 min read
Using MySQL’s mysqlslap and Sysbench for Database Stress Testing
JD Retail Technology
JD Retail Technology
Jun 17, 2020 · Operations

How JD’s Data Platforms Scaled for the 618 Mega‑Sale: Operations, Stress‑Testing, and Dual‑Stream Architecture

The article details JD’s data product teams’ systematic preparation for the 618 shopping festival, covering pressure estimation, capacity expansion, stress testing, emergency downgrade strategies, dual‑data‑center isolation, high‑fidelity end‑to‑end testing, and continuous monitoring to ensure stable, real‑time data services during massive traffic spikes.

Big DataData PlatformJD.com
0 likes · 10 min read
How JD’s Data Platforms Scaled for the 618 Mega‑Sale: Operations, Stress‑Testing, and Dual‑Stream Architecture
JD Retail Technology
JD Retail Technology
Jun 11, 2020 · Operations

How JD Health Engineered System Stability for the 618 Mega‑Sale

Facing unprecedented traffic during the 2020 618 shopping festival, JD Health’s product R&D team implemented comprehensive rehearsals, stress testing, architecture reviews, dual‑channel risk controls, and 24‑hour monitoring to ensure system stability and rapid response for its health‑care e‑commerce platforms.

618 promotionJD HealthOperations
0 likes · 5 min read
How JD Health Engineered System Stability for the 618 Mega‑Sale
FunTester
FunTester
May 23, 2020 · Fundamentals

Why Performance, Stress, and Load Testing Are Essential for Reliable Software

The article explains the distinct purposes and benefits of performance, stress, and load testing, showing how each practice helps identify bottlenecks, ensure stability under load, and improve overall software quality throughout the development lifecycle.

Load TestingPerformance TestingSoftware Testing
0 likes · 5 min read
Why Performance, Stress, and Load Testing Are Essential for Reliable Software
ITPUB
ITPUB
May 10, 2020 · Databases

How We Migrated MySQL to Tencent Cloud CDB and Boosted Performance Up to 10×

This case study details the migration of Weimeng's MySQL databases to Tencent Cloud CDB, describing the testing methodology, performance bottlenecks discovered (NUMA, network parameters, low‑concurrency issues, and version bugs), the step‑by‑step optimizations applied, and the resulting QPS improvements across various workloads.

NUMATencent Cloud CDBdatabase migration
0 likes · 20 min read
How We Migrated MySQL to Tencent Cloud CDB and Boosted Performance Up to 10×
360 Tech Engineering
360 Tech Engineering
Nov 18, 2019 · Backend Development

Design and Implementation of the Huajiao Test Platform for Integrated API and Stress Testing

The article details the architecture, core objectives, and implementation of the Huajiao testing platform, describing how it lowers code barriers for testers, provides visual case management, integrates API and pressure testing, and outlines backend services built with Springboot, Mybatis, and related tooling.

API testingJavaSpringBoot
0 likes · 17 min read
Design and Implementation of the Huajiao Test Platform for Integrated API and Stress Testing
JD Retail Technology
JD Retail Technology
Oct 22, 2019 · Industry Insights

How JD.com Prepares Its Systems for 11.11: Stress Tests, Forcebot Evolution, and Quality Controls

JD.com's Retail Technology and Data Platform orchestrated a full‑chain, four‑entry‑point stress test for the 11.11 shopping festival, introduced an upgraded Forcebot traffic‑recording tool, and implemented a "Quality Month" with ten safeguards to ensure system stability and prevent incidents during the massive sales event.

DevOpsOperationse‑commerce
0 likes · 7 min read
How JD.com Prepares Its Systems for 11.11: Stress Tests, Forcebot Evolution, and Quality Controls
JD Retail Technology
JD Retail Technology
Jun 15, 2019 · Information Security

Risk Control Department's 618 Shopping Festival Preparation

The Risk Control Department's 618 shopping festival preparation involves comprehensive planning, system capacity expansion, and 24/7 monitoring to ensure stable operations and combat fraud during the high-traffic shopping event.

24/7 monitoringFraud Preventioncapacity expansion
0 likes · 3 min read
Risk Control Department's 618 Shopping Festival Preparation
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Oct 21, 2018 · Operations

How to Use tcpcopy for Real-Time Traffic Replay and Stress Testing

tcpcopy is a request‑copy tool that captures live traffic from production servers and replays it on test machines, enabling functional and stress testing without affecting users, and the guide covers its architecture, workflow, installation, configuration, and both online and offline usage modes.

Linuxiptablesnetwork testing
0 likes · 10 min read
How to Use tcpcopy for Real-Time Traffic Replay and Stress Testing
Dada Group Technology
Dada Group Technology
Dec 22, 2017 · Operations

Performance Testing Process, Plans, and Best Practices for High‑Traffic Events

This article explains the purpose of performance (stress) testing, compares four testing approaches, details the chosen proportional‑deployment strategy, and provides comprehensive preparation steps, script guidelines, metric analysis, and practical tips for ensuring system stability during large‑scale traffic spikes.

Load TestingOperationscapacity planning
0 likes · 10 min read
Performance Testing Process, Plans, and Best Practices for High‑Traffic Events
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 11, 2017 · Operations

How Alibaba’s Full‑Link Stress Test Powers Double 11’s Record‑Breaking Traffic

Alibaba’s full‑link stress testing, which simulates real‑world traffic across the entire e‑commerce platform, enabled the 2017 Double 11 event to handle peak loads of 325,000 transactions per second, demonstrating how production‑level, data‑isolated load testing ensures stability and capacity planning for massive online sales.

capacity planningstress testing
0 likes · 9 min read
How Alibaba’s Full‑Link Stress Test Powers Double 11’s Record‑Breaking Traffic
360 Quality & Efficiency
360 Quality & Efficiency
Sep 2, 2016 · Operations

Linux Test Project (LTP): Installation, Usage, and Stress Testing Guide

This article provides a comprehensive guide to the Linux Test Project (LTP), covering its purpose, supported architectures, directory layout, installation steps, test categories, execution scripts, stress‑testing commands, result analysis, and troubleshooting tips for improving kernel stability and reliability.

LTPLinuxkernel testing
0 likes · 7 min read
Linux Test Project (LTP): Installation, Usage, and Stress Testing Guide
dbaplus Community
dbaplus Community
Aug 28, 2016 · Databases

Scaling Databases: From Baseline Metrics to Multi‑Layer Optimization

This guide walks DBAs through evaluating current database resources, establishing performance baselines, building business pressure models, conducting realistic stress tests, and applying a seven‑layer optimization roadmap—from statement tweaks to hardware upgrades and business‑level adjustments—to ensure the system can handle ten‑fold or hundred‑fold growth.

Database PerformanceHardwarecapacity planning
0 likes · 16 min read
Scaling Databases: From Baseline Metrics to Multi‑Layer Optimization
360 Quality & Efficiency
360 Quality & Efficiency
Apr 8, 2016 · Operations

Comparative Analysis of Six Server Load‑Testing Tools: LoadRunner, JMeter, Spirent Avalanche, Siege, Tsung, and Locust

This article presents a detailed comparison of six server‑side load‑testing tools—LoadRunner, JMeter, Spirent Avalanche, Siege, Tsung, and Locust—through multiple real‑world scenarios, evaluating their protocol support, scripting flexibility, performance, cost, and suitability for different testing requirements.

Load TestingTool comparisonstress testing
0 likes · 11 min read
Comparative Analysis of Six Server Load‑Testing Tools: LoadRunner, JMeter, Spirent Avalanche, Siege, Tsung, and Locust
ITPUB
ITPUB
Feb 18, 2016 · Operations

Building a Custom RPC Stress‑Testing Tool: Insights from Meituan

Meituan’s internal RPC services, largely built on Thrift, required a streamlined pressure‑testing solution, leading to the development of a custom tool that automates traffic capture, provides an intuitive UI, aggregates metrics via InfluxDB, and supports both Thrift and HTTP workloads, addressing the shortcomings of existing open‑source options.

Backend ToolsOperationsRPC
0 likes · 8 min read
Building a Custom RPC Stress‑Testing Tool: Insights from Meituan
MaGe Linux Operations
MaGe Linux Operations
Jul 5, 2014 · Operations

Discover 10 Free Tools to Load‑Test Your Web Applications

Learn about ten free, open‑source tools—including Grinder, JMeter, and Siege—that let you simulate concurrent users, measure performance, and identify bottlenecks in web applications, helping you assess server capacity and optimize site responsiveness under load.

Load TestingOperationsWeb Performance
0 likes · 8 min read
Discover 10 Free Tools to Load‑Test Your Web Applications