Tag

backend operations

0 views collected around this technical thread.

Architecture Digest
Architecture Digest
Jan 5, 2024 · Operations

Nginx Rate Limiting: Request Rate, Connection Limits, and Bandwidth Control

This article explains how to configure Nginx for rate limiting, including request rate control, burst handling, whitelisting, connection limits, and upload/download bandwidth throttling, with detailed directives, examples, and code snippets to ensure proper service stability.

NginxRate Limitingbackend operations
0 likes · 14 min read
Nginx Rate Limiting: Request Rate, Connection Limits, and Bandwidth Control
Baidu Geek Talk
Baidu Geek Talk
Nov 22, 2023 · Operations

Stability Assurance for Baidu Search Aladdin during Large-Scale Events

Baidu’s Aladdin search service safeguards stability during massive traffic spikes—such as Gaokao, the Tokyo and Beijing Olympics—by mapping dependencies, deploying multi‑dimensional monitoring, adding scaling layers like multi‑region Redis, and establishing rapid‑response on‑call teams, achieving over 99.99 % uptime and near‑real‑time data updates.

backend operationsfault handlinglarge-scale traffic
0 likes · 9 min read
Stability Assurance for Baidu Search Aladdin during Large-Scale Events
360 Quality & Efficiency
360 Quality & Efficiency
Nov 11, 2022 · Operations

Understanding TCPCopy: Architecture, Core Principles, and Performance

This article introduces the open‑source traffic‑replay tool TCPCopy, explains its 1.0 architecture—including the tcpcopy and intercept components—covers its packet‑capture and injection methods (raw socket vs pcap), TCP state handling, routing challenges, intercept role, and performance characteristics, providing practical insights for backend testing and operations.

backend operationsnetwork testingpcap
0 likes · 9 min read
Understanding TCPCopy: Architecture, Core Principles, and Performance
Sanyou's Java Diary
Sanyou's Java Diary
Aug 11, 2022 · Operations

Rapidly Diagnose Production Bugs with Linux Tools, Performance Tricks & Design Patterns

This article guides developers through classifying system‑level and business‑level bugs, using Linux utilities like perf, ps, and vmstat for quick root‑cause analysis, and outlines effective code‑design patterns and architectural strategies—caching, rate‑limiting, and high‑availability—to prevent and resolve production incidents.

Linux performancebackend operationsbug troubleshooting
0 likes · 13 min read
Rapidly Diagnose Production Bugs with Linux Tools, Performance Tricks & Design Patterns
Laravel Tech Community
Laravel Tech Community
Jun 6, 2022 · Operations

Nginx Unit 1.27.0 Release: HTTPS Redirection, Configurable Filenames, and Platform Updates

The Nginx Unit 1.27.0 release introduces HTTP‑to‑HTTPS redirection using $request_uri, configurable default filenames for pure‑path URIs, numerous bug fixes, expanded Linux distribution support, and updated Docker images with the latest language runtimes.

HTTPS redirectionNGINX UnitRelease Notes
0 likes · 4 min read
Nginx Unit 1.27.0 Release: HTTPS Redirection, Configurable Filenames, and Platform Updates
Architecture Digest
Architecture Digest
Oct 3, 2021 · Operations

Comparison of Distributed Scheduling Frameworks and Their Differences from Quartz

This article examines common business scenarios that require timed tasks, introduces single‑machine and distributed scheduling solutions such as Timer, ScheduledExecutorService, Spring, Quartz, TBSchedule, elastic‑job, Saturn, and XXL‑Job, and provides a detailed feature‑by‑feature comparison to help choose the most suitable framework.

QuartzTask Schedulerbackend operations
0 likes · 11 min read
Comparison of Distributed Scheduling Frameworks and Their Differences from Quartz
macrozheng
macrozheng
Sep 6, 2021 · Operations

Choosing the Right Distributed Scheduler: Elastic‑Job vs X‑Job vs Quartz

This article examines common business scenarios requiring timed tasks, compares single‑machine and distributed scheduling frameworks such as Timer, Spring, Quartz, TBSchedule, Elastic‑Job, Saturn and XXL‑Job, and provides guidance on selecting the most suitable solution.

Quartzbackend operationsdistributed scheduling
0 likes · 15 min read
Choosing the Right Distributed Scheduler: Elastic‑Job vs X‑Job vs Quartz
Qunar Tech Salon
Qunar Tech Salon
Feb 7, 2020 · Operations

Internal Resource Governance Practices for High‑Availability Systems

This article outlines comprehensive internal resource governance techniques—including degradation, circuit breaking, isolation, async conversion, thread‑pool management, JVM and hardware metric monitoring, and daily operational practices—to enhance system stability and high availability in large‑scale backend services.

High AvailabilitySystem Stabilitybackend operations
0 likes · 10 min read
Internal Resource Governance Practices for High‑Availability Systems
Architecture Digest
Architecture Digest
Sep 23, 2019 · Operations

Improving Application Availability: Practices, Monitoring, and Fault‑Tolerance in a Large‑Scale Payment System

The article describes how a high‑traffic payment platform achieves 99.999% availability by avoiding single points of failure, applying fail‑fast principles, implementing resource limits, building real‑time monitoring and alerting, and automating fault detection, routing, and recovery to ensure continuous 7×24 operation.

High Availabilitybackend operationsfault tolerance
0 likes · 23 min read
Improving Application Availability: Practices, Monitoring, and Fault‑Tolerance in a Large‑Scale Payment System
Full-Stack Internet Architecture
Full-Stack Internet Architecture
Aug 1, 2019 · Operations

Handling GC Alerts by Splitting and Sharding Scheduled Tasks in Production

The article recounts a production incident where a GC alert triggered due to excessive object creation in a scheduled ad‑transaction sync task, and explains how the problem was diagnosed, mitigated by task splitting, and finally resolved through data sharding across multiple machines.

GCPerformance TuningSharding
0 likes · 6 min read
Handling GC Alerts by Splitting and Sharding Scheduled Tasks in Production
Efficient Ops
Efficient Ops
May 13, 2018 · Operations

Diagnosing and Fixing TCP SYN Queue Overflows that Crash E‑commerce Sites

This article walks through a real‑world incident where an e‑commerce site suffered intermittent outages due to TCP SYN and accept queue overflows, explains the underlying handshake mechanics, shows how kernel and Nginx parameters can be tuned, and provides Python scripts for testing and SYN‑flood simulation.

LinuxSYN floodTCP
0 likes · 9 min read
Diagnosing and Fixing TCP SYN Queue Overflows that Crash E‑commerce Sites