Operations 7 min read

Logistics R&D Preparation for the 618 Promotion: System Readiness, Stress Testing, and Real‑Time Monitoring

The logistics R&D team spent 62 days preparing for the 618 promotion by analyzing core processes, applying stress tests, implementing fault‑tolerant architectures, planning capacity, and deploying real‑time monitoring tools to ensure system stability and performance under peak traffic.

JD Retail Technology
JD Retail Technology
JD Retail Technology
Logistics R&D Preparation for the 618 Promotion: System Readiness, Stress Testing, and Real‑Time Monitoring

Each major sales promotion is treated like a full‑scale battle; the logistics R&D team prepared from the end of February to May 24, a total of 62 days, with more thorough preparation than in previous years.

Preparation ("Ready"): The team meticulously identified weak points in core systems, introduced stress testing to uncover deep issues, and iterated optimizations to advance the preparation process.

Separate core and auxiliary flows: Keep core processes minimal and stable, handle non‑core work asynchronously or with graceful degradation to avoid rollbacks.

Isolate different business types: Prioritize availability for order services, consistency for inventory, and performance for high‑concurrency features; split memory‑intensive tasks like report import/export to separate resources.

Database sharding: Use vertical sharding (e.g., order DB, product DB) and horizontal expansion (read/write separation, replication), apply partitioning, and separate hot/cold or historical data.

Service dependency principles: Stable services must not depend on volatile ones; avoid circular dependencies; prefer asynchronous weak dependencies across business boundaries; lower‑level services should not depend on higher‑level ones.

Fault transfer: Deploy core systems across multiple data centers, enable multi‑active setups, and provide rapid traffic switching to avoid single‑point failures.

Capacity planning (DID principle): Design for 20× capacity, implement 3×, and deploy 1.5×; evaluate traffic needs without increasing overall resources.

Traffic control: Sequentially manage traffic spikes by diverting load, applying degradation, and finally rate‑limiting with fallback plans.

Return to code: Identify and eliminate "code smells" in the codebase.

Battle ("Fight"): After thorough preparation, the team employed a suite of tools for real‑time monitoring and rapid fault diagnosis.

Customizable real‑time monitoring dashboard per department, with sortable performance metrics.

Comprehensive call‑chain tracing with configurable latency thresholds to locate slow methods.

Alarm statistics portal offering department‑ and application‑level insights for alarm trend analysis.

Slow‑SQL analysis tool that filters by project, IP, and time range.

Configuration scanning covering JVM, Log4j, DB connection pools, etc., plus third‑party library version checks.

Glances – a Linux host performance monitor that replaces the traditional top command with real‑time CPU, memory, load, network, disk, and filesystem metrics.

Arthas – a powerful Java performance diagnostic tool providing full‑stack analysis.

Automatic snapshot generation for medium‑or‑higher severity alarms, preserving fault‑scene data for post‑mortem.

Unified aggregation of alerts from multiple monitoring platforms to give a clear view of application health.

Thanks to all teams for their hard work; we are fully prepared for the 618 promotion!

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

monitoringOperationsSystem DesignPerformance Testingcapacity planning
JD Retail Technology
Written by

JD Retail Technology

Official platform of JD Retail Technology, delivering insightful R&D news and a deep look into the lives and work of technologists.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.