Big Data 7 min read

Millisecond-Level Counting for Billion-Scale Data via Offline Batch and Online Incremental Statistics

To achieve millisecond‑level counting on billion‑scale data, the Xianyu team replaced slow MySQL count queries with an offline batch that snapshots relational tables and computes totals, then uses KV‑store incremental statistics for online updates, delivering sub‑10 ms responses with near‑100 % success.

Xianyu Technology

Oct 16, 2018

Millisecond-Level Counting for Billion-Scale Data via Offline Batch and Online Incremental Statistics

Relational databases become inefficient for count queries on billion‑scale data; the Xianyu team needed millisecond‑level counting.

Traditional MySQL count operations cannot meet online service requirements due to high latency.

The proposed design replaces costly count queries with an offline batch processing step combined with online incremental statistics stored in a KV store, achieving sub‑10 ms response and near‑100 % success rate.

Offline batch copies all relational data to an offline store (e.g., ODPS) at a snapshot time, computes total counts per source, and records the latest modification timestamp (offlineTotal).

Because sharded tables have inconsistent snapshot times, the solution uses the batch start time as the snapshot reference, avoiding data pollution.

Online, incremental data records daily total increments (dailyIncrTotal) and per‑event increments (modifiedTimeIncr). The final count is calculated as offlineTotal + ΣdailyIncrTotal – overlap, using the latest modification time to subtract duplicated increments.

This approach reduces a single count request to a few KV reads, delivering real‑time performance while retaining the ability to re‑run offline jobs for correction.

The technique demonstrates how offline‑online hybrid processing can solve high‑throughput counting problems in big‑data environments.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

Big Data database offline processing incremental counting

Written by

Xianyu Technology

Official account of the Xianyu technology team

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.