Tagged articles
20 articles
Alibaba Cloud Big Data AI Platform
Sep 5, 2025 · Big Data

How StarRocks + Paimon Powered Real‑Time Analytics for Alibaba’s Taobao Flash Sale

Facing minute‑level decision demands and billions of marketing events during Taobao's Flash Sale, the Ele.me data team built a real‑time lakehouse with StarRocks and Paimon, leveraging asynchronous materialized views, RoaringBitmap de‑duplication, and resource isolation to achieve sub‑second query latency, lower storage costs, and stable high‑concurrency performance.

Lakehouse, Materialized Views, Paimon
0 likes · 25 min read
StarRocks
Sep 2, 2025 · Big Data

How StarRocks + Paimon Powered Real‑Time Analytics for Alibaba’s Flash Sale

Faced with billions of marketing events and minute‑level decision requirements during Taobao's flash‑sale campaign, the e‑commerce data team built a real‑time lakehouse using StarRocks and Paimon, leveraged asynchronous materialized views and RoaringBitmap deduplication, and achieved sub‑second query latency, massive cost savings, and stable high‑concurrency performance.

Big Data, Lakehouse, Materialized Views
0 likes · 26 min read
JD Cloud Developers
Dec 25, 2024 · Backend Development

How RoaringBitmap Transforms Massive User ID Storage in CDPs

This article explains how a CDP handles billion‑scale user ID tags and groups by replacing naïve text‑file storage with bitmap techniques, covering Bitmap basics, encoding strategies, the limitations of Java BitSet, and the adoption of RoaringBitmap for efficient compression and fast set operations.

RoaringBitmap, bigdata, storage
0 likes · 10 min read
JD Tech
Mar 1, 2024 · Fundamentals

Optimizing Marketing System Blacklist Filtering with Bitmaps

This article examines how bitmap data structures and multithreading can dramatically accelerate blacklist filtering in large‑scale marketing systems, reducing processing time from tens of minutes to milliseconds while saving memory and improving overall system performance.

Bitmap, Blacklist, RoaringBitmap
0 likes · 13 min read
JD Tech
Jan 22, 2024 · Big Data

Efficient High‑Concurrency Data Retrieval Using Inverted Index and Bitmap Techniques

This article explores how to achieve fast, scalable data retrieval in million‑level high‑concurrency scenarios by replacing naïve full‑combination rule matching with column‑wise inverted indexes and bitmap operations, dramatically reducing time complexity and improving stability while leveraging RoaringBitmap compression for space efficiency.

Bitmap, RoaringBitmap, high concurrency
0 likes · 12 min read
DaTaobao Tech
Sep 6, 2023 · Big Data

Accelerating User Profile Analysis with Hologres RoaringBitmap

The article explains how Hologres RoaringBitmap compresses user ID sets into efficient bitmap indexes, splits 64‑bit IDs into buckets, and syncs them from MaxCompute, enabling sub‑second user profile queries that previously took minutes and dramatically improving performance and scalability.

Bitmap Index, Hologres, RoaringBitmap
0 likes · 18 min read
dbaplus Community
Dec 6, 2022 · Backend Development

How Meituan Cut Elasticsearch Search Latency by 84% with an RLE‑Based Inverted Index

This article details how Meituan's search‑engine team optimized Elasticsearch for a high‑traffic LBS scenario: the performance bottlenecks in term‑posting retrieval and merging, the design of a run‑length‑encoding (RLE) inverted index, its integration as a plugin, extensive benchmarking, and the resulting 84% reduction in TP99 query latency.

Backend Search, Elasticsearch, RoaringBitmap
0 likes · 25 min read
Meituan Technology Team
Nov 17, 2022 · Backend Development

Elasticsearch Query and Merge Optimization Using Run-Length Encoding for Meituan Takeaway Search

Meituan's food‑delivery search team identified heavy CPU and latency hotspots in Elasticsearch's posting‑list query and merge phases, then redesigned the inverted index using Run‑Length Encoding, hash‑based term lookup, index sorting and a custom SparseRoaringDocIdSet, ultimately reducing TP99 search latency by 84% and cutting CPU usage dramatically.

Elasticsearch, Index Sorting, RoaringBitmap
0 likes · 26 min read
Bilibili Tech
Sep 30, 2022 · Big Data

From BitMap to RoaringBitmap: Principles, Performance, and Big Data Applications

RoaringBitmap improves on the traditional BitMap by lazily allocating four container types, compressing sparse data, and dynamically switching between array, bitmap, and run containers. This enables fast, exact set operations that power big‑data systems such as Kylin, ClickHouse, and Bilibili’s user‑visit and crowd‑package pipelines, dramatically reducing memory use and processing latency.

Big Data, Bitmap Compression, Data Structures
0 likes · 16 min read
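The container idea summarized in the entry above can be illustrated with a minimal sketch. It assumes the commonly documented Roaring layout: each 32‑bit value is split into a high 16‑bit chunk key and a low 16‑bit offset, chunks are created only when a value lands in them, and a chunk is stored as a sorted array while it holds at most 4096 values and as an 8 KB bitmap beyond that. Run containers are left out for brevity, and the names `build_containers` and `contains` are hypothetical, not part of any library.

```python
from bisect import bisect_left
from collections import defaultdict

ARRAY_MAX = 4096  # assumed threshold: larger chunks are stored as bitmaps


def build_containers(values):
    """Group 32-bit integers by their high 16 bits and pick a container per chunk."""
    chunks = defaultdict(set)
    for v in values:
        if not 0 <= v < 2**32:
            raise ValueError(f"{v} is not an unsigned 32-bit integer")
        # Chunks are allocated lazily: only keys that actually occur exist.
        chunks[v >> 16].add(v & 0xFFFF)

    containers = {}
    for key, lows in chunks.items():
        if len(lows) <= ARRAY_MAX:
            # Sparse chunk: a sorted list of 16-bit offsets.
            containers[key] = ("array", sorted(lows))
        else:
            # Dense chunk: 65536 bits = 8192 bytes, one bit per offset.
            bits = bytearray(8192)
            for low in lows:
                bits[low >> 3] |= 1 << (low & 7)
            containers[key] = ("bitmap", bits)
    return containers


def contains(containers, v):
    """Membership test against the structure built by build_containers."""
    entry = containers.get(v >> 16)
    if entry is None:
        return False
    kind, data = entry
    low = v & 0xFFFF
    if kind == "array":
        i = bisect_left(data, low)  # binary search in the sorted offsets
        return i < len(data) and data[i] == low
    return bool(data[low >> 3] & (1 << (low & 7)))
```

A sparse chunk with a handful of IDs costs a few bytes per value, while a dense chunk is capped at 8 KB, which is where the memory savings over a flat bitmap come from.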
DeWu Technology
Jul 8, 2022 · Big Data

Optimizing Large-Scale Product Set Refresh with RoaringBitmap

By representing the pre‑ and post‑refresh SPU sets as RoaringBitmaps and diffing them, the system avoids full‑insert writes, cuts memory usage by orders of magnitude, speeds up refreshes by over 50%, and reduces write volume by nearly 87%, solving large‑scale tag‑based product refresh challenges.

Bitmap, DataStructure, HBase
0 likes · 14 min read
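The diff‑based refresh described in the entry above can be sketched roughly as follows. This is an illustration only: Python's built‑in `set` stands in for RoaringBitmap, and the function name `diff_refresh` and the sample IDs are hypothetical rather than taken from the article.

```python
def diff_refresh(old_ids, new_ids):
    """Return the IDs to insert and the IDs to delete for one refresh.

    Only the difference between the two sets is written, instead of
    re-inserting the full post-refresh set.
    """
    old_set, new_set = set(old_ids), set(new_ids)
    to_add = new_set - old_set      # present after the refresh, absent before
    to_remove = old_set - new_set   # present before the refresh, absent after
    return sorted(to_add), sorted(to_remove)


# Hypothetical SPU IDs before and after a refresh.
added, removed = diff_refresh([1, 2, 3, 4], [3, 4, 5])
```

When most of a product set is unchanged between refreshes, the two difference sets are small, which is the situation in which write volume drops sharply.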
vivo Internet Technology
Apr 13, 2022 · Databases

Redis Integer Set Optimization for Game Recommendation Deduplication: RoaringBitMap vs intset vs Bloom Filter

For deduplicating game recommendations in Redis, RoaringBitMap outperforms intset and Bloom filters by storing 300 auto‑incrementing game IDs in roughly 0.5 KB (over twice the compression of intset and far smaller than the 29 KB Bloom filter), thereby cutting memory use, latency, and hardware costs.

Data Structure Optimization, Memory Optimization, RoaringBitmap
0 likes · 9 min read
Sohu Tech Products
Jun 9, 2021 · Big Data

Real-time UV Counting with Flink, Hologres, and RoaringBitmap

This article explains how to implement both offline (T+1) and real‑time UV counting using Hologres with RoaringBitmap for high‑cardinality aggregation, and demonstrates a complete Flink‑Hologres pipeline—including table creation, streaming joins, windowed aggregation, and query examples—for fine‑grained user metric analysis.

Flink, Hologres, RoaringBitmap
0 likes · 11 min read
Suning Technology
Dec 18, 2020 · Big Data

How ClickHouse Powered Suning’s Billion‑Tag User Profiles in Seconds

Suning’s senior architect Yang Zhaohui explains how his team rebuilt the tag platform with ClickHouse, using RoaringBitmap and custom optimizations to answer queries over billions of user tags within seconds, dramatically cutting response time, reducing hardware costs, and enabling real-time marketing insights.

OLAP, RoaringBitmap, clickhouse
0 likes · 5 min read
58 Tech
Feb 15, 2019 · Artificial Intelligence

Precise Push Notification Architecture and Algorithm Optimization at 58.com

This article describes the evolution of 58.com's user‑set service architecture, the transition from MongoDB to RoaringBitmap storage, and the machine‑learning‑driven algorithm optimizations that enable real‑time, multi‑dimensional, and localized push notifications for millions of users.

Algorithm Optimization, RoaringBitmap, bitmap storage
0 likes · 13 min read