Tag

data aggregation

0 views collected around this technical thread.

php中文网 Courses
php中文网 Courses
May 7, 2025 · Fundamentals

Comprehensive Guide to Pandas Data Processing in Python

This tutorial provides a detailed overview of Pandas, covering its core data structures, data import/export, selection, cleaning, aggregation, merging, and a practical sales analysis example, with complete code snippets for each operation.

Pythondata aggregationdata analysis
0 likes · 8 min read
Comprehensive Guide to Pandas Data Processing in Python
Test Development Learning Exchange
Test Development Learning Exchange
Nov 18, 2024 · Fundamentals

Using Pandas groupby for Data Aggregation and Grouping

This tutorial teaches how to use the Pandas library's groupby method to group data by one or multiple columns and apply aggregation functions such as sum, mean, max, and min, with clear code examples and a practical exercise.

data aggregationdata analysisgroupby
0 likes · 4 min read
Using Pandas groupby for Data Aggregation and Grouping
Zhuanzhuan Tech
Zhuanzhuan Tech
Nov 14, 2024 · Backend Development

Design and Implementation of a Counting System for a B2B Marketplace

This article describes the background, requirements, and architectural design of a custom counting system for a B2B second‑hand trading platform, covering dimension definitions, internal vs external implementations, data flow, persistence strategies, code interfaces, and read‑query optimizations.

Redisbackend architecturecounting system
0 likes · 20 min read
Design and Implementation of a Counting System for a B2B Marketplace
Bilibili Tech
Bilibili Tech
Jun 30, 2023 · Backend Development

Optimizing Bilibili Video Detail Page Backend with Business Association Index and Data Aggregation Gateway

The article describes how Bilibili tackled exploding fan‑out reads on its video detail page by introducing a Redis‑backed business‑association index and a generic Data Aggregation Gateway, which together cut downstream service traffic and load by over 90% while simplifying aggregation across multiple terminals and scenarios.

BFFBackendIndexing
0 likes · 13 min read
Optimizing Bilibili Video Detail Page Backend with Business Association Index and Data Aggregation Gateway
Xianyu Technology
Xianyu Technology
Apr 13, 2022 · Big Data

Real-time Multi-system Data Aggregation for Fan Tag System

The Xianyu fan‑tag system solves the challenge of displaying full‑history purchase counts with real‑time updates and low‑latency, high‑throughput queries by daily exporting multi‑system data to a LevelDB‑based KV store, converting schemas, and applying real‑time compensation from transaction and follow‑change messages, merging offline and live data to produce sorted fan lists at ~10 k QPS.

KV storageReal-time Processingdata aggregation
0 likes · 6 min read
Real-time Multi-system Data Aggregation for Fan Tag System
Beike Product & Technology
Beike Product & Technology
Apr 8, 2022 · Product Management

Effective Reading Duration and Screen Count Metrics for Mobile App Content

The article explains how to define, collect, and calculate effective reading duration and screen count for mobile app H5 content using event timestamps, formulas, fragment reporting, and aggregation methods to improve content quality analysis and visualization.

data aggregationevent trackingmobile app
0 likes · 9 min read
Effective Reading Duration and Screen Count Metrics for Mobile App Content
Baidu Geek Talk
Baidu Geek Talk
Jan 19, 2022 · Big Data

Quantile Computation in Baidu Advertising System: Architecture and Implementation

Baidu’s advertising platform computes high‑precision response‑time quantiles at massive scale by intercepting each API call, locally summarizing data with mergeable T‑Digest histograms, periodically uploading compressed, Base64‑encoded summaries to a warehouse where they are merged on demand, enabling low‑latency, cost‑effective percentile analysis with sub‑0.1% error.

Big DataT-Digestdata aggregation
0 likes · 11 min read
Quantile Computation in Baidu Advertising System: Architecture and Implementation
Architecture & Thinking
Architecture & Thinking
Nov 2, 2021 · Backend Development

How to Transform a T+1 Dashboard into Real‑Time T+0 with MQ and MongoDB

This article explains how a user‑behavior data dashboard originally built on a daily T+1 batch process was redesigned to achieve real‑time T+0 updates by introducing message‑queue notifications, a dedicated aggregation service, and MongoDB storage, improving data freshness and user experience.

Message QueueMongoDBbackend architecture
0 likes · 5 min read
How to Transform a T+1 Dashboard into Real‑Time T+0 with MQ and MongoDB
Baidu Geek Talk
Baidu Geek Talk
May 24, 2021 · Big Data

Real-Time Quantile Computation Using TDigest: Architecture and Solutions

The article presents a real‑time quantile solution using the TDigest data structure, which clusters data into centroids and stores digests in Redis or Doris, pre‑computes quantiles for all dimension combinations, and provides a reusable API that delivers fast, accurate, low‑memory quantile statistics for diverse business scenarios.

Big DataDorisRedis
0 likes · 11 min read
Real-Time Quantile Computation Using TDigest: Architecture and Solutions
Xianyu Technology
Xianyu Technology
Mar 26, 2021 · Backend Development

A General Solution for Improving Daily Business Issue Diagnosis Efficiency

The article presents a general solution that streamlines daily business issue diagnosis by aggregating multi‑service data via GraphQL, converting it into human‑readable semantics, and automatically detecting anomalies with a rule engine, enabling one‑click retrieval, reduced hand‑offs, and extensible monitoring for platforms such as Xianyu.

DiagnosisGraphQLQLExpress
0 likes · 10 min read
A General Solution for Improving Daily Business Issue Diagnosis Efficiency
Ctrip Technology
Ctrip Technology
Mar 25, 2021 · Big Data

Challenges and Approaches for Real‑Time Data Aggregation Analysis

The article examines the key challenges of real‑time data aggregation—data freshness, timely processing, and result visibility—and surveys common solutions such as timestamp‑based sync, CDC, full and incremental computation, storage formats, and trigger mechanisms.

Big DataCDCdata aggregation
0 likes · 11 min read
Challenges and Approaches for Real‑Time Data Aggregation Analysis
iQIYI Technical Product Team
iQIYI Technical Product Team
Mar 12, 2021 · Frontend Development

Hawkeye: A Self-Developed Front-End Monitoring Platform for iQiyi

Hawkeye, iQiyi’s self‑developed front‑end monitoring platform, unifies exception, performance and behavior tracking, automatically aggregates events with unique error IDs, isolates business‑type alerts via configurable codes, links front‑end and back‑end traces through Trace IDs, and leverages a JSSDK, backend services, big‑data stream processing and storage in Redis, Elasticsearch and HBbase, with future plans for small‑program SDKs, wider framework support and open‑source release.

Architecture DesignError TrackingFront-end Monitoring
0 likes · 9 min read
Hawkeye: A Self-Developed Front-End Monitoring Platform for iQiyi
Aikesheng Open Source Community
Aikesheng Open Source Community
Nov 4, 2019 · Databases

Using MySQL Multi‑Source Replication to Aggregate Data from Multiple Servers

The article explains how MySQL multi‑source replication can be employed to back up, horizontally or vertically partition, and merge data from several servers into a single target database, illustrating three typical scenarios and providing example SQL for data consolidation.

Multi-Source ReplicationMySQLdata aggregation
0 likes · 5 min read
Using MySQL Multi‑Source Replication to Aggregate Data from Multiple Servers
Youku Technology
Youku Technology
Sep 4, 2019 · Backend Development

Technical Deep Dive of Youku Media Asset Platform: Storage, Search, and Data Aggregation

The article details Youku’s new media‑asset platform, which replaces a fragmented MySQL‑based system with a domain‑driven entity model stored in Ali‑HBase, leverages Elasticsearch for flexible front‑and back‑end search, and adds an aggregation layer that unifies diverse data sources and reusable computation tasks, delivering high‑availability, low‑latency service for billions of daily API calls.

BackendElasticsearchSearch
0 likes · 9 min read
Technical Deep Dive of Youku Media Asset Platform: Storage, Search, and Data Aggregation
Xianyu Technology
Xianyu Technology
Aug 23, 2018 · Backend Development

Data Aggregation and GraphQL Adoption at Xianyu: Improving Development Efficiency

Xianyu introduced a lightweight data‑aggregation service and adopted Alibaba’s TQL GraphQL gateway, allowing front‑ends to fetch multiple domain data in a single request, which cut development costs by over 50%, reduced latency, and accelerated page composition across more than 30 pages.

GraphQLbackend developmentdata aggregation
0 likes · 11 min read
Data Aggregation and GraphQL Adoption at Xianyu: Improving Development Efficiency
JD Tech Talk
JD Tech Talk
Aug 2, 2018 · Big Data

Real-Time Order Statistics with Apache Flink in a Data Aggregation Platform

This article explains how the data aggregation platform adopts Apache Flink for high‑throughput, low‑latency stream processing, covering the complete workflow from data source integration, transformation operations, windowing and time concepts, to a concrete order‑count example with custom aggregation logic.

Apache FlinkBig DataFlink
0 likes · 10 min read
Real-Time Order Statistics with Apache Flink in a Data Aggregation Platform
Qunar Tech Salon
Qunar Tech Salon
Jun 5, 2018 · Backend Development

Hotel Quote Search System Architecture and Workflow Overview

This article details the design and operation of a high‑concurrency hotel quote search platform, covering business background, core functionalities, system layers, data fetching, aggregation, scheduling, and price‑update mechanisms to ensure comprehensive, real‑time hotel pricing for users.

Backenddata aggregationhigh concurrency
0 likes · 10 min read
Hotel Quote Search System Architecture and Workflow Overview
Architecture Digest
Architecture Digest
May 27, 2018 · Big Data

Installing Elasticsearch and Performing Data Aggregation Queries

This article walks through installing Elasticsearch 5.6.9, configuring system limits, creating indices, inserting and deleting documents, executing complex aggregation queries, and integrating Elasticsearch with Java using the TransportClient, providing a practical guide for building analytics on large‑scale data.

Big DataElasticsearchJava
0 likes · 12 min read
Installing Elasticsearch and Performing Data Aggregation Queries
Ctrip Technology
Ctrip Technology
May 18, 2017 · Mobile Development

From Crash Logs to Data-Driven Debugging: Mobile Client Development Practices and Performance Monitoring

The article shares practical experiences from large‑scale mobile client development, covering user crash scenarios, non‑intrusive data collection, AOP‑based interaction and network monitoring, performance instrumentation, and data aggregation techniques to improve debugging efficiency and operational reliability.

AOPPerformance Monitoringclient architecture
0 likes · 9 min read
From Crash Logs to Data-Driven Debugging: Mobile Client Development Practices and Performance Monitoring
Qunar Tech Salon
Qunar Tech Salon
Apr 28, 2015 · Databases

SQL Solution for Counting Paid and Unpaid Orders per User‑Product in PostgreSQL

This article explains how to count paid and unpaid orders per user‑product pair in PostgreSQL by aggregating order statuses into arrays, using a sub‑query with GROUP BY and a CASE expression to produce accurate payment success and total order counts.

Order AnalysisPostgreSQLSQL
0 likes · 4 min read
SQL Solution for Counting Paid and Unpaid Orders per User‑Product in PostgreSQL