Tagged articles
8 articles
Page 1 of 1
Huolala Tech
Huolala Tech
Nov 22, 2024 · Backend Development

Building a Scalable Message Fusion Platform for Billions of User Notifications

This article details how a rapidly growing on‑demand logistics service designed and implemented a high‑performance, extensible message fusion platform that handles massive, diverse user messages through distributed processing, visual configuration, and self‑healing mechanisms, enabling real‑time, personalized communication at scale.

Backend ArchitectureDistributed ProcessingMessage Queue
0 likes · 13 min read
Building a Scalable Message Fusion Platform for Billions of User Notifications

How Apache SeaTunnel Redefines Data Integration for Modern Data Platforms

This article reviews the evolution of data‑integration architectures toward EtLT, explains the core capabilities of Apache SeaTunnel, and details how a Chinese data‑platform vendor applied and extended SeaTunnel to simplify batch and streaming ingestion, unify multi‑engine processing, and reduce development and operational costs.

Apache SeaTunnelBig DataConnector Development
0 likes · 17 min read
How Apache SeaTunnel Redefines Data Integration for Modern Data Platforms
Baidu Geek Talk
Baidu Geek Talk
Sep 27, 2023 · Big Data

Design and Implementation of a Content Revenue Settlement System

The article details the design and implementation of a content revenue settlement platform that aggregates traffic and ad data, uses a Spark‑plus‑PALO architecture for processing tens of millions of daily records, and employs a master‑worker model with idempotent tasks, temporary tables, and verification steps to ensure reliable monthly profit‑share calculations for authors, media, mini‑program owners, and users.

Distributed ProcessingPaloSpark
0 likes · 14 min read
Design and Implementation of a Content Revenue Settlement System
NetEase Smart Enterprise Tech+
NetEase Smart Enterprise Tech+
Jul 20, 2021 · Backend Development

How NetEase Cloud Accelerates Video Transcoding with Slice‑Based Parallelism

NetEase Cloud’s video transcoding service boosts processing speed by combining hardware acceleration, custom codecs, AMD EPYC servers, and a slice‑based parallel transcoding pipeline, while optimizing cluster task scheduling and handling straggler issues to achieve significant performance gains across large‑scale media workloads.

Distributed ProcessingHardware accelerationVideo Transcoding
0 likes · 16 min read
How NetEase Cloud Accelerates Video Transcoding with Slice‑Based Parallelism
Xianyu Technology
Xianyu Technology
Jun 8, 2021 · Big Data

Longgong Data Analysis Platform: Architecture and Solutions for Large‑Scale Structured Data

The Longgong Data Analysis Platform enables Idle Fish to capture, store, and analyze billions of structured product attributes in real time across more than 8,000 categories, using TableStore, MySQL, ODPS, and a distributed scheduler to achieve over 50% query speedup, 80% category coverage, and rapid support for search and recommendation teams.

AlibabaBig DataData Platform
0 likes · 9 min read
Longgong Data Analysis Platform: Architecture and Solutions for Large‑Scale Structured Data
Programmer DD
Programmer DD
Dec 9, 2020 · Big Data

Master Apache Beam: Build a Portable Word Count Pipeline in Minutes

This tutorial introduces Apache Beam’s unified programming model for batch and streaming, explains its core concepts and terminology, compares it with other runners, and walks through a complete Java word‑count example—including dependencies, pipeline construction, transforms, and execution with DirectRunner.

Apache BeamDataflowDistributed Processing
0 likes · 8 min read
Master Apache Beam: Build a Portable Word Count Pipeline in Minutes