Tagged articles
5 articles
Page 1 of 1
DataFunSummit
DataFunSummit
May 4, 2025 · Big Data

Iceberg Table Format Practice in Huawei Terminal Cloud

This article explains how Huawei's terminal cloud adopts the Apache Iceberg table format to efficiently manage large-scale datasets, detailing its architecture, feature engineering, merge operations, LSM-based storage, schema versioning, AB testing support, catalog enhancements, and future roadmap for full lifecycle data governance.

Big DataData LakeHuawei Cloud
0 likes · 13 min read
Iceberg Table Format Practice in Huawei Terminal Cloud
ITPUB
ITPUB
Apr 18, 2024 · Databases

Essential PostgreSQL Features: Versioned Schemas, Online Migrations, Git‑Style Branching

The article outlines a developer‑focused wishlist for PostgreSQL, proposing built‑in versioned schemas, online schema changes, instant branch creation akin to Git, configurable archive tables, a flexible key‑value labeling system, and tighter Git integration, while highlighting industry examples such as Neon, Xata, Snaplet, Supabase and Bytebase.

Database DevelopmentGit integrationPostgreSQL
0 likes · 8 min read
Essential PostgreSQL Features: Versioned Schemas, Online Migrations, Git‑Style Branching
Baidu Geek Talk
Baidu Geek Talk
Apr 8, 2024 · Big Data

How RTS Platform Turns Real‑Time Data Streams into Reliable Business Value

This article analyzes the challenges of commercial real‑time data processing—such as stability, multi‑stage computation, and frequent schema changes—and explains how the RTS platform provides end‑to‑end managed solutions, auto schema handling, primary‑secondary redundancy, experiment‑first deployment, and metadata generation to unlock high‑velocity data value for advertising operations.

Big DataRTS platformReal-time Streaming
0 likes · 17 min read
How RTS Platform Turns Real‑Time Data Streams into Reliable Business Value
DataFunTalk
DataFunTalk
Sep 3, 2021 · Big Data

Building an Exabyte‑Scale Data Lake with Apache Hudi at ByteDance: Architecture, Design Choices, and Performance Optimizations

This article details ByteDance's implementation of an exabyte‑scale data lake using Apache Hudi, covering scenario requirements, engine selection, functional support, schema management, extensive performance tuning, and future directions, while also noting recruitment opportunities within the team.

Apache HudiBig DataByteDance
0 likes · 9 min read
Building an Exabyte‑Scale Data Lake with Apache Hudi at ByteDance: Architecture, Design Choices, and Performance Optimizations