ByteDance Data Platform
Author

The ByteDance Data Platform team lowers the barrier to building data applications for ByteDance business lines, with the aim of building data‑driven intelligent enterprises, enabling digital transformation across industries, and creating greater social value. Internally it supports most ByteDance units; externally it delivers data‑intelligence products to enterprise customers under the Volcano Engine brand.

78 articles · 0 likes · 187 views · 0 comments
Recent Articles

ByteDance Data Platform
Feb 18, 2022 · Frontend Development

How ByteDance’s Front‑End Team Built High‑Performance Shape Word Clouds

ByteDance’s data platform front‑end team surveyed academic, commercial, and open‑source word‑cloud solutions, identified gaps in geo‑ and shape‑based clouds, and engineered a performant front‑end layout algorithm that generates customizable shape word clouds for diverse business scenarios.

Data Visualization · algorithm · shape cloud
0 likes · 11 min read
ByteDance Data Platform
Feb 16, 2022 · Frontend Development

Exploring the Evolution and Design Space of Word Clouds: Algorithms, Layouts, and Interactions

This article surveys academic, commercial, and open‑source word‑cloud solutions, explains the underlying algorithms, visual encodings, layout strategies, interaction techniques, and classifications, and discusses the strengths, limitations, and future directions of word‑cloud visualization.

Visualization · algorithm · interaction
0 likes · 19 min read
ByteDance Data Platform
Jan 24, 2022 · Databases

Accelerating ClickHouse LowCardinality: Merge Optimizations & Auto Fallback

This article details how ByteDance’s ClickHouse UBA edition improves dictionary encoding for low‑cardinality columns by redesigning the Part‑merge process, introducing a single‑dictionary merge, and implementing an automatic fallback for high‑cardinality columns, resulting in significant storage savings and query‑performance gains across large‑scale applications.

ClickHouse · Dictionary Encoding · LowCardinality
0 likes · 12 min read
ByteDance Data Platform
Jan 17, 2022 · Big Data

How ByteHouse Scales Real‑Time Analytics on ClickHouse: Challenges & Solutions

This article details ByteHouse’s evolution from ClickHouse, presenting two real‑time analytics use cases, the technical selection process, performance bottlenecks such as write throughput and Kafka consumption, and the engineered solutions—including asynchronous indexing, multi‑threaded Kafka engines, and enhanced Buffer engines—that enable reliable, high‑throughput data processing at massive scale.

ByteHouse · ClickHouse · Kafka
0 likes · 11 min read
ByteDance Data Platform
Jan 14, 2022 · Product Management

Why A/B Testing Matters: Theory, ByteDance Architecture & Best Practices

This article explains why A/B testing is crucial for data‑driven product decisions, outlines ByteDance’s A/B testing system architecture across multiple layers, describes client‑ and server‑side experiment workflows, shares statistical best practices, presents real‑world case studies of hypothesis generation and evaluation, and looks at future industry trends.

A/B testing · ByteDance · data-driven
0 likes · 15 min read
ByteDance Data Platform
Dec 31, 2021 · Big Data

How ByteDance Leverages Hudi for a Real‑Time Data Lake Platform

This article introduces ByteDance’s real‑time data lake platform built on Apache Hudi, covering Hudi fundamentals, table types, indexing, practical use cases, platform optimizations, and future roadmap, illustrating how the system enables low‑latency, scalable analytics across batch and streaming workloads.

Hudi · lakehouse · metadata management
0 likes · 11 min read