ByteDance Data Platform
Author

ByteDance Data Platform

The ByteDance Data Platform team empowers all ByteDance business lines by lowering data‑application barriers, aiming to build data‑driven intelligent enterprises, enable digital transformation across industries, and create greater social value. Internally it supports most ByteDance units; externally it delivers data‑intelligence products under the Volcano Engine brand to enterprise customers.

78
Articles
0
Likes
187
Views
0
Comments
Recent Articles

Latest from ByteDance Data Platform

78 recent articles
ByteDance Data Platform
ByteDance Data Platform
May 8, 2024 · Backend Development

How DataTester’s Architecture Upgrade Uses DDD to Tame Code Complexity

DataTester’s A/B testing platform underwent a comprehensive architectural overhaul, applying domain‑driven design, modular refactoring, automated validation, and dependency inversion to reduce change amplification, cognitive load, and unknown unknowns, ultimately improving code readability, maintainability, scalability, and development efficiency across its lifecycle.

A/B testingDDDcode refactoring
0 likes · 29 min read
How DataTester’s Architecture Upgrade Uses DDD to Tame Code Complexity
ByteDance Data Platform
ByteDance Data Platform
Mar 6, 2024 · Databases

How ByteHouse Boosted Douyin’s Interest Circle Queries by 100×

This article explains how Douyin rebuilt its interest‑circle platform by replacing MySQL with the columnar OLAP engine ByteHouse, achieving roughly a hundred‑fold improvement in query speed, lower hardware costs, and seamless horizontal scalability for massive daily data volumes.

ByteHouseColumnarStorageDataWarehouse
0 likes · 10 min read
How ByteHouse Boosted Douyin’s Interest Circle Queries by 100×
ByteDance Data Platform
ByteDance Data Platform
Feb 21, 2024 · Big Data

How ByteDance’s DataLeap Automates Big Data Governance and Boosts Performance

ByteDance’s DataLeap suite tackles the complexities of large‑scale data platform governance by identifying manual tuning pain points, deploying automated rule‑engine recommendations, and optimizing resource allocation, ultimately improving stability, reducing costs, and enhancing overall system health across thousands of Spark, Flink and other tasks.

Resource Optimizationdata governancereal-time monitoring
0 likes · 14 min read
How ByteDance’s DataLeap Automates Big Data Governance and Boosts Performance
ByteDance Data Platform
ByteDance Data Platform
Jan 31, 2024 · Artificial Intelligence

How A/B Testing Powers Continuous Improvement in Recommendation Systems

This article explains the role of A/B experiments in recommendation systems, outlines their workflow, shares practical tips and parameter design strategies, and demonstrates how to use experiment parameters and feature flags for efficient testing, optimization, and full‑scale deployment.

A/B testingexperiment parametersfeature flag
0 likes · 15 min read
How A/B Testing Powers Continuous Improvement in Recommendation Systems
ByteDance Data Platform
ByteDance Data Platform
Jan 24, 2024 · Big Data

How ByteDance Cut Billions in Event‑Tracking Costs with Smart Data Governance

This article details ByteDance's end‑to‑end event‑tracking cost governance, covering background, strategies, large‑scale data pipelines, resource challenges, control mechanisms, automated and supervised governance modes, and the substantial savings achieved through point filtering, tiered prioritization, and sampling.

Resource Optimizationcost governancedata pipeline
0 likes · 16 min read
How ByteDance Cut Billions in Event‑Tracking Costs with Smart Data Governance
ByteDance Data Platform
ByteDance Data Platform
Dec 27, 2023 · Databases

How ByteHouse Redefines Cloud‑Native Data Warehousing for Real‑Time Analytics

This article details ByteHouse's evolution from a ClickHouse‑based OLAP engine to a cloud‑native, massively parallel data warehouse, highlighting its distributed and cloud‑native architectures, enhanced table engines, HaKafka and Materialized MySQL extensions, and real‑world use cases in short‑video, marketing and gaming analytics.

ByteHouseHaKafkaReal-time analytics
0 likes · 20 min read
How ByteHouse Redefines Cloud‑Native Data Warehousing for Real‑Time Analytics
ByteDance Data Platform
ByteDance Data Platform
Nov 22, 2023 · Frontend Development

How HookStore Revamps DataWind’s Frontend Data Flow with Redux and Hooks

This article explains how DataWind’s frontend team replaced the outdated Dva‑based data flow with a new Redux‑plus‑hook solution called HookStore, detailing the pain points of complex initialization, legacy async syntax, tight module coupling, and showing migration steps, code examples, and benefits such as multi‑instance support and OpenAPI decoupling.

MigrationReActRedux
0 likes · 14 min read
How HookStore Revamps DataWind’s Frontend Data Flow with Redux and Hooks
ByteDance Data Platform
ByteDance Data Platform
Nov 1, 2023 · Big Data

How a Leading E‑Commerce Platform Solves EB‑Scale Data Governance Challenges

Facing massive data volumes and strict SLA requirements during the Double 11 shopping festival, a major e‑commerce platform built a systematic data‑governance framework that addresses quality, stability, cost, and efficiency through multi‑layered grading, digital cost models, automated tools, and full‑lifecycle management.

SLA managementbig datacost optimization
0 likes · 23 min read
How a Leading E‑Commerce Platform Solves EB‑Scale Data Governance Challenges
ByteDance Data Platform
ByteDance Data Platform
Oct 11, 2023 · Backend Development

How Volcano Engine Rebuilt Its Ad‑Testing Platform for Scalability and Reliability

This article explains how Volcano Engine identified the tangled authorization, data‑fetching, and performance problems of its advertising AB‑testing platform and refactored it by splitting services, redesigning the data model with MySQL and ClickHouse, applying DAG scheduling, time‑wheel algorithms, Domain‑Driven Design, and rigorous unit testing to achieve a more stable, extensible backend solution.

AB testingAdvertisingDAG
0 likes · 16 min read
How Volcano Engine Rebuilt Its Ad‑Testing Platform for Scalability and Reliability