ByteDance Data Platform
Author

ByteDance Data Platform

The ByteDance Data Platform team empowers all ByteDance business lines by lowering data‑application barriers, aiming to build data‑driven intelligent enterprises, enable digital transformation across industries, and create greater social value. Internally it supports most ByteDance units; externally it delivers data‑intelligence products under the Volcano Engine brand to enterprise customers.

78
Articles
0
Likes
187
Views
0
Comments
Recent Articles

Latest from ByteDance Data Platform

78 recent articles
ByteDance Data Platform
ByteDance Data Platform
Aug 16, 2023 · Operations

How LeKe Scaled 1,200 Gyms Using Data‑Driven Ops, Agile Testing & AI Recommendations

In an interview with LeKe CTO Chengshi, the company’s rapid growth is attributed to three data‑powered capabilities—fine‑grained user operation, agile A/B‑testing, and AI‑driven personalized recommendation—enabled by Volcano Engine’s data platform and the data‑flywheel concept.

A/B testingdata-driven operationsdigital transformation
0 likes · 13 min read
How LeKe Scaled 1,200 Gyms Using Data‑Driven Ops, Agile Testing & AI Recommendations
ByteDance Data Platform
ByteDance Data Platform
May 29, 2023 · Databases

Which Open‑Source OLAP Engine Wins the TPC‑DS Benchmark? A Deep Performance Comparison

Using the TPC‑DS benchmark’s 99 queries on a 1 TB dataset, this study evaluates the performance of four open‑source OLAP engines—ClickHouse, Doris, Presto, and ByConity—across basic, join, aggregation, subquery, and window‑function scenarios, revealing ByConity’s superior speed and the limitations of ClickHouse.

ByConityClickHouseOLAP
0 likes · 12 min read
Which Open‑Source OLAP Engine Wins the TPC‑DS Benchmark? A Deep Performance Comparison
ByteDance Data Platform
ByteDance Data Platform
Mar 22, 2023 · Backend Development

How DataTester’s Open Platform Empowers Customizable A/B Testing for Enterprises

DataTester’s Open Platform provides a configurable, extensible A/B testing solution that integrates via OpenAPI, SDKs, and a visual integration console, enabling enterprises to customize experiments, reduce development costs, and streamline deployment while supporting diverse business needs through modular components and a material ecosystem.

A/B testingDataTesterOpenAPI
0 likes · 12 min read
How DataTester’s Open Platform Empowers Customizable A/B Testing for Enterprises
ByteDance Data Platform
ByteDance Data Platform
Feb 15, 2023 · Databases

How ByteHouse Powers Real‑Time Data Warehousing at Scale

ByteHouse, a cloud‑native data warehouse built on ClickHouse, delivers ultra‑fast real‑time and massive offline analytics with elastic scaling, addressing business needs in ByteDance and the financial sector through optimized architecture, ROI‑driven monitoring, and comprehensive operational tools.

ByteHouseClickHouseReal-time Data Warehouse
0 likes · 16 min read
How ByteHouse Powers Real‑Time Data Warehousing at Scale
ByteDance Data Platform
ByteDance Data Platform
Jan 11, 2023 · Big Data

How EMR Stateless Transforms Big Data with Transient, Stateless Clusters

This article explains the concept of transient clusters and the Stateless architecture in Volcano Engine's EMR platform, compares Stateless with traditional Stateful approaches, outlines its evolution, core components, elastic scaling features, and the business value of cost‑effective, on‑demand big‑data processing.

EMRTransient Clustercloud-native
0 likes · 17 min read
How EMR Stateless Transforms Big Data with Transient, Stateless Clusters
ByteDance Data Platform
ByteDance Data Platform
Jan 4, 2023 · Databases

How ByteHouse Enhances ClickHouse with Resource Isolation and High Availability

This article explains how ByteHouse, an enhanced version of ClickHouse used at ByteDance, adds full upsert support, multi‑table joins, high‑availability features, and, most importantly, a Resource Group mechanism that provides fine‑grained CPU, memory, and concurrency isolation to improve query performance and stability.

ByteHouseClickHouseConcurrency Control
0 likes · 8 min read
How ByteHouse Enhances ClickHouse with Resource Isolation and High Availability
ByteDance Data Platform
ByteDance Data Platform
Dec 28, 2022 · Big Data

How Cloud Data Warehouses Are Shaping the Future of Big Data and DataOps

This article examines the four‑stage evolution of data warehouses, highlights the cost‑effective, scalable advantages of cloud‑native warehouses, explores the rapid growth of data‑management infrastructure, and discusses the emerging practices of DataOps and AI integration that are redefining modern data stacks.

AIData ManagementDataOps
0 likes · 15 min read
How Cloud Data Warehouses Are Shaping the Future of Big Data and DataOps
ByteDance Data Platform
ByteDance Data Platform
Nov 16, 2022 · Big Data

How ByteDance’s Data Lake Powers Near‑Real‑Time E‑Commerce Analytics

This article explains ByteDance’s data lake technology, its Apache Hudi‑based features, near‑real‑time architecture, and practical e‑commerce use cases such as marketing promotion, traffic diagnosis, logistics monitoring, risk governance, and operational monitoring, while outlining future challenges and plans.

Apache HudiBig Data Architecturedata lake
0 likes · 15 min read
How ByteDance’s Data Lake Powers Near‑Real‑Time E‑Commerce Analytics