ByteDance Data Platform
Author

The ByteDance Data Platform team supports all ByteDance business lines by lowering the barriers to data applications. It aims to build data‑driven intelligent enterprises, enable digital transformation across industries, and create greater social value. Internally, it supports most ByteDance units; externally, it delivers data‑intelligence products to enterprise customers under the Volcano Engine brand.

78 articles · 0 likes · 187 views · 0 comments
Recent Articles

Latest from ByteDance Data Platform

78 recent articles
ByteDance Data Platform
Oct 30, 2024 · Big Data

How Volcano Engine’s DataLeap Platform Transforms Data Service Management

Volcano Engine’s DataLeap platform offers a unified API service solution that transforms raw data into reliable, secure data services, featuring full lifecycle management, monitoring, permission control, rate limiting, and visual API orchestration to simplify complex data workflows and improve operational efficiency across big-data scenarios.

API orchestration · Data Service · big data
0 likes · 21 min read
ByteDance Data Platform
Oct 16, 2024 · Databases

How ByteHouse Boosted Sales Data Platform Queries Up to 16× with ACL and Optimizer

This article examines a fast‑growing company's sales data platform, outlines the data‑access pain points caused by ACL permissions, describes the migration from ClickHouse to ByteHouse, details the optimizer’s rule‑based, cost‑based, and distributed‑plan enhancements, and presents benchmark results showing query speedups of up to sixteen times.

ACL · ByteHouse · OLAP
0 likes · 16 min read
ByteDance Data Platform
Oct 9, 2024 · Big Data

Douyin’s E‑commerce Tracking Journey: From Log 1.0 to a Unified Attribution Platform

This article examines Douyin Group’s e‑commerce data‑tracking evolution, detailing the transition from early log‑free collection through Log 2.0’s failed overhaul to the streamlined Log 3.0 framework, and explains the resulting SDK, BTM/BCM management, and attribution platform that solve quality, efficiency, and analysis challenges for data engineers.

SDK · data attribution · e-commerce tracking
0 likes · 19 min read
ByteDance Data Platform
Sep 25, 2024 · Artificial Intelligence

How LLMs Power the “Find Data Assistant” for Smarter Data Retrieval

This article explains how the Volcano Engine DataLeap team leveraged large‑language models to build the “Find Data Assistant”, detailing its design, challenges, embedding‑and‑reranker enhancements, LLM‑driven semantic search, mixing architecture, and practical lessons for improving data asset management and retrieval.

Data Asset Management · Data Retrieval · Embedding
0 likes · 17 min read
ByteDance Data Platform
Sep 18, 2024 · Big Data

Apache Calcite for Multi‑Engine Metric Management: Practices & Roadmap

This article explains the technical principles and best practices of multi‑engine metric management based on Apache Calcite, covering common metric management methods, implementation details of unified SQL, virtual columns, and SQL defined functions, and outlines ByteDance’s future roadmap for extending these capabilities.

Apache Calcite · SQL Defined Function · SQL Rewrite
0 likes · 16 min read
ByteDance Data Platform
Aug 27, 2024 · Artificial Intelligence

AI-Driven BI: Achieving Zero-Barrier Data Access and Smart Insights

This article traces the evolution of business intelligence platforms from early report‑centric tools to modern AI‑enhanced, search‑driven solutions, detailing the architectural layers, high‑performance data analysis design, multi‑level aggregation, hot‑cold data tiering, and large‑model applications that enable zero‑barrier data access and intelligent insights.

Artificial Intelligence · Data Analytics · business intelligence
0 likes · 18 min read
ByteDance Data Platform
Aug 20, 2024 · Big Data

How FlinkSQL Optimizations Cut CPU Usage by Up to 60% in Streaming Jobs

This article details the FlinkSQL performance enhancements implemented by the streaming team, covering view reuse, redundant shuffle removal, multiple‑input operator redesign, long sliding‑window optimizations, and native JSON format improvements, which together deliver up to 60% CPU savings and massive core‑hour reductions.

CPU Reduction · FlinkSQL · Long Sliding Window
0 likes · 13 min read
ByteDance Data Platform
Jul 31, 2024 · Product Management

How Data‑Driven Flywheels Power User Growth: Insights from Volcengine

This article shares a data‑centric perspective on user growth, covering entropy reduction, information management, the data‑driven flywheel, A/B testing practices, retention strategies, and practical case studies that illustrate how systematic data analysis fuels sustainable product expansion.

A/B testing · Entropy Reduction · data-driven
0 likes · 16 min read
ByteDance Data Platform
May 15, 2024 · R&D Management

How ByteDance Embeds A/B Testing into Every Stage of Product Development

This article explains how ByteDance integrates data‑driven A/B testing throughout its R&D workflow—from feature design and large‑scale refactoring to bug fixes, release safety, SQL optimization, and cultural adoption—demonstrating the ROI and sustainable practices of a data‑centric development culture.

A/B testing · data-driven · experiment
0 likes · 18 min read