ByteDance Cloud Native
Author

Sharing ByteDance's cloud-native technologies, technical practices, and developer events.

39 Articles
Recent Articles

ByteDance Cloud Native
Oct 11, 2023 · Cloud Native

How Katalyst Memory Advisor Optimizes Kubernetes Memory Management in Mixed Workloads

This article explains the challenges of memory management in mixed Kubernetes workloads and introduces ByteDance's open‑source Katalyst Memory Advisor. It details Kubernetes‑native allocation and reclamation mechanisms, outlines the advisor's architecture and plugins, and describes its interference detection and multi‑level mitigation strategies for improving memory utilization and service quality.

Katalyst · Kubernetes · Resource Optimization
19 min read
ByteDance Cloud Native
Aug 15, 2023 · Cloud Native

What’s New in Katalyst v0.3.0? Core Enhancements Explained

Katalyst v0.3.0 introduces major upgrades, including KCNR API enhancements for bandwidth isolation, a more extensible task and async‑execution framework, advanced mixed‑deployment controls, load‑aware resource prediction, and concurrent unit testing, all aimed at improving cloud‑native resource management efficiency.

Katalyst · Kubernetes · Resource Management
4 min read
ByteDance Cloud Native
Aug 9, 2023 · Cloud Native

How Volcano Engine’s New GPU Sharing Scheduler Boosts AI Workloads by 500%

This article explains Volcano Engine's next‑generation GPU sharing scheduling technology, detailing the two‑layer scheduler, card‑level bin‑pack/spread strategies, system architecture, API definitions, and optimization algorithms that together increase GPU deployment density by over 500% and improve utilization by more than 50% for AI workloads.

GPU Scheduling · Kubernetes · mGPU
13 min read
ByteDance Cloud Native
Jun 13, 2023 · Artificial Intelligence

How Ray and Cloud‑Native Tech Supercharge Large‑Model Offline Inference

This article explains the challenges of large‑model offline (batch) inference, such as GPU memory limits and distributed scheduling, and shows how Ray's cloud‑native architecture, model partitioning, and Ray Datasets can be combined to build efficient, elastic inference frameworks deployed with KubeRay.

Distributed Computing · GPU Memory · Ray
18 min read
ByteDance Cloud Native
Apr 20, 2023 · Cloud Native

How Dragonfly Accelerates Image Distribution and Scales Kubernetes Batch Processing

At KubeCon + CloudNativeCon Europe 2023 in Amsterdam, Volcano Engine and ByteDance presented two technical sessions covering Dragonfly's P2P image distribution best practices and large‑scale Kubernetes batch processing strategies, offering deep insights and real‑world implementations for cloud‑native developers.

Batch Processing · Dragonfly · Image Distribution
4 min read