Tag

Cluster Optimization

1 views collected around this technical thread.

DevOps Cloud Academy
DevOps Cloud Academy
Sep 8, 2023 · Cloud Native

Kubernetes Resource Management: Concepts, Monitoring, and Optimization

This article explains Kubernetes resource management, covering compute and non‑compute resources, key mechanisms such as quotas and autoscalers, monitoring tools, and optimization strategies to improve cluster efficiency, scalability, and cost effectiveness.

Cluster OptimizationKubernetescloud native
0 likes · 19 min read
Kubernetes Resource Management: Concepts, Monitoring, and Optimization
DataFunSummit
DataFunSummit
May 4, 2022 · Big Data

NetEase Big Data Platform: HDFS Optimization and Practices

NetEase’s senior big‑data engineer shares how the company’s large‑scale data platform leverages Hadoop, HDFS, YARN and related technologies, detailing multi‑layer architecture, cross‑cloud deployment, storage optimizations, NameNode performance enhancements, RPC prioritization, and practical lessons from operating petabyte‑scale clusters.

Cluster OptimizationHDFSPerformance Tuning
0 likes · 23 min read
NetEase Big Data Platform: HDFS Optimization and Practices
DataFunTalk
DataFunTalk
Mar 30, 2022 · Big Data

NetEase Big Data Platform: HDFS Optimization and Practice

This article presents NetEase's big data platform architecture, detailing multi‑layer storage and compute design, HDFS deployment challenges, NameNode and NameSpace performance optimizations, cluster scaling strategies, data tiering, hardware upgrades, and real‑world business use cases, illustrating practical large‑scale big data engineering.

Cluster OptimizationHDFSNetEase
0 likes · 23 min read
NetEase Big Data Platform: HDFS Optimization and Practice
DataFunTalk
DataFunTalk
Mar 3, 2021 · Big Data

Kwai Scheduler: Scaling YARN for Ultra‑Large Clusters at Kuaishou

This article presents Kuaishou's large‑scale offline computing challenges and describes how the team customized YARN and built the Kwai scheduler to achieve multi‑threaded, pluggable resource scheduling for clusters of tens of thousands of nodes, supporting diverse workloads such as ETL, ad‑hoc queries, machine‑learning training, and real‑time Flink jobs.

Cluster OptimizationKwai SchedulerResource Scheduling
0 likes · 15 min read
Kwai Scheduler: Scaling YARN for Ultra‑Large Clusters at Kuaishou
DataFunTalk
DataFunTalk
Jul 5, 2020 · Big Data

ByteDance’s Optimizations to Hadoop YARN: Enhancing Utilization, Multi‑Load Scenarios, Stability, and Multi‑Region Active‑Active

This article describes ByteDance’s four‑year series of customizations to Hadoop YARN—covering utilization improvements, multi‑load scenario optimizations, stability enhancements, and multi‑region active‑active deployment—along with practical production experiences, architectural details, and future work directions.

ByteDanceCluster OptimizationHadoop
0 likes · 12 min read
ByteDance’s Optimizations to Hadoop YARN: Enhancing Utilization, Multi‑Load Scenarios, Stability, and Multi‑Region Active‑Active
Big Data Technology Architecture
Big Data Technology Architecture
Apr 24, 2020 · Databases

Best Practices for HBase Region Count and Size to Improve Cluster Stability and Performance

The article explains how maintaining an optimal number of HBase regions (typically 20‑200 per RegionServer) and appropriate region size, along with careful MemStore and compaction settings, can prevent memory pressure, reduce GC pauses, and enhance overall cluster stability and throughput.

Cluster OptimizationHBasePerformance Tuning
0 likes · 5 min read
Best Practices for HBase Region Count and Size to Improve Cluster Stability and Performance
Tencent Cloud Developer
Tencent Cloud Developer
Sep 11, 2019 · Big Data

YARN Practice and Technical Evolution at Kuaishou

Jiaoxiao Fang’s talk details Kuaishou’s YARN deployment, covering its architecture, support for offline, real‑time and ML workloads, and recent enhancements such as event‑handling stability, refined preemption, high‑throughput parallel scheduling, shuffle‑caching for small I/O, plus plans for job protection and multi‑cluster resource utilization.

Cluster OptimizationDistributed SystemsHadoop
0 likes · 16 min read
YARN Practice and Technical Evolution at Kuaishou