Tagged articles
7 articles
Page 1 of 1
Alibaba Cloud Native
Alibaba Cloud Native
Oct 24, 2023 · Cloud Native

Boost Cluster Efficiency with Koordinator’s K8s‑YARN Co‑Location Solution

Koordinator extends its open‑source container scheduler to enable seamless co‑location of Kubernetes Pods and Hadoop YARN tasks, allowing over‑provisioned batch resources to be shared without modifying YARN, and has delivered up to 10 % CPU utilization gains and sub‑1 % eviction rates in Xiaohongshu’s production clusters.

Cluster SchedulingKubernetesResource Management
0 likes · 9 min read
Boost Cluster Efficiency with Koordinator’s K8s‑YARN Co‑Location Solution
DataFunSummit
DataFunSummit
Sep 2, 2023 · Big Data

Practical Experience of Bilibili's Big Data Cluster Mixed Deployment Architecture

This article details Bilibili's offline big‑data cluster challenges, the mixed‑deployment architecture that combines offline and online resources, the Amiya service's over‑commit and eviction mechanisms, performance optimizations, monitoring strategies, and future plans to further improve resource utilization and scheduling.

AmiyaBig DataBilibili
0 likes · 14 min read
Practical Experience of Bilibili's Big Data Cluster Mixed Deployment Architecture
Meituan Technology Team
Meituan Technology Team
Feb 17, 2022 · Cloud Native

Meituan's Cloud‑Native Cluster Scheduling System: Design, Challenges, and Future Directions

Meituan’s cloud‑native cluster scheduling system, built on a customized Kubernetes engine, unifies multi‑cluster management, improves CPU utilization, reduces costs, and enhances stability by balancing throughput, complexity, and reliability while addressing large‑scale deployment, fault‑tolerance, and dynamic resource allocation challenges.

Cloud NativeCluster SchedulingKubernetes
0 likes · 21 min read
Meituan's Cloud‑Native Cluster Scheduling System: Design, Challenges, and Future Directions
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 4, 2021 · Cloud Computing

How Partitioned Synchronization Scales Alibaba’s Massive Cloud Clusters

At USENIX ATC2021, Alibaba Cloud’s Fuxi 2.0 team presented a best‑paper‑award research showing how a partitioned‑synchronization (ParSync) scheduling architecture dramatically reduces conflicts and latency in ultra‑large production clusters, balancing efficiency, quality, and fairness without adding resources.

Cluster SchedulingResource Managementcloud computing
0 likes · 17 min read
How Partitioned Synchronization Scales Alibaba’s Massive Cloud Clusters
Java Architect Essentials
Java Architect Essentials
Aug 20, 2020 · Cloud Native

Borg’s Resource Management, Isolation, and Lessons for Cloud‑Native Systems

This article examines Borg’s strategies for efficient machine utilization, fine‑grained resource allocation, and isolation—including cell compaction, resource reclamation, and performance isolation—provides experimental results and comparisons, and distills lessons learned to inform the design of cloud‑native systems such as Kubernetes.

BorgCluster SchedulingKubernetes
0 likes · 27 min read
Borg’s Resource Management, Isolation, and Lessons for Cloud‑Native Systems
Meituan Technology Team
Meituan Technology Team
Sep 12, 2019 · Cloud Native

Meituan HULK: Cloud‑Native Container Cluster Management and Scheduling Practices

Meituan’s HULK platform evolved from an OpenStack‑based scheduler to a Kubernetes‑native container cluster manager, integrating service governance, release, CMDB, and monitoring to automate VM‑to‑container migration, improve resource utilization, and deliver elastic, policy‑driven scheduling and scaling with reduced troubleshooting time and higher SLA compliance.

Cloud NativeCluster SchedulingKubernetes
0 likes · 13 min read
Meituan HULK: Cloud‑Native Container Cluster Management and Scheduling Practices
Qunar Tech Salon
Qunar Tech Salon
May 14, 2015 · Operations

Key Takeaways from Google’s Borg Paper: Resource Management and Scheduling Insights

The article reviews Google’s Borg paper, highlighting how Borg distinguishes production and non‑production tasks, manages jobs and containers, improves utilization through mixed‑workload scheduling, enforces isolation via overload and over‑commitment controls, and compares Borg’s approach to other cluster managers.

BorgCluster SchedulingResource Management
0 likes · 8 min read
Key Takeaways from Google’s Borg Paper: Resource Management and Scheduling Insights