Tag

elastic scheduling

0 views collected around this technical thread.

Baidu Tech Salon
Baidu Tech Salon
Jun 17, 2025 · Operations

How Baidu Scaled Its Vertical Search: Elastic Scheduling and Data Management Secrets

This article explains how Baidu's vertical search platform tackled massive data growth and scaling challenges by redesigning its data management system, introducing elastic scheduling, decoupling ETCD access, implementing auto‑scaling, and advancing shard expansion to improve performance, stability, and cost efficiency.

ETCDShardingauto scaling
0 likes · 18 min read
How Baidu Scaled Its Vertical Search: Elastic Scheduling and Data Management Secrets
vivo Internet Technology
vivo Internet Technology
Apr 16, 2025 · Big Data

Offline Mixed Deployment of Spark Tasks on Kubernetes: Containerization, Scheduling, and Elastic Resource Management

The article explains how the vivo Internet Big Data team containerized offline Spark jobs and deployed them with the Spark Operator on a mixed online‑offline Kubernetes cluster, using elastic scheduling and resource‑over‑subscription to boost CPU utilization by 30‑40% and handle over 100,000 daily tasks.

Big DataContainerizationKubernetes
0 likes · 36 min read
Offline Mixed Deployment of Spark Tasks on Kubernetes: Containerization, Scheduling, and Elastic Resource Management
AntTech
AntTech
Oct 16, 2024 · Artificial Intelligence

CNCC2024 Forum: Industry‑Academia Collaboration for Scientific Exploration and the CCF‑Ant Research Fund Release

The CNCC2024 forum held on October 24‑26 in Hangzhou showcases industry‑academia‑research integration through keynote speeches, speaker introductions, and detailed abstracts on topics such as AI, privacy computing, graph machine learning, and elastic cloud scheduling, while announcing the CCF‑Ant Research Fund and related initiatives.

AIelastic schedulinggraph-ml
0 likes · 10 min read
CNCC2024 Forum: Industry‑Academia Collaboration for Scientific Exploration and the CCF‑Ant Research Fund Release
DataFunSummit
DataFunSummit
Sep 25, 2022 · Big Data

Practical Optimizations and Resource Management of Hadoop YARN at Xiaomi

This article shares Xiaomi's internal practices of Hadoop YARN, covering scheduling and resource optimization, elastic scheduling, node overcommit handling, federation architecture, metadata warehouse construction, and future plans to improve cluster utilization and cost efficiency.

Big DataHadoopResource Scheduling
0 likes · 20 min read
Practical Optimizations and Resource Management of Hadoop YARN at Xiaomi
DataFunSummit
DataFunSummit
Jul 1, 2022 · Big Data

Exploring and Implementing Elastic Scheduling for Xiaomi Hadoop YARN

Shilong Fei from Xiaomi Data Platform presents an in‑depth exploration of elastic scheduling for Hadoop YARN, covering background, design of resource pools, auto‑scaling architecture, challenges such as job stability and user transparency, achieved cost reductions, and future plans for further optimization.

Big DataCost OptimizationHadoop
0 likes · 20 min read
Exploring and Implementing Elastic Scheduling for Xiaomi Hadoop YARN
DataFunTalk
DataFunTalk
May 21, 2022 · Big Data

Exploring and Implementing Elastic Scheduling for Xiaomi Hadoop YARN

This talk presents Xiaomi's design and deployment of an elastic scheduling system for Hadoop YARN, covering background analysis, resource‑pool strategy, auto‑scaling architecture, stability challenges, label‑based resource isolation, Spark shuffle handling, cost‑saving results and future plans.

Big DataHadoopYARN
0 likes · 16 min read
Exploring and Implementing Elastic Scheduling for Xiaomi Hadoop YARN
AntTech
AntTech
Jul 13, 2020 · Artificial Intelligence

ElasticDL: An Open‑Source Distributed Deep Learning Framework with Elastic Scheduling

ElasticDL is an open‑source distributed deep learning framework built on TensorFlow 2.x and Kubernetes that simplifies programming by letting users define models with the Keras API, while providing elastic scheduling, fault tolerance, and significant performance gains demonstrated through extensive benchmarks.

Distributed Deep LearningElasticDLKubernetes
0 likes · 19 min read
ElasticDL: An Open‑Source Distributed Deep Learning Framework with Elastic Scheduling
AntTech
AntTech
Sep 11, 2019 · Artificial Intelligence

ElasticDL: An Open‑Source Elastic Deep Learning System Built on TensorFlow 2.0 and Kubernetes

ElasticDL, the first industry‑level open‑source system for elastic deep learning on TensorFlow, leverages Kubernetes‑native scheduling, fault‑tolerance, and TensorFlow 2.0 Eager Execution to dramatically improve cluster utilization, simplify distributed training, and integrate seamlessly with tools like Kubeflow and SQLFlow.

AI infrastructureDistributed Deep LearningElasticDL
0 likes · 13 min read
ElasticDL: An Open‑Source Elastic Deep Learning System Built on TensorFlow 2.0 and Kubernetes
JD Retail Technology
JD Retail Technology
Aug 27, 2018 · Databases

JD.com's Database Automation and ContainerDB: Evolution of Intelligent Database Management

This article details JD.com's database automation journey, covering their intelligent database management platform (DBS), automated deployment systems, intelligent analysis and diagnostics, and the evolution from traditional containerization to ContainerDB, a smart database service with elastic scheduling and resource optimization.

Backup and RecoveryContainerDBDatabase Automation
0 likes · 22 min read
JD.com's Database Automation and ContainerDB: Evolution of Intelligent Database Management
Efficient Ops
Efficient Ops
May 16, 2016 · Cloud Native

How JD Scaled to 100,000 Docker Containers: Lessons in Cloud‑Native Operations

This article details JD.com's journey from physical servers to a massive Docker‑based cloud‑native platform, covering challenges, architecture, elastic scheduling, monitoring, and resource‑driven operations that support tens of thousands of containers across multiple data centers.

Dockercloud nativeelastic scheduling
0 likes · 26 min read
How JD Scaled to 100,000 Docker Containers: Lessons in Cloud‑Native Operations