Tagged articles
20 articles
Page 1 of 1
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jul 11, 2025 · Cloud Native

How Alibaba Cloud’s AI Infra Innovations Are Transforming Kubernetes Workloads

This article summarizes Alibaba Cloud’s key technical contributions at KubeCon China 2025, covering AI‑focused Kubernetes optimizations, Argo Workflows enhancements, storage strategies for large models, Fluid’s data orchestration, multi‑tenant security, and the RoleBasedGroup framework for PD‑separated AI inference.

AI InfrastructureArgo WorkflowsFluid
0 likes · 20 min read
How Alibaba Cloud’s AI Infra Innovations Are Transforming Kubernetes Workloads
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jun 26, 2025 · Cloud Native

How Fluid Enables Cloud‑Native Elastic Data for AI Workloads

Fluid introduces a cloud‑native elastic data abstraction that lets AI workloads efficiently access, manage, and accelerate heterogeneous data sources across serverful and serverless environments, offering unified Dataset, Runtime, and DataOperation concepts, and has been recognized by CNCF’s 2024 Technology Radar.

AI workloadsCNCFCloud Native
0 likes · 9 min read
How Fluid Enables Cloud‑Native Elastic Data for AI Workloads
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Mar 5, 2025 · Cloud Native

Using Fluid Cloud‑Native Data Caching to Boost Performance and Elasticity of a Quantitative Research Platform on Alibaba Cloud

This article describes how JoinQuant built a cloud‑native quantitative research platform on Alibaba Cloud, identified performance, cost, data‑management, and security challenges, and solved them with Fluid’s JindoRuntime data‑caching, elastic scaling, and Python‑driven workflows, achieving dramatic speed and cost improvements.

Cloud NativeData CachingFluid
0 likes · 18 min read
Using Fluid Cloud‑Native Data Caching to Boost Performance and Elasticity of a Quantitative Research Platform on Alibaba Cloud
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jan 17, 2025 · Artificial Intelligence

Elastic Scaling of Large Language Model Inference on Alibaba Cloud ACK with Knative, ResourcePolicy, and Fluid

This article explains how to reduce inference cost and improve performance for large language models on Alibaba Cloud ACK by using Knative's request‑based autoscaling, custom ResourcePolicy priority scheduling, and Fluid data‑caching to achieve elastic scaling, resource pre‑emption, and faster model loading.

FluidInferenceKnative
0 likes · 22 min read
Elastic Scaling of Large Language Model Inference on Alibaba Cloud ACK with Knative, ResourcePolicy, and Fluid
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jan 6, 2025 · Cloud Native

How Fluid Enables Seamless Dynamic Dataset Mounting for Cloud‑Native AI Development

PAI‑DSW leverages the Fluid project to provide a cloud‑native AI development platform where data scientists can dynamically mount and unmount OSS datasets on running Kubernetes pods without restarting, improving workflow efficiency and addressing the challenges of heterogeneous data source management in AI engineering.

AI DevelopmentCloud NativeFluid
0 likes · 18 min read
How Fluid Enables Seamless Dynamic Dataset Mounting for Cloud‑Native AI Development
Alibaba Cloud Native
Alibaba Cloud Native
Feb 21, 2024 · Cloud Native

How Fluid & JindoCache Accelerate Large‑Scale AI Training in a Cloud‑Native Environment

This article examines the challenges of data‑intensive AI training on heterogeneous cloud‑native infrastructure and explains how the Fluid framework combined with JindoCache and KubeDL provides distributed caching, metadata acceleration, and seamless POSIX access to dramatically improve I/O performance, GPU utilization, and cost efficiency.

AI trainingData CachingFluid
0 likes · 18 min read
How Fluid & JindoCache Accelerate Large‑Scale AI Training in a Cloud‑Native Environment
Alibaba Cloud Native
Alibaba Cloud Native
Jun 23, 2023 · Cloud Native

Accelerating LLM Inference on Alibaba Cloud with KServe and Fluid

This guide explains how to deploy large language models on Alibaba Cloud's ACK using KServe for serverless inference, integrates Fluid for distributed data caching to cut cold‑start latency, provides step‑by‑step commands, performance benchmarks, and practical tips for production‑grade AI model serving.

Cloud NativeFluidKServe
0 likes · 22 min read
Accelerating LLM Inference on Alibaba Cloud with KServe and Fluid
DataFunTalk
DataFunTalk
Aug 20, 2022 · Artificial Intelligence

Atlas Supercomputing Platform: Architecture, Alluxio‑Fluid Integration, and Performance Improvements for AI Workloads

The article presents CloudKnow's Atlas supercomputing platform, detailing its AI‑focused architecture, early storage and bandwidth challenges, the integration of Alluxio and Fluid for distributed caching, various business adaptations, and experimental results showing significant performance gains across speech denoising, image classification, large‑file processing, and speech recognition workloads.

AIAlluxioFluid
0 likes · 16 min read
Atlas Supercomputing Platform: Architecture, Alluxio‑Fluid Integration, and Performance Improvements for AI Workloads
Zuoyebang Tech Team
Zuoyebang Tech Team
Apr 7, 2022 · Cloud Native

How Fluid Transforms Large‑Scale Data Retrieval on Kubernetes

This article explains how Zuoyebang redesigned its massive data retrieval platform by separating compute and storage with the Fluid project on Kubernetes, achieving minute‑level hundred‑TB distribution, elastic caching, and improved stability for real‑time educational services.

Compute-Storage SeparationData RetrievalFluid
0 likes · 8 min read
How Fluid Transforms Large‑Scale Data Retrieval on Kubernetes
Architecture Digest
Architecture Digest
Dec 8, 2021 · Cloud Native

Implementing Compute-Storage Separation for Large-Scale Retrieval Systems Using Fluid

This article describes the challenges of operating massive, TB‑scale retrieval clusters at Zuoyebang, and presents a Fluid‑based compute‑storage separation architecture that improves data distribution, update efficiency, scalability, and stability, enabling containerized search services to be managed like regular stateless workloads.

Compute-Storage SeparationData OrchestrationFluid
0 likes · 13 min read
Implementing Compute-Storage Separation for Large-Scale Retrieval Systems Using Fluid
Alibaba Cloud Native
Alibaba Cloud Native
Sep 13, 2021 · Artificial Intelligence

How Fluid + JindoRuntime Supercharged Autonomous Driving Model Training

This article details how the Fluid CNCF project combined with JindoRuntime was used to overcome storage‑compute separation bottlenecks in an autonomous‑driving machine‑learning platform, achieving up to 300% faster training, reduced OSS bandwidth pressure, and higher GPU utilization through distributed caching on Kubernetes.

Data OrchestrationFluidJindoRuntime
0 likes · 13 min read
How Fluid + JindoRuntime Supercharged Autonomous Driving Model Training
Alibaba Cloud Native
Alibaba Cloud Native
Jun 3, 2021 · Artificial Intelligence

How Weibo Boosted Deep Learning Training Speed 18× with Fluid and JindoRuntime

Weibo’s deep learning platform faced severe latency and stability issues when accessing massive small‑file datasets via a compute‑storage‑separated architecture, so the team adopted the CNCF Fluid project with JindoRuntime, implementing a distributed cache that leverages POSIX interfaces, dramatically improving data locality, reducing HDFS load, and achieving up to 18‑fold training speedups while raising success rates from 37 % to 98 %.

Data CachingDeep LearningDistributed Training
0 likes · 15 min read
How Weibo Boosted Deep Learning Training Speed 18× with Fluid and JindoRuntime
Alibaba Cloud Native
Alibaba Cloud Native
Apr 2, 2021 · Cloud Native

How Fluid Turns Kubernetes into a High‑Performance Data Logistics System

This article explains how the open‑source Fluid project addresses the inefficiencies of data‑intensive AI and big‑data workloads in cloud‑native Kubernetes environments by introducing a data‑centric abstraction, dual orchestration mechanisms, and seamless integration with Alluxio to achieve faster, secure, and scalable data access.

AlluxioBig DataCloud Native
0 likes · 19 min read
How Fluid Turns Kubernetes into a High‑Performance Data Logistics System
Alibaba Cloud Native
Alibaba Cloud Native
Feb 10, 2021 · Cloud Native

Accelerate AI and Big Data Workloads on Kubernetes with Fluid’s JindoRuntime

Fluid is an open‑source Kubernetes‑native engine that orchestrates and accelerates distributed datasets for AI and big‑data workloads, and this guide explains its core concepts, the JindoRuntime implementation, performance benefits, and step‑by‑step instructions to deploy and test JindoRuntime on a K8s cluster.

AIBig DataCloud Native
0 likes · 14 min read
Accelerate AI and Big Data Workloads on Kubernetes with Fluid’s JindoRuntime
Alibaba Cloud Native
Alibaba Cloud Native
Nov 16, 2020 · Cloud Native

What’s New in Fluid 0.4? DataLoad, Small‑File Boost, HDFS Support & Multi‑Dataset Deployment

Fluid 0.4 introduces a DataLoad custom resource for declarative data pre‑warming, enhances support for massive small‑file datasets, adds HDFS‑compatible access for Spark and other big‑data frameworks, and enables mixed‑deployment of multiple datasets on a single node, all backed by significant performance gains.

AIAlluxioBig Data
0 likes · 8 min read
What’s New in Fluid 0.4? DataLoad, Small‑File Boost, HDFS Support & Multi‑Dataset Deployment
Alibaba Cloud Native
Alibaba Cloud Native
Oct 14, 2020 · Cloud Native

How Fluid v0.3 Accelerates Kubernetes PVC Access and Enhances Data Security

Fluid v0.3, the open‑source cloud‑native data acceleration platform, introduces PVC and HostPath acceleration, fine‑grained dataset permission controls, and built‑in parameter optimizations, delivering over 20% faster AI training while simplifying configuration for diverse storage back‑ends.

Cloud NativeData AccelerationFluid
0 likes · 8 min read
How Fluid v0.3 Accelerates Kubernetes PVC Access and Enhances Data Security