Tagged articles

Fluid

20 articles · Page 1 of 1

Jul 11, 2025 · Cloud Native

How Alibaba Cloud’s AI Infra Innovations Are Transforming Kubernetes Workloads

This article summarizes Alibaba Cloud’s key technical contributions at KubeCon China 2025, covering AI‑focused Kubernetes optimizations, Argo Workflows enhancements, storage strategies for large models, Fluid’s data orchestration, multi‑tenant security, and the RoleBasedGroup framework for PD‑separated AI inference.

AI InfrastructureArgo WorkflowsFluid

0 likes · 20 min read

How Alibaba Cloud’s AI Infra Innovations Are Transforming Kubernetes Workloads

Alibaba Cloud Infrastructure

Jun 26, 2025 · Cloud Native

How Fluid Enables Cloud‑Native Elastic Data for AI Workloads

Fluid introduces a cloud‑native elastic data abstraction that lets AI workloads efficiently access, manage, and accelerate heterogeneous data sources across serverful and serverless environments, offering unified Dataset, Runtime, and DataOperation concepts, and has been recognized by CNCF’s 2024 Technology Radar.

AI workloadsCNCFCloud Native

0 likes · 9 min read

How Fluid Enables Cloud‑Native Elastic Data for AI Workloads

Alibaba Cloud Infrastructure

Mar 5, 2025 · Cloud Native

Using Fluid Cloud‑Native Data Caching to Boost Performance and Elasticity of a Quantitative Research Platform on Alibaba Cloud

This article describes how JoinQuant built a cloud‑native quantitative research platform on Alibaba Cloud, identified performance, cost, data‑management, and security challenges, and solved them with Fluid’s JindoRuntime data‑caching, elastic scaling, and Python‑driven workflows, achieving dramatic speed and cost improvements.

Cloud NativeData CachingElastic Scaling

0 likes · 18 min read

Using Fluid Cloud‑Native Data Caching to Boost Performance and Elasticity of a Quantitative Research Platform on Alibaba Cloud

Alibaba Cloud Infrastructure

Jan 17, 2025 · Artificial Intelligence

Elastic Scaling of Large Language Model Inference on Alibaba Cloud ACK with Knative, ResourcePolicy, and Fluid

This article explains how to reduce inference cost and improve performance for large language models on Alibaba Cloud ACK by using Knative's request‑based autoscaling, custom ResourcePolicy priority scheduling, and Fluid data‑caching to achieve elastic scaling, resource pre‑emption, and faster model loading.

Elastic ScalingFluidKnative

0 likes · 22 min read

Elastic Scaling of Large Language Model Inference on Alibaba Cloud ACK with Knative, ResourcePolicy, and Fluid

Alibaba Cloud Big Data AI Platform

Jan 6, 2025 · Cloud Native

How Fluid Enables Seamless Dynamic Dataset Mounting for Cloud‑Native AI Development

PAI‑DSW leverages the Fluid project to provide a cloud‑native AI development platform where data scientists can dynamically mount and unmount OSS datasets on running Kubernetes pods without restarting, improving workflow efficiency and addressing the challenges of heterogeneous data source management in AI engineering.

Cloud NativeData EngineeringFluid

0 likes · 18 min read

How Fluid Enables Seamless Dynamic Dataset Mounting for Cloud‑Native AI Development

Alibaba Cloud Native

Mar 23, 2024 · Cloud Native

Boost AI/Big Data Pipelines on Kubernetes with Fluid and Vineyard: A Hands‑On Guide

This article explains the performance and development challenges of end‑to‑end AI/Big Data workflows on Kubernetes and shows how combining Fluid’s data orchestration with Vineyard’s zero‑copy sharing can dramatically improve efficiency, followed by a step‑by‑step tutorial with code examples.

AIBig DataData Orchestration

0 likes · 15 min read

Boost AI/Big Data Pipelines on Kubernetes with Fluid and Vineyard: A Hands‑On Guide

Alibaba Cloud Native

Feb 21, 2024 · Cloud Native

How Fluid & JindoCache Accelerate Large‑Scale AI Training in a Cloud‑Native Environment

This article examines the challenges of data‑intensive AI training on heterogeneous cloud‑native infrastructure and explains how the Fluid framework combined with JindoCache and KubeDL provides distributed caching, metadata acceleration, and seamless POSIX access to dramatically improve I/O performance, GPU utilization, and cost efficiency.

AI trainingData CachingFluid

0 likes · 18 min read

How Fluid & JindoCache Accelerate Large‑Scale AI Training in a Cloud‑Native Environment

Alibaba Cloud Native

Jun 23, 2023 · Cloud Native

Accelerating LLM Inference on Alibaba Cloud with KServe and Fluid

This guide explains how to deploy large language models on Alibaba Cloud's ACK using KServe for serverless inference, integrates Fluid for distributed data caching to cut cold‑start latency, provides step‑by‑step commands, performance benchmarks, and practical tips for production‑grade AI model serving.

Cloud NativeFluidKServe

0 likes · 22 min read

Accelerating LLM Inference on Alibaba Cloud with KServe and Fluid

Alibaba Cloud Native

Feb 17, 2023 · Cloud Native

How Fluid + JuiceFSRuntime Powers Scalable Cloud‑Native Quantitative Research

This article explains how Metabit Trading built a cloud‑native quantitative research platform using Fluid and JuiceFSRuntime to achieve elastic compute, high‑throughput data caching, and cost‑effective scaling for AI‑driven trading strategies.

Cloud NativeData CachingElastic Scaling

0 likes · 20 min read

How Fluid + JuiceFSRuntime Powers Scalable Cloud‑Native Quantitative Research

Alibaba Cloud Native

Sep 20, 2022 · Cloud Native

How Fluid Accelerates Data‑Intensive Serverless Workloads on Alibaba ASK

This guide explains how Fluid, a Kubernetes‑native data orchestration engine, can be deployed on Alibaba Serverless Kubernetes (ASK) to cache and pre‑warm large datasets from OSS, enabling elastic bandwidth, reducing latency, and cutting costs for data‑intensive serverless applications.

ASKData CachingFluid

0 likes · 19 min read

How Fluid Accelerates Data‑Intensive Serverless Workloads on Alibaba ASK

DataFunTalk

Aug 20, 2022 · Artificial Intelligence

Atlas Supercomputing Platform: Architecture, Alluxio‑Fluid Integration, and Performance Improvements for AI Workloads

The article presents CloudKnow's Atlas supercomputing platform, detailing its AI‑focused architecture, early storage and bandwidth challenges, the integration of Alluxio and Fluid for distributed caching, various business adaptations, and experimental results showing significant performance gains across speech denoising, image classification, large‑file processing, and speech recognition workloads.

AIAlluxioFluid

0 likes · 16 min read

Atlas Supercomputing Platform: Architecture, Alluxio‑Fluid Integration, and Performance Improvements for AI Workloads

Zuoyebang Tech Team

Apr 7, 2022 · Cloud Native

How Fluid Transforms Large‑Scale Data Retrieval on Kubernetes

This article explains how Zuoyebang redesigned its massive data retrieval platform by separating compute and storage with the Fluid project on Kubernetes, achieving minute‑level hundred‑TB distribution, elastic caching, and improved stability for real‑time educational services.

Compute-Storage SeparationData RetrievalFluid

0 likes · 8 min read

How Fluid Transforms Large‑Scale Data Retrieval on Kubernetes

Architecture Digest

Dec 8, 2021 · Cloud Native

Implementing Compute-Storage Separation for Large-Scale Retrieval Systems Using Fluid

This article describes the challenges of operating massive, TB‑scale retrieval clusters at Zuoyebang, and presents a Fluid‑based compute‑storage separation architecture that improves data distribution, update efficiency, scalability, and stability, enabling containerized search services to be managed like regular stateless workloads.

Compute-Storage SeparationData OrchestrationFluid

0 likes · 13 min read

Implementing Compute-Storage Separation for Large-Scale Retrieval Systems Using Fluid

Alibaba Cloud Native

Sep 13, 2021 · Artificial Intelligence

How Fluid + JindoRuntime Supercharged Autonomous Driving Model Training

This article details how the Fluid CNCF project combined with JindoRuntime was used to overcome storage‑compute separation bottlenecks in an autonomous‑driving machine‑learning platform, achieving up to 300% faster training, reduced OSS bandwidth pressure, and higher GPU utilization through distributed caching on Kubernetes.

Data OrchestrationFluidJindoRuntime

0 likes · 13 min read

How Fluid + JindoRuntime Supercharged Autonomous Driving Model Training

Alibaba Cloud Native

Jun 3, 2021 · Artificial Intelligence

How Weibo Boosted Deep Learning Training Speed 18× with Fluid and JindoRuntime

Weibo’s deep learning platform faced severe latency and stability issues when accessing massive small‑file datasets via a compute‑storage‑separated architecture, so the team adopted the CNCF Fluid project with JindoRuntime, implementing a distributed cache that leverages POSIX interfaces, dramatically improving data locality, reducing HDFS load, and achieving up to 18‑fold training speedups while raising success rates from 37 % to 98 %.

Data CachingFluidJindoRuntime

0 likes · 15 min read

How Weibo Boosted Deep Learning Training Speed 18× with Fluid and JindoRuntime

Alibaba Cloud Native

May 10, 2021 · Cloud Native

What Is Fluid? A Cloud‑Native Data Orchestration and Acceleration Platform

Fluid is an open‑source cloud‑native data orchestration and acceleration system that runs on Kubernetes, offering storage‑agnostic datasets, distributed caching, intelligent scheduling, and performance optimizations for data‑intensive AI and big‑data workloads.

AIBig DataCloud Native

0 likes · 6 min read

What Is Fluid? A Cloud‑Native Data Orchestration and Acceleration Platform

Alibaba Cloud Native

Apr 2, 2021 · Cloud Native

How Fluid Turns Kubernetes into a High‑Performance Data Logistics System

This article explains how the open‑source Fluid project addresses the inefficiencies of data‑intensive AI and big‑data workloads in cloud‑native Kubernetes environments by introducing a data‑centric abstraction, dual orchestration mechanisms, and seamless integration with Alluxio to achieve faster, secure, and scalable data access.

AlluxioBig DataCloud Native

0 likes · 19 min read

How Fluid Turns Kubernetes into a High‑Performance Data Logistics System

Alibaba Cloud Native

Feb 10, 2021 · Cloud Native

Accelerate AI and Big Data Workloads on Kubernetes with Fluid’s JindoRuntime

Fluid is an open‑source Kubernetes‑native engine that orchestrates and accelerates distributed datasets for AI and big‑data workloads, and this guide explains its core concepts, the JindoRuntime implementation, performance benefits, and step‑by‑step instructions to deploy and test JindoRuntime on a K8s cluster.

AIBig DataCloud Native

0 likes · 14 min read

Accelerate AI and Big Data Workloads on Kubernetes with Fluid’s JindoRuntime

Alibaba Cloud Native

Nov 16, 2020 · Cloud Native

What’s New in Fluid 0.4? DataLoad, Small‑File Boost, HDFS Support & Multi‑Dataset Deployment

Fluid 0.4 introduces a DataLoad custom resource for declarative data pre‑warming, enhances support for massive small‑file datasets, adds HDFS‑compatible access for Spark and other big‑data frameworks, and enables mixed‑deployment of multiple datasets on a single node, all backed by significant performance gains.

AIAlluxioBig Data

0 likes · 8 min read

What’s New in Fluid 0.4? DataLoad, Small‑File Boost, HDFS Support & Multi‑Dataset Deployment

Alibaba Cloud Native

Oct 14, 2020 · Cloud Native

How Fluid v0.3 Accelerates Kubernetes PVC Access and Enhances Data Security

Fluid v0.3, the open‑source cloud‑native data acceleration platform, introduces PVC and HostPath acceleration, fine‑grained dataset permission controls, and built‑in parameter optimizations, delivering over 20% faster AI training while simplifying configuration for diverse storage back‑ends.

Access ControlCloud NativeData Acceleration

0 likes · 8 min read

How Fluid v0.3 Accelerates Kubernetes PVC Access and Enhances Data Security