Big Data Technology Tribe
Big Data Technology Tribe
Nov 23, 2025 · Artificial Intelligence

How Ray Data Accelerates AI Workloads with Streaming Execution

Ray Data is a scalable library built on Ray that offers high‑performance, streaming‑execution APIs for AI workloads, enabling efficient batch inference, data preprocessing, and training data ingestion across CPU and GPU resources, while supporting diverse data formats and seamless integration with popular frameworks.

AI data processingPythonRay Data
0 likes · 11 min read
How Ray Data Accelerates AI Workloads with Streaming Execution
Volcano Engine Developer Services
Volcano Engine Developer Services
Mar 5, 2025 · Artificial Intelligence

How DeepSeek Smallpond Powers AI Data Processing with Ray and DuckDB

This article introduces DeepSeek Smallpond, a lightweight yet high‑performance AI data‑processing engine built on Ray and DuckDB, explains its dual Dataframe and LogicalPlan APIs, showcases integration with Volcano Engine's AI Data Lake LAS, and provides practical code examples for distributed processing, multimodal storage, and RAG pipelines.

AI data processingDistributed ComputingDuckDB
0 likes · 18 min read
How DeepSeek Smallpond Powers AI Data Processing with Ray and DuckDB