Tag

Ray

0 views collected around this technical thread.

Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jun 3, 2025 · Artificial Intelligence

Deploying and Managing Ray on Alibaba Cloud ACK with KubeRay: Architecture, Code Samples, and Scheduling Strategies

This article explains how to build a flexible machine‑learning infrastructure on Alibaba Cloud ACK using Ray and KubeRay, covering Ray's core components, AI libraries, deployment options on VMs and Kubernetes, code examples for data processing, model serving, and advanced scheduling and quota management techniques.

AIAlibaba CloudKubeRay
0 likes · 17 min read
Deploying and Managing Ray on Alibaba Cloud ACK with KubeRay: Architecture, Code Samples, and Scheduling Strategies
AntData
AntData
Apr 3, 2025 · Artificial Intelligence

Ray Flow Insight: Visualizing and Debugging Distributed AI Applications

Ray Flow Insight is an Ant Group open‑source tool that visualizes Ray's distributed programming primitives—Actors, Tasks, and Objects—to turn complex reinforcement‑learning systems from opaque "black boxes" into transparent, debuggable workflows, providing logical, physical, distributed stack, and flame‑graph views for performance analysis and optimization.

AIRayRay Flow Insight
0 likes · 32 min read
Ray Flow Insight: Visualizing and Debugging Distributed AI Applications
DataFunTalk
DataFunTalk
Jan 11, 2025 · Artificial Intelligence

Ragent: Ant Group’s Ray‑Based Distributed Agent Framework

This article introduces Ragent, Ant Group’s Ray‑powered distributed agent framework, covering its background, motivation, design, implementation details, multi‑agent capabilities, and future directions for large‑model AI applications.

AGENT frameworkAIDistributed Agents
0 likes · 14 min read
Ragent: Ant Group’s Ray‑Based Distributed Agent Framework
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Nov 29, 2024 · Big Data

How ByteDance Builds Large-Scale Data Processing Pipelines for Multimodal Models with Ray

The article details ByteDance's use of Ray and RayData to construct scalable audio and video data processing pipelines for multimodal AI models, addressing challenges of massive data volume, resource constraints, and fault tolerance through pipeline design, RayCore enhancements, and custom scheduling optimizations.

AIBig DataByteDance
0 likes · 16 min read
How ByteDance Builds Large-Scale Data Processing Pipelines for Multimodal Models with Ray
Didi Tech
Didi Tech
Jan 25, 2024 · Artificial Intelligence

Ray-native XGBoost Training Platform: Architecture, Performance, and Technical Challenges

Didi’s new Ray‑native XGBoost training platform replaces the fault‑prone Spark solution with a fully Pythonic, fault‑tolerant architecture that leverages Ray’s autoscaling and gang‑scheduling, delivering 2–6× speedups, reduced failure rates, efficient sparse‑vector handling, scalable hyper‑parameter search, and improved resource utilization for large‑scale machine‑learning workloads.

Hyperparameter OptimizationRayXGBoost
0 likes · 20 min read
Ray-native XGBoost Training Platform: Architecture, Performance, and Technical Challenges
DataFunTalk
DataFunTalk
Aug 22, 2023 · Artificial Intelligence

Building Complex Distributed Systems with Ray: An AutoML Case Study and Cloud‑Native Deployment

This article explains how the Ray distributed computing engine simplifies the design, deployment, and operation of complex cloud‑native distributed systems—illustrated through an AutoML service example—by detailing system complexity, Ray’s core concepts, resource customization, runtime environments, monitoring, and ecosystem integrations.

AIAutoMLKubernetes
0 likes · 26 min read
Building Complex Distributed Systems with Ray: An AutoML Case Study and Cloud‑Native Deployment
AntTech
AntTech
Jun 27, 2023 · Artificial Intelligence

Fanglue: An Interactive System for Decision Rule Crafting in Fraud Detection

Fanglue is an interactive, web‑based rule‑development platform that integrates expert domain knowledge with distributed AI algorithms to efficiently generate and evaluate decision rules for anti‑fraud scenarios, leveraging Ray for real‑time processing and achieving VLDB‑2023 acceptance.

AIRayVLDB2023
0 likes · 10 min read
Fanglue: An Interactive System for Decision Rule Crafting in Fraud Detection
ByteDance Cloud Native
ByteDance Cloud Native
Jun 13, 2023 · Artificial Intelligence

How Ray and Cloud‑Native Tech Supercharge Large‑Model Offline Inference

This article explains the challenges of large‑model offline (batch) inference, such as GPU memory limits and distributed scheduling, and shows how Ray’s cloud‑native architecture, model partitioning, and Ray Datasets can be used to build efficient, elastic inference frameworks deployed with KubeRay.

GPU memoryRaycloud native
0 likes · 18 min read
How Ray and Cloud‑Native Tech Supercharge Large‑Model Offline Inference
AntTech
AntTech
Jan 3, 2023 · Artificial Intelligence

Ray: The Distributed Framework Powering the Next Generation of Generative AI

Ray, an open‑source distributed computing framework originally created by Berkeley's RiseLab and heavily contributed to by Ant Group, underpins many AI workloads—from privacy‑preserving federated learning to large‑scale model training for ChatGPT—making it a critical yet often overlooked engine of the generative AI revolution.

Artificial IntelligenceLarge Language ModelsOpenAI
0 likes · 7 min read
Ray: The Distributed Framework Powering the Next Generation of Generative AI
AntTech
AntTech
Mar 23, 2021 · Big Data

From MapReduce to Ray: The Evolution of Big Data Computing Engines and Career Opportunities

This article traces the history of big‑data computing engines—from early MapReduce and Hadoop through Spark, Storm, Flink, and the newer Ray—explaining their technical advances, real‑world applications in AI and finance, and why graduates should consider a career in this rapidly evolving field.

AIBig DataRay
0 likes · 16 min read
From MapReduce to Ray: The Evolution of Big Data Computing Engines and Career Opportunities
AntTech
AntTech
Mar 1, 2021 · Artificial Intelligence

Building a Fusion Engine with Ray: Ant Group’s Large‑Scale Distributed Computing Practices

The article explains how Ant Group tackles the challenge of tightly integrating multiple computing paradigms by building a Ray‑based fusion engine, detailing its architecture, features, large‑scale applications in online machine learning and parallel processing, and outlining future development and recruitment opportunities.

Ant GroupFusion EngineOnline Machine Learning
0 likes · 10 min read
Building a Fusion Engine with Ray: Ant Group’s Large‑Scale Distributed Computing Practices
AntTech
AntTech
Dec 4, 2019 · Artificial Intelligence

Ant Financial’s Online Learning System Built on Ray: Architecture, Challenges, and Future Plans

The interview details how Ant Financial transitioned from offline to online machine learning by adopting the Ray distributed engine, describing their open architecture, fusion computing approach, technical advantages, encountered pitfalls, and plans to open‑source the system for broader AI and big‑data use.

AIBig DataRay
0 likes · 15 min read
Ant Financial’s Online Learning System Built on Ray: Architecture, Challenges, and Future Plans
AntTech
AntTech
Nov 1, 2019 · Artificial Intelligence

Building a Unified Online Machine Learning Platform with Ray for Alipay’s “Collect Five Blessings” Campaign

The article describes how Alipay tackled the cold‑start, conversion, and user‑experience challenges of its time‑limited “Collect Five Blessings” activity by designing a unified online machine‑learning system based on the Ray distributed‑computing framework, emphasizing stability, efficiency, simplicity, multi‑language support, and fault‑tolerant scheduling.

AlipayOnline Machine LearningRay
0 likes · 11 min read
Building a Unified Online Machine Learning Platform with Ray for Alipay’s “Collect Five Blessings” Campaign