How Ant Group’s Ragent Leverages Ray for Scalable AI Agents

This article introduces Ant Group’s Ragent, a Ray‑based distributed agent framework, covering its background, motivation, and design—including core modules like Profile, Memory, Planning, and Action—that enable large‑language‑model agents to operate at massive scale.

DataFunSummit
DataFunSummit
DataFunSummit
How Ant Group’s Ragent Leverages Ray for Scalable AI Agents

Background

Ray, the open‑source distributed framework originally built for large‑model training at OpenAI, has been adopted by Ant Group since 2017. Ant contributed over 26% of Ray’s core code, becoming the second‑largest contributor worldwide, and now runs more than 1.5 million CPU cores in production while maintaining the Ray community in China.

Motivation

From 2018 to 2022 Ant built several Ray‑based engines—Geaflow for flow‑graph computation, Realtime and Mobius for streaming and ML training, and Mars for inference and scientific computing. The team also pioneered a Multi‑Tenant architecture that the Ray community only began to consider later. In the 2023‑2024 era of large models, Ant delivered a Unified AI Serving framework that integrates offline, online, inference, and deployment workloads for its massive core business.

Design & Implementation

The Ragent framework is organized around four essential modules for LLM‑based agents:

Profile : Defines the agent’s persona (e.g., a gentle travel assistant) and the roles it can play.

Memory : Consists of Knowledge (domain and prior knowledge) and Experience (historical dialogues, user queries, reasoning steps, and action outcomes) to help the agent learn from past interactions.

Planning : Decomposes complex tasks into manageable subtasks using algorithms such as Chain‑of‑Thought or Tree‑of‑Thought, similar to flow‑chart design in programming.

Action : Executes real‑world tasks based on the plan and memory. A key feature is Function Calling, which lets the model invoke external services or even control physical devices like robotic arms.

These four components constitute the core of a large‑language‑model agent capable of influencing the real world beyond pure text generation.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

RayAnt GroupRagent
DataFunSummit
Written by

DataFunSummit

Official account of the DataFun community, dedicated to sharing big data and AI industry summit news and speaker talks, with regular downloadable resource packs.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.