Tagged articles

Multi-Agent Reinforcement Learning

13 articles · Page 1 of 1

Machine Learning Algorithms & Natural Language Processing

Jun 10, 2026 · Artificial Intelligence

Beyond Orchestrating Workflows: How UnityMAS-O Trains LLM-Based Multi‑Agent Systems

UnityMAS‑O introduces a general reinforcement‑learning framework that converts predefined LLM multi‑agent workflows into trainable tasks, enabling credit assignment across roles, supporting parameter‑sharing configurations, and demonstrating significant F1 and test‑pass improvements on QA and code‑generation benchmarks.

LLMMulti-Agent Reinforcement LearningPPO

0 likes · 12 min read

Beyond Orchestrating Workflows: How UnityMAS-O Trains LLM-Based Multi‑Agent Systems

PaperAgent

Feb 26, 2026 · Artificial Intelligence

How In-Context Co‑Player Inference and LLM‑Driven Evolution Are Redefining Multi‑Agent RL

This article analyzes two recent Google papers—one introducing context‑based co‑player inference for robust multi‑agent cooperation and the other presenting AlphaEvolve, an LLM‑guided evolutionary framework that automatically discovers novel multi‑agent learning algorithms—detailing their methods, experimental findings, and broader implications for AI research.

AlphaEvolveLLM-driven algorithm discoveryMulti-Agent Reinforcement Learning

0 likes · 11 min read

How In-Context Co‑Player Inference and LLM‑Driven Evolution Are Redefining Multi‑Agent RL

PaperAgent

Feb 25, 2026 · Artificial Intelligence

How Contextual Co-Player Inference Enables Robust Multi-Agent Cooperation

These two recent Google papers advance multi‑agent reinforcement learning: one introduces contextual co‑player inference to achieve robust cooperation without explicit meta‑learning, while the other presents AlphaEvolve, a large‑language‑model‑driven evolutionary framework that automatically discovers novel MARL algorithms such as VAD‑CFR and SHOR‑PSRO.

AI researchCFRLLM-driven algorithm discovery

0 likes · 13 min read

How Contextual Co-Player Inference Enables Robust Multi-Agent Cooperation

Machine Learning Algorithms & Natural Language Processing

Feb 18, 2026 · Artificial Intelligence

Multi-Agent Communication: A Survey from MARL to Emergent Language and Large Language Models

This survey examines the evolution of multi‑agent communication—from early hand‑crafted protocols in MARL, through emergent discrete languages, to recent large‑language‑model‑driven approaches—using a unified "five W" framework to analyze who communicates, what, when, why, and how.

Large Language ModelsMulti-Agent Reinforcement Learningcommunication protocols

0 likes · 19 min read

Multi-Agent Communication: A Survey from MARL to Emergent Language and Large Language Models

AntTech

Jan 16, 2026 · Databases

Can Multi‑Agent Collaboration Automatically Tune Database Parameters with High Efficiency?

The paper presents CMA+DB, a hierarchical multi‑agent framework that automatically tunes database parameters across diverse workloads by combining classification‑based collaboration, layered training, and joint action selection, achieving superior performance, faster convergence, and strong generalization compared with existing tuning methods.

CMA+DBDatabase TuningMulti-Agent Reinforcement Learning

0 likes · 9 min read

Can Multi‑Agent Collaboration Automatically Tune Database Parameters with High Efficiency?

JD Tech

Apr 8, 2025 · Artificial Intelligence

MaRCA: Multi‑Agent Reinforcement Learning Computation Allocation for Full‑Chain Advertising Systems

The article presents MaRCA, a multi‑agent reinforcement learning framework that models user value, compute consumption, and action reward to allocate limited computation resources across the entire advertising recommendation pipeline, achieving higher ad revenue while keeping system load stable under fluctuating traffic and diverse request values.

Deep LearningLoad-Aware SchedulingMulti-Agent Reinforcement Learning

0 likes · 16 min read

MaRCA: Multi‑Agent Reinforcement Learning Computation Allocation for Full‑Chain Advertising Systems

Python Programming Learning Circle

Sep 10, 2024 · Artificial Intelligence

Using TorchRL to Implement Multi‑Agent PPO for MARL

This tutorial explains how to set up a multi‑agent reinforcement learning (MARL) environment with VMAS, install required dependencies, configure PPO hyper‑parameters, build policy and critic networks, collect data with TorchRL, and run a training loop to train agents for coordinated navigation tasks.

Deep LearningMulti-Agent Reinforcement LearningPPO

0 likes · 10 min read

Using TorchRL to Implement Multi‑Agent PPO for MARL

DataFunTalk

Aug 24, 2023 · Artificial Intelligence

Multi-Agent Decision Large Models: Challenges, Action Semantic Networks, Permutation Invariance/Equivariance, and Automated Curriculum Learning

This talk outlines the fundamental challenges of multi‑agent decision large models, introduces three core design priors—action semantic networks, permutation invariance/equivariance, and cross‑task automated curriculum learning— and demonstrates how these concepts improve performance across diverse environments such as StarCraft, Neural‑MMO, and SMAC.

AIMulti-Agent Reinforcement Learningaction semantics

0 likes · 12 min read

Multi-Agent Decision Large Models: Challenges, Action Semantic Networks, Permutation Invariance/Equivariance, and Automated Curriculum Learning

Alimama Tech

Mar 9, 2022 · Artificial Intelligence

Multi-Agent Auto-bidding (MAAB): A Framework for Distributed Automatic Bidding in Online Advertising

The paper introduces MAAB, a scalable multi‑agent reinforcement‑learning framework for online ad bidding that uses temperature‑regularized credit assignment, adaptive threshold agents, and mean‑field clustering to balance individual advertiser utility, platform revenue, and overall social welfare in competitive auction environments.

Multi-Agent Reinforcement LearningOnline Advertisingauto-bidding

0 likes · 28 min read

Multi-Agent Auto-bidding (MAAB): A Framework for Distributed Automatic Bidding in Online Advertising

Alimama Tech

Oct 13, 2021 · Artificial Intelligence

Multi-Agent Cooperative Bidding Game Framework for Multi-Objective Optimization in Online Advertising

The paper presents MACG, a multi‑agent cooperative bidding game that integrates a global objective with individual advertiser goals, derives optimal bidding formulas, employs a strategy network and evolutionary search to tune parameters, and demonstrates over‑5% metric gains and stable 15‑day performance in Taobao’s online advertising platform.

Multi-Agent Reinforcement LearningOnline AdvertisingTaobao advertising platform

0 likes · 18 min read

Multi-Agent Cooperative Bidding Game Framework for Multi-Objective Optimization in Online Advertising

Amap Tech

Mar 5, 2021 · Artificial Intelligence

AI Applications in Mobility: Route Planning, ETA Prediction, Dynamic Event Mining, and Global Scheduling

The article surveys Amap’s AI‑driven mobility solutions—from personalized, multi‑objective route planning using Cell‑Based Routing and bias‑aware sorting, through spatio‑temporal ETA prediction and lightweight BERT‑based traffic‑event mining, to rapid POI freshness updates and a future global scheduling system that coordinates vehicles and signals via multi‑agent reinforcement learning.

AIKnowledge DistillationMulti-Agent Reinforcement Learning

0 likes · 14 min read

AI Applications in Mobility: Route Planning, ETA Prediction, Dynamic Event Mining, and Global Scheduling

Alibaba Cloud Developer

Apr 7, 2017 · Artificial Intelligence

How BiCNet Enables Multi‑Agent Cooperation in StarCraft Battles

This article reviews the BiCNet framework, a bidirectional coordination network that lets multiple AI agents learn cooperative strategies in StarCraft micro‑battles, achieving state‑of‑the‑art performance across various combat scenarios and demonstrating broad applicability to real‑world multi‑agent tasks.

BiCNetDeep LearningMulti-Agent Reinforcement Learning

0 likes · 14 min read

How BiCNet Enables Multi‑Agent Cooperation in StarCraft Battles

Alibaba Cloud Developer

Apr 4, 2017 · Artificial Intelligence

BiCNet: Mastering Multi-Agent Cooperation in StarCraft Battles

The paper introduces BiCNet, a bidirectional coordination network that learns optimal multi‑agent strategies in StarCraft micro‑battles—ranging from collision‑free movement to complex cover attacks and focused fire—outperforming prior state‑of‑the‑art methods and demonstrating scalable potential for real‑world cooperative AI tasks.

BiCNetDeep LearningMulti-Agent Reinforcement Learning

0 likes · 14 min read

BiCNet: Mastering Multi-Agent Cooperation in StarCraft Battles