Beyond Orchestrating Workflows: How UnityMAS-O Trains LLM-Based Multi-Agent Systems
UnityMAS-O is a general reinforcement-learning framework that converts predefined LLM multi-agent workflows into trainable tasks. It enables credit assignment across agent roles, supports parameter-sharing configurations, and reports significant gains in F1 on QA benchmarks and in test-pass rate on code-generation benchmarks.
