Ma Wei Says
Mar 4, 2025 · Artificial Intelligence
Microsoft’s Open‑Source Multimodal AI Agent Model Magma: Capabilities and Innovations
On February 25 2025, Microsoft open‑sourced its first multimodal AI agent foundation model, Magma, which extends multimodal processing to images, video, and text, introduces Set‑of‑Mark and Trace‑of‑Mark techniques for spatial‑temporal reasoning, optimizes modular inference for edge devices, and integrates reinforcement learning for adaptive task execution.
Edge ComputingMagmaSet-of-Mark
0 likes · 6 min read
