Tagged articles

Mobile Agent

5 articles · Page 1 of 1
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Jun 6, 2026 · Artificial Intelligence

How to Systematically Build More Realistic Mobile Agent Environments for Large‑Scale Training

PhoneWorld reconstructs mock Android apps from real‑world usage traces, creating scalable, resettable, and verifiable environments that let Mobile Agents train on realistic page structures, navigation paths, and state changes, and the paper shows substantial gains across four mobile benchmarks.

AI trainingMobile AgentPhoneWorld
0 likes · 12 min read
How to Systematically Build More Realistic Mobile Agent Environments for Large‑Scale Training
Machine Heart
Machine Heart
Jun 5, 2026 · Artificial Intelligence

Building More Realistic Mobile Agent Worlds for Large‑Scale Training

The article examines the PhoneWorld project, which reconstructs realistic Android app environments from user interaction traces to create scalable, resettable, and verifiable mock apps, enabling large‑scale training and evaluation of Mobile Agents with demonstrated performance gains across multiple benchmarks.

Large‑Scale TrainingMobile AgentPhoneWorld
0 likes · 12 min read
Building More Realistic Mobile Agent Worlds for Large‑Scale Training
DataFunSummit
DataFunSummit
Jul 30, 2024 · Artificial Intelligence

Multimodal Mobile AI Agent (Mobile‑Agent): From V1 to V2 and Open‑Source Practice

This article introduces Alibaba Tongyi Lab's multimodal mobile AI agent, Mobile‑Agent, covering the background of large‑model agents, the design and capabilities of V1 and V2, the multi‑agent framework, evaluation results, open‑source resources, and future development directions.

AI planningLarge Language ModelMobile Agent
0 likes · 13 min read
Multimodal Mobile AI Agent (Mobile‑Agent): From V1 to V2 and Open‑Source Practice
DataFunTalk
DataFunTalk
Feb 5, 2024 · Artificial Intelligence

Mobile-Agent: An Autonomous Multi‑Modal Mobile Device Agent with Visual Perception

The Mobile-Agent paper presents a vision‑only, autonomous multi‑modal AI system that can interpret user commands, locate UI elements on a smartphone screen, and execute complex tasks such as browsing, commenting, and content creation through a defined operation space, self‑planning, and self‑reflection mechanisms, achieving high success rates across diverse Chinese and English scenarios.

Mobile AgentMultimodal AIautonomous operation
0 likes · 7 min read
Mobile-Agent: An Autonomous Multi‑Modal Mobile Device Agent with Visual Perception