Tagged articles

GUI automation

9 articles

Machine Learning Algorithms & Natural Language Processing

Mar 12, 2026 · Artificial Intelligence

LongHorizonUI: A Unified Robust Framework for Long‑Horizon GUI Agent Automation

LongHorizonUI tackles the steep drop in GUI‑agent success rates on tasks longer than 10‑15 steps. It introduces three tightly coupled modules (enhanced perception, deep reflective decision, and compensatory execution) and validates the approach on the new LongGUIBench benchmark, reporting consistent performance gains across both app and game scenarios.

Benchmark · GUI automation · ICLR 2026

0 likes · 12 min read

PaperAgent

Dec 10, 2025 · Artificial Intelligence

How AI Agents Like UFO, Mobile-Agent, and UI-TARS Are Shaping 2025 Smartphones

The article examines the underlying GUI‑Agent technologies behind the 2025 “Doubao” smartphone, comparing Microsoft’s UFO series, Alibaba’s Mobile‑Agent v2/v3, and ByteDance’s UI‑TARS, detailing their model foundations, input modalities, action spaces, planning mechanisms, learning strategies, open‑source status, and multi‑agent frameworks.

AI agents · GUI automation · Open-source

0 likes · 8 min read

Network Intelligence Research Center (NIRC)

Nov 4, 2025 · Artificial Intelligence

SEAgent: A Self‑Evolving Computer Agent that Learns Software Use Autonomously

SEAgent introduces a self‑evolving framework that enables a GUI agent to master unfamiliar software through autonomous exploration and experience learning, leveraging a curriculum generator, a world‑state model, and GRPO‑based reinforcement with adversarial imitation, achieving state‑of‑the‑art performance on OSWorld.

GUI automation · SEAgent · autonomous learning

0 likes · 6 min read

AsiaInfo Technology: New Tech Exploration

Oct 28, 2025 · Industry Insights

How Multimodal LLMs Are Transforming GUI Automation: A Comprehensive Survey

This article surveys the evolution of GUI automation from rule‑based scripts to multimodal large‑model‑driven agents, detailing core architectures, key components, application scenarios, current challenges, and future research directions for intelligent GUI agents.

GUI automation · Human-Computer Interaction · Industry Survey

0 likes · 19 min read

Software Engineering 3.0 Era

Mar 14, 2025 · Artificial Intelligence

From J.A.R.V.I.S. to Real AI Agents: A Must‑Read Guide to Modern GUI Agents

This article provides a comprehensive overview of AI agents, focusing on GUI‑based agents, their definitions, classifications, core capabilities, recent research such as OpenAI's ComputerUse, SpiritSight and MobileFlow, practical applications, technical and security challenges, and future development directions.

AI agents · ComputerUse · GUI automation

0 likes · 16 min read

Python Programming Learning Circle

Jun 18, 2024 · Operations

Using pywinauto for Windows GUI Automation with Python

This article introduces the pywinauto library, explains how to install it, create Application objects, locate windows and controls, and perform mouse and keyboard automation on Windows applications using Python, with detailed code examples and usage tips.

GUI automation · Keyboard · Mouse

0 likes · 8 min read

ByteDance SE Lab

Aug 21, 2023 · Artificial Intelligence

How Fastbot Uses Reinforcement Learning for Faster Android GUI Testing

Fastbot is a reusable, model‑based Android GUI testing tool that uses reinforcement learning to learn from previous test runs. It accelerates coverage and crash detection through a two‑phase workflow and event selection that is both probabilistic and learning‑based, and it provides configurable custom events, widget blocking, and tree pruning.

GUI automation · android testing · fastbot

0 likes · 16 min read

Python Programming Learning Circle

Sep 7, 2021 · Operations

Using PyAutoGUI for Cross‑Platform GUI Automation in Python

This article introduces PyAutoGUI, a cross‑platform Python library for automating mouse and keyboard actions, explains its coordinate system, outlines key functions for mouse, keyboard, dialogs, and screenshots, and provides practical code examples for drawing and image‑based button clicking.

Cross-Platform · GUI automation · Python

0 likes · 5 min read

360 Quality & Efficiency

Jun 28, 2019 · Operations

Using Sikuli for GUI Automation: Installation, Python Integration, and Practical Tips

This article introduces Sikuli, an image‑based GUI automation tool, explains its origins, provides download links, details installation steps, demonstrates Python integration via the Lackey library and SikuliX API, shares useful code snippets, and highlights common pitfalls and overall considerations for test automation.

GUI automation · Lackey · Python

0 likes · 6 min read
