Tagged articles
8 articles
Page 1 of 1
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
May 12, 2026 · Artificial Intelligence

Treating Automated Testing as AI Coding: Xiaohongshu GUI Agent Real‑World Review

During the 2026 Spring Festival promotion, Xiaohongshu replaced manual UI testing with a three‑layer AI‑driven GUI Agent that executed over 43,000 runs across 106 devices and 128 scenarios, achieving 58% automation, 82% AI‑generated case adoption, 68% bug recall, 98% stability and roughly $1 per test case while drastically cutting token costs.

AI CodingAutomated TestingCode-as-Action
0 likes · 23 min read
Treating Automated Testing as AI Coding: Xiaohongshu GUI Agent Real‑World Review
AI Engineering
AI Engineering
Apr 1, 2026 · Artificial Intelligence

Holo3 AI Model Beats GPT‑5.4 at One‑Tenth the Cost for Computer Use

H Company’s new Holo3 series delivers a visual language model that outperforms GPT‑5.4 on the OSWorld‑Verified benchmark with a 78.85% score while costing only about one‑tenth as much, offering both a flagship API‑only version and an open‑source lightweight variant optimized for GUI agents.

AI BenchmarkGUI AgentHolo3
0 likes · 4 min read
Holo3 AI Model Beats GPT‑5.4 at One‑Tenth the Cost for Computer Use
DataFunSummit
DataFunSummit
Dec 23, 2025 · Artificial Intelligence

What Core Capabilities Do Mature GUI Agents Need? Expert Insights from the Agentic AI Summit

In a live discussion hosted by Prof. Yang Jian with experts Zhang Xi and Cui Chen, the panel explores the essential abilities of mature GUI agents, the role of multimodal models in visual understanding, the transfer of code‑agent techniques to GUI tasks, edge‑device performance trade‑offs, complex planning, tool ecosystems, deployment challenges, and future breakthrough scenarios.

Agentic AICode AgentGUI Agent
0 likes · 22 min read
What Core Capabilities Do Mature GUI Agents Need? Expert Insights from the Agentic AI Summit
AntTech
AntTech
Aug 19, 2025 · Artificial Intelligence

How UI‑Venus Achieves SOTA in Multimodal GUI Agent Benchmarks

Ant Group's open‑source native GUI agent UI‑Venus leverages multimodal large‑model and reinforcement‑learning techniques to outperform prior models on grounding and navigation benchmarks, while using a high‑quality data pipeline and a self‑evolving alignment mechanism to push the limits of GUI automation.

BenchmarkGUI AgentMultimodal AI
0 likes · 7 min read
How UI‑Venus Achieves SOTA in Multimodal GUI Agent Benchmarks
ByteDance Web Infra
ByteDance Web Infra
Jan 22, 2025 · Artificial Intelligence

Introducing UI‑TARS: A Native GUI Agent Model Integrated with Midscene.js for Multimodal UI Automation

The article presents UI‑TARS, a native GUI‑agent model that combines multimodal large‑language models with the open‑source Midscene.js framework to enable more accurate, token‑efficient, and privacy‑preserving UI automation, while discussing its architecture, advantages, limitations, and integration steps.

GUI AgentMidscene.jsMultimodal AI
0 likes · 11 min read
Introducing UI‑TARS: A Native GUI Agent Model Integrated with Midscene.js for Multimodal UI Automation
AI Large Model Application Practice
AI Large Model Application Practice
Dec 9, 2024 · Artificial Intelligence

How GUI Agents Use Large Models to Automate Any Desktop Task

This article explains why GUI agents are needed, defines their multimodal capabilities, walks through a high‑level automation scenario, details the architecture of large‑model‑driven GUI agents, highlights recent open‑source projects, and compares them with traditional RPA solutions.

AI automationGUI AgentHuman-Computer Interaction
0 likes · 10 min read
How GUI Agents Use Large Models to Automate Any Desktop Task