Tagged articles
12 articles
Page 1 of 1
Machine Heart
Machine Heart
May 11, 2026 · Artificial Intelligence

Why Visual Perception Limits STEM Large Models and How CodePercept Breaks the Barrier

The authors demonstrate that visual perception, not reasoning, is the primary bottleneck for STEM multimodal large language models, introduce the CodePercept paradigm and the ICC-1M dataset, and show that code‑driven perception dramatically improves performance, surpassing much larger models on new benchmarks.

BenchmarkCVPR2026CodePercept
0 likes · 9 min read
Why Visual Perception Limits STEM Large Models and How CodePercept Breaks the Barrier
Machine Heart
Machine Heart
May 7, 2026 · Artificial Intelligence

Photo‑Level Simulation Bridges Vision Gap for Robot Learning (GS‑Playground, RSS 2026)

GS‑Playground is a next‑generation visual‑high‑fidelity robot simulator that cuts photo‑level rendering cost, automates asset creation, and narrows the Sim2Real gap, achieving up to 10,000 FPS on RTX 4090 and outperforming MuJoCo by 32× while supporting full‑stack parallel physics, 3DGS rendering, and end‑to‑end Real2Sim pipelines.

3D Gaussian SplattingHigh ThroughputRobotics
0 likes · 10 min read
Photo‑Level Simulation Bridges Vision Gap for Robot Learning (GS‑Playground, RSS 2026)
Model Perspective
Model Perspective
Sep 1, 2025 · Operations

Mathematical Secrets Behind a Perfect Military Parade

This article explores how mathematical models—ranging from matrix representations of formations to error analysis, phase synchronization, timing control, perspective geometry, and multi‑objective optimization—can be applied to design, evaluate, and perfect military parades.

Visual Perceptionformation optimizationmathematical modeling
0 likes · 6 min read
Mathematical Secrets Behind a Perfect Military Parade
AIWalker
AIWalker
Aug 5, 2025 · Artificial Intelligence

Perception‑R1: RL Gives Visual Insight Without Chain‑of‑Thought, Beats Four Tasks

The paper introduces Perception‑R1, a rule‑based reinforcement‑learning framework that trains multimodal large language models for visual perception tasks without relying on chain‑of‑thought reasoning, and demonstrates up to 17.9% performance gains on RefCOCO+, PixMo‑Count, PageOCR and COCO2017, while analyzing the key roles of perception confusion and reward design.

BenchmarkRLHFVisual Perception
0 likes · 24 min read
Perception‑R1: RL Gives Visual Insight Without Chain‑of‑Thought, Beats Four Tasks
VMIC UED
VMIC UED
Jul 21, 2025 · Fundamentals

Unlocking Design Psychology: How Gestalt Principles Shape User Experience

This article explores design psychology and Gestalt principles, explaining why designers need psychological insight, how perception biases affect user interaction, and offering practical tips and examples to create more intuitive, consistent, and engaging visual designs.

Gestalt principlesUser experienceVisual Perception
0 likes · 22 min read
Unlocking Design Psychology: How Gestalt Principles Shape User Experience
DataFunTalk
DataFunTalk
Feb 5, 2024 · Artificial Intelligence

Mobile-Agent: An Autonomous Multi‑Modal Mobile Device Agent with Visual Perception

The Mobile-Agent paper presents a vision‑only, autonomous multi‑modal AI system that can interpret user commands, locate UI elements on a smartphone screen, and execute complex tasks such as browsing, commenting, and content creation through a defined operation space, self‑planning, and self‑reflection mechanisms, achieving high success rates across diverse Chinese and English scenarios.

Mobile AutomationMultimodal AIVisual Perception
0 likes · 7 min read
Mobile-Agent: An Autonomous Multi‑Modal Mobile Device Agent with Visual Perception
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Nov 11, 2022 · Artificial Intelligence

Media Experience Quality Assessment: Visual Perception and Objective Quality Metrics

Professor Zhai’s REDtech talk explained how the human visual system underlies full‑, reduced‑ and no‑reference media quality metrics, introduced a free‑energy‑based perception model and pseudo‑reference technique for accurate no‑reference UGC video assessment, and discussed audio‑visual integration, opinion‑score distributions, and EEG‑based perceptual loss challenges.

UGCVisual Perceptionaudio-visual
0 likes · 15 min read
Media Experience Quality Assessment: Visual Perception and Objective Quality Metrics
Zhaori User Experience
Zhaori User Experience
Oct 7, 2022 · Fundamentals

Unlocking Visual Psychology: 6 Design Secrets Every Designer Must Know

This article explores essential visual psychology concepts for designers, covering how we observe, read, remember, think, focus, and make decisions, and offers practical design guidelines such as leveraging optical illusions, peripheral vision, pattern recognition, color‑blind accessibility, cultural color meanings, and the proximity principle to create more intuitive user experiences.

UI designUser experienceVisual Perception
0 likes · 18 min read
Unlocking Visual Psychology: 6 Design Secrets Every Designer Must Know
JD Tech
JD Tech
Feb 1, 2018 · Artificial Intelligence

Telepath: A Vision‑Based Recommender Model Inspired by Human Visual Perception

The Telepath model, presented at AAAI 2018, leverages a biologically‑inspired visual extraction pipeline and dual interest‑understanding networks to improve ranking in large‑scale e‑commerce recommendation and advertising, achieving significant offline and online gains in CTR, GMV, and ROI.

AAAI 2018Deep LearningTelepath
0 likes · 13 min read
Telepath: A Vision‑Based Recommender Model Inspired by Human Visual Perception
58UXD
58UXD
Nov 25, 2016 · Fundamentals

Why 120 FPS Makes “Billy Lynn’s Midway Battle” Feel So Real

The article explains how higher frame rates like 120 fps, along with visual‑perception phenomena such as flash fusion and apparent motion, give movies such as “Billy Lynn’s Midway Battle” a hyper‑realistic feel compared to the traditional 24 fps cinema standard.

120fpsCinemaFilm Technology
0 likes · 10 min read
Why 120 FPS Makes “Billy Lynn’s Midway Battle” Feel So Real
58UXD
58UXD
Mar 8, 2016 · Frontend Development

How Psychology Shapes User Experience: 5 Design Tricks That Influence Behavior

This article explores how fundamental psychological principles—such as compulsion, conformity, visual illusion, and facial attraction—shape user experience design, illustrating each concept with real-world examples and visual cues to help designers create more engaging, trustworthy, and intuitive interfaces.

UI designUser experienceVisual Perception
0 likes · 8 min read
How Psychology Shapes User Experience: 5 Design Tricks That Influence Behavior
Meiyou UED
Meiyou UED
Sep 1, 2015 · Fundamentals

How Flat Design Transforms ACG: From Anime to Game Art

This article explores how the flat design movement, sparked by iOS 7, influences ACG (animation, comics, games), examining visual perception, contrast, and the distinction between flat symbolism and artistic abstraction, while highlighting examples from anime, CG artist Craig Mullins, and the award‑winning game Journey.

ACGVisual Perceptionanimation
0 likes · 8 min read
How Flat Design Transforms ACG: From Anime to Game Art