Tagged articles

12 articles

Page 1 of 1

May 11, 2026 · Artificial Intelligence

Why Visual Perception Limits STEM Large Models and How CodePercept Breaks the Barrier

The authors demonstrate that visual perception, not reasoning, is the primary bottleneck for STEM multimodal large language models, introduce the CodePercept paradigm and the ICC-1M dataset, and show that code‑driven perception dramatically improves performance, surpassing much larger models on new benchmarks.

BenchmarkCVPR2026CodePercept

0 likes · 9 min read

Why Visual Perception Limits STEM Large Models and How CodePercept Breaks the Barrier

Machine Heart

May 7, 2026 · Artificial Intelligence

Photo‑Level Simulation Bridges Vision Gap for Robot Learning (GS‑Playground, RSS 2026)

GS‑Playground is a next‑generation visual‑high‑fidelity robot simulator that cuts photo‑level rendering cost, automates asset creation, and narrows the Sim2Real gap, achieving up to 10,000 FPS on RTX 4090 and outperforming MuJoCo by 32× while supporting full‑stack parallel physics, 3DGS rendering, and end‑to‑end Real2Sim pipelines.

3D Gaussian SplattingHigh ThroughputRobotics

0 likes · 10 min read

Photo‑Level Simulation Bridges Vision Gap for Robot Learning (GS‑Playground, RSS 2026)

Model Perspective

Sep 1, 2025 · Operations

Mathematical Secrets Behind a Perfect Military Parade

This article explores how mathematical models—ranging from matrix representations of formations to error analysis, phase synchronization, timing control, perspective geometry, and multi‑objective optimization—can be applied to design, evaluate, and perfect military parades.

Visual Perceptionformation optimizationmathematical modeling

0 likes · 6 min read

Mathematical Secrets Behind a Perfect Military Parade

AIWalker

Aug 5, 2025 · Artificial Intelligence

Perception‑R1: RL Gives Visual Insight Without Chain‑of‑Thought, Beats Four Tasks

The paper introduces Perception‑R1, a rule‑based reinforcement‑learning framework that trains multimodal large language models for visual perception tasks without relying on chain‑of‑thought reasoning, and demonstrates up to 17.9% performance gains on RefCOCO+, PixMo‑Count, PageOCR and COCO2017, while analyzing the key roles of perception confusion and reward design.

BenchmarkRLHFVisual Perception

0 likes · 24 min read

Perception‑R1: RL Gives Visual Insight Without Chain‑of‑Thought, Beats Four Tasks

VMIC UED

Jul 21, 2025 · Fundamentals

Unlocking Design Psychology: How Gestalt Principles Shape User Experience

This article explores design psychology and Gestalt principles, explaining why designers need psychological insight, how perception biases affect user interaction, and offering practical tips and examples to create more intuitive, consistent, and engaging visual designs.

Gestalt principlesUser experienceVisual Perception

0 likes · 22 min read

Unlocking Design Psychology: How Gestalt Principles Shape User Experience

DataFunTalk

Feb 5, 2024 · Artificial Intelligence

Mobile-Agent: An Autonomous Multi‑Modal Mobile Device Agent with Visual Perception

The Mobile-Agent paper presents a vision‑only, autonomous multi‑modal AI system that can interpret user commands, locate UI elements on a smartphone screen, and execute complex tasks such as browsing, commenting, and content creation through a defined operation space, self‑planning, and self‑reflection mechanisms, achieving high success rates across diverse Chinese and English scenarios.

Mobile AutomationMultimodal AIVisual Perception

0 likes · 7 min read

Mobile-Agent: An Autonomous Multi‑Modal Mobile Device Agent with Visual Perception

Xiaohongshu Tech REDtech

Nov 11, 2022 · Artificial Intelligence

Media Experience Quality Assessment: Visual Perception and Objective Quality Metrics

Professor Zhai’s REDtech talk explained how the human visual system underlies full‑, reduced‑ and no‑reference media quality metrics, introduced a free‑energy‑based perception model and pseudo‑reference technique for accurate no‑reference UGC video assessment, and discussed audio‑visual integration, opinion‑score distributions, and EEG‑based perceptual loss challenges.

UGCVisual Perceptionaudio-visual

0 likes · 15 min read

Media Experience Quality Assessment: Visual Perception and Objective Quality Metrics

Zhaori User Experience

Oct 7, 2022 · Fundamentals

Unlocking Visual Psychology: 6 Design Secrets Every Designer Must Know

This article explores essential visual psychology concepts for designers, covering how we observe, read, remember, think, focus, and make decisions, and offers practical design guidelines such as leveraging optical illusions, peripheral vision, pattern recognition, color‑blind accessibility, cultural color meanings, and the proximity principle to create more intuitive user experiences.

UI designUser experienceVisual Perception

0 likes · 18 min read

Unlocking Visual Psychology: 6 Design Secrets Every Designer Must Know

JD Tech

Feb 1, 2018 · Artificial Intelligence

Telepath: A Vision‑Based Recommender Model Inspired by Human Visual Perception

The Telepath model, presented at AAAI 2018, leverages a biologically‑inspired visual extraction pipeline and dual interest‑understanding networks to improve ranking in large‑scale e‑commerce recommendation and advertising, achieving significant offline and online gains in CTR, GMV, and ROI.

AAAI 2018Deep LearningTelepath

0 likes · 13 min read

Telepath: A Vision‑Based Recommender Model Inspired by Human Visual Perception

58UXD

Nov 25, 2016 · Fundamentals

Why 120 FPS Makes “Billy Lynn’s Midway Battle” Feel So Real

The article explains how higher frame rates like 120 fps, along with visual‑perception phenomena such as flash fusion and apparent motion, give movies such as “Billy Lynn’s Midway Battle” a hyper‑realistic feel compared to the traditional 24 fps cinema standard.

120fpsCinemaFilm Technology

0 likes · 10 min read

Why 120 FPS Makes “Billy Lynn’s Midway Battle” Feel So Real

58UXD

Mar 8, 2016 · Frontend Development

How Psychology Shapes User Experience: 5 Design Tricks That Influence Behavior

This article explores how fundamental psychological principles—such as compulsion, conformity, visual illusion, and facial attraction—shape user experience design, illustrating each concept with real-world examples and visual cues to help designers create more engaging, trustworthy, and intuitive interfaces.

UI designUser experienceVisual Perception

0 likes · 8 min read

How Psychology Shapes User Experience: 5 Design Tricks That Influence Behavior

Meiyou UED

Sep 1, 2015 · Fundamentals

How Flat Design Transforms ACG: From Anime to Game Art

This article explores how the flat design movement, sparked by iOS 7, influences ACG (animation, comics, games), examining visual perception, contrast, and the distinction between flat symbolism and artistic abstraction, while highlighting examples from anime, CG artist Craig Mullins, and the award‑winning game Journey.

ACGVisual Perceptionanimation

0 likes · 8 min read

How Flat Design Transforms ACG: From Anime to Game Art