Tagged articles
36 articles
Page 1 of 1
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
May 15, 2026 · Artificial Intelligence

How Google’s AI‑Enabled Pointer Lets AI Read Your Intent Without Prompts

Google DeepMind’s new AI‑enabled pointer prototype shows how a cursor can capture visual context and intent, letting Gemini understand user commands without lengthy prompt engineering, and demonstrates two demos—AI‑Pointer: Create and AI‑Pointer: Find—while outlining design principles and future challenges.

AI-pointerDeepMindGemini
0 likes · 10 min read
How Google’s AI‑Enabled Pointer Lets AI Read Your Intent Without Prompts
AI Explorer
AI Explorer
Apr 9, 2026 · Artificial Intelligence

Hermes Agent: An Open‑Source AI Assistant That Controls Your PC via Natural Language

Hermes Agent is an open‑source AI assistant that translates natural‑language commands into concrete desktop actions by coupling large language models with OS automation interfaces, enabling tasks like file organization, web queries, and cross‑application workflows, while outlining its architecture, capabilities, limitations, and future prospects.

AI AssistantDesktop AutomationHuman-Computer Interaction
0 likes · 5 min read
Hermes Agent: An Open‑Source AI Assistant That Controls Your PC via Natural Language
Machine Heart
Machine Heart
Apr 3, 2026 · Artificial Intelligence

Capture Character Animation from Any Object Using Just a Phone – CHI 2026 Best Paper Nominee

DancingBox demonstrates that a single RGB camera, a flat calibration board, and any handheld object can be used to capture realistic character animation by first estimating coarse 3D bounding‑box motion with visual foundation models and then refining it with a diffusion‑based motion generation model, validated by a user study.

AIDancingBoxHuman-Computer Interaction
0 likes · 9 min read
Capture Character Animation from Any Object Using Just a Phone – CHI 2026 Best Paper Nominee
SuanNi
SuanNi
Mar 27, 2026 · Artificial Intelligence

Can AI Build a Whole Website From One Sentence? Inside Google’s Flash‑Lite Browser

Google DeepMind’s experimental Flash‑Lite Browser uses the Gemini 3.1 Flash‑Lite model to generate complete, interactive web pages in real time from natural‑language prompts, eliminating traditional front‑end development cycles and reshaping how users and developers experience the web.

AIFlash-Lite BrowserGemini 3.1
0 likes · 9 min read
Can AI Build a Whole Website From One Sentence? Inside Google’s Flash‑Lite Browser
PMTalk Product Manager Community
PMTalk Product Manager Community
Jan 7, 2026 · Artificial Intelligence

Why AI+AR Product Managers Who Fuse Sensor Data and User Behavior Are In High Demand

The article analyses AI‑augmented reality intent‑recognition systems, detailing multi‑modal data fusion, three‑way interaction architectures, and adaptive response mechanisms, and demonstrates their impact across medical surgery, elderly care, accessibility, and product design while outlining technical challenges and design methodologies.

AIARHuman-Computer Interaction
0 likes · 20 min read
Why AI+AR Product Managers Who Fuse Sensor Data and User Behavior Are In High Demand
IT Services Circle
IT Services Circle
Jul 1, 2025 · Artificial Intelligence

From Microsoft Bob to Copilot: How Virtual Assistants Evolved and What We Learned

The article traces Microsoft’s experiments with virtual assistants—from the home‑metaphor of Microsoft Bob and the intrusive Clippy to modern AI‑driven Copilot—highlighting design lessons about timing, personalization, user control, and how advances in large language models finally make the long‑standing vision viable.

AI designHuman-Computer InteractionMicrosoft
0 likes · 7 min read
From Microsoft Bob to Copilot: How Virtual Assistants Evolved and What We Learned
Java Tech Enthusiast
Java Tech Enthusiast
Jun 29, 2025 · Artificial Intelligence

From Microsoft Bob to Copilot: How Virtual Assistants Evolved with AI

This article traces the evolution of Microsoft’s virtual assistants—from the home‑metaphor of Bob and the intrusive Clippy to the voice‑enabled Cortana and the modern AI‑powered Copilot—highlighting design lessons, user reactions, and the impact of large language models on productivity software.

AIHuman-Computer InteractionMicrosoft
0 likes · 8 min read
From Microsoft Bob to Copilot: How Virtual Assistants Evolved with AI
58UXD
58UXD
Apr 17, 2025 · Artificial Intelligence

How Zero‑UI and Gemini’s Multimodal AI Are Redefining Human‑Computer Interaction

Zero‑UI, powered by multimodal AI models like Google Gemini, is shifting design from screen‑based interfaces to natural voice, gesture, and environmental interactions, prompting a fundamental redesign of how devices understand user intent across smart homes, cars, and immersive experiences.

AIHuman-Computer InteractionMultimodal
0 likes · 9 min read
How Zero‑UI and Gemini’s Multimodal AI Are Redefining Human‑Computer Interaction
AI Large Model Application Practice
AI Large Model Application Practice
Dec 9, 2024 · Artificial Intelligence

How GUI Agents Use Large Models to Automate Any Desktop Task

This article explains why GUI agents are needed, defines their multimodal capabilities, walks through a high‑level automation scenario, details the architecture of large‑model‑driven GUI agents, highlights recent open‑source projects, and compares them with traditional RPA solutions.

AI automationGUI AgentHuman-Computer Interaction
0 likes · 10 min read
How GUI Agents Use Large Models to Automate Any Desktop Task
21CTO
21CTO
Nov 24, 2024 · Artificial Intelligence

When AI Becomes a Digital Jesus: Inside the ‘Deus in Machina’ Confession Booth

An experimental art installation at Lucerne’s Peterskapelle Catholic Church lets visitors converse with an AI‑driven “Jesus” that answers in up to 100 languages, sparking both curiosity and ethical debate about AI’s role in religion and personal data protection.

AI ethicsArtificial IntelligenceHuman-Computer Interaction
0 likes · 6 min read
When AI Becomes a Digital Jesus: Inside the ‘Deus in Machina’ Confession Booth
Ops Development & AI Practice
Ops Development & AI Practice
Oct 4, 2024 · Artificial Intelligence

How ChatGPT 4.0 with Canvas Redefines Multimodal Human‑AI Interaction

ChatGPT 4.0 with Canvas introduces a visual "canvas" that blends language and graphics, enabling multimodal dialogue, real‑time visual feedback, and collaborative workflows across education, design, and business, while posing technical challenges in vision‑language integration, context consistency, and performance optimization.

AI applicationsCanvasChatGPT
0 likes · 10 min read
How ChatGPT 4.0 with Canvas Redefines Multimodal Human‑AI Interaction
AntTech
AntTech
Sep 16, 2024 · Artificial Intelligence

Opportunities and Challenges in the Era of Large Models: Technology Integration and Industry Leap

In his keynote at the 2024 Inclusion·Bund Conference, HKUST Board Chair Shen Xiangyang discusses how large‑model AI reshapes human‑computer interaction, introduces the concept of Intelligent Augmentation, emphasizes responsible AI governance, and outlines the practical steps needed to deploy AI agents in industry.

AIAI GovernanceHuman-Computer Interaction
0 likes · 4 min read
Opportunities and Challenges in the Era of Large Models: Technology Integration and Industry Leap
58UXD
58UXD
Dec 8, 2023 · Artificial Intelligence

How AI Agents Will Redefine Design, UX, and Interaction in the Next Five Years

Bill Gates predicts AI agents will soon replace most apps, prompting designers to rethink user experience, explore new interaction devices, and master AI communication skills, while emphasizing accessible design principles to thrive in a rapidly evolving digital landscape.

AI CollaborationAI agentsDesign
0 likes · 8 min read
How AI Agents Will Redefine Design, UX, and Interaction in the Next Five Years
We-Design
We-Design
Aug 16, 2023 · Frontend Development

Mastering Keyboard Shortcuts: Design Principles and Best Practices

This article explains why keyboard shortcuts boost efficiency, reduce fatigue, and improve precision, classifies accelerators and access keys, outlines discoverable, conflict‑free, and memorable design principles, and provides a step‑by‑step workflow for creating a robust shortcut system across platforms.

Human-Computer InteractionUI designfrontend development
0 likes · 7 min read
Mastering Keyboard Shortcuts: Design Principles and Best Practices
We-Design
We-Design
Aug 10, 2023 · Fundamentals

Can Unconscious Design Make Gesture Interfaces Instinctive?

Exploring how unconscious design principles—such as objective sketching and perceptual substitution—can reduce the learning curve of gesture-based interfaces by aligning hidden gestures with users' natural perceptions, the article examines UI evolution, case studies, and future prospects for more intuitive interactions.

Human-Computer Interactiongesture designunconscious design
0 likes · 12 min read
Can Unconscious Design Make Gesture Interfaces Instinctive?
Baidu Tech Salon
Baidu Tech Salon
May 26, 2023 · Industry Insights

How Large Models Are Redefining AI and Shaping the Next Industrial Revolution

In a 2023 Zhi Guan Cun Forum speech, Baidu CEO Robin Li explains how large AI models are compressing human knowledge, transforming human‑computer interaction, redefining marketing and customer service, spawning AI‑native applications, and reshaping the entire technology stack, ultimately driving a new era of industrial growth.

AI-native applicationsArtificial IntelligenceHuman-Computer Interaction
0 likes · 10 min read
How Large Models Are Redefining AI and Shaping the Next Industrial Revolution
DataFunSummit
DataFunSummit
Dec 19, 2022 · Artificial Intelligence

Multimodal Large‑Model Driven Virtual Digital Humans: Background, Methods, and Applications

This article introduces the rapid development of multimodal digital humans powered by large AI models, covering their background, current challenges, NeRF‑GAN based modeling methods, multimodal dialogue capabilities, and real‑world application cases such as virtual assistants, tourism guides, and sign‑language avatars.

AIGCHuman-Computer InteractionLarge Model
0 likes · 14 min read
Multimodal Large‑Model Driven Virtual Digital Humans: Background, Methods, and Applications
Tencent Cloud Developer
Tencent Cloud Developer
Aug 23, 2022 · Artificial Intelligence

Brain-Computer Interface Competition Showcases AI-Powered Mind-Controlled Technology

The Tencent Cloud‑backed “Tencent Cloud Cup” BCI competition, part of the World Robot Contest, drew over 250 teams from 26 provinces and three countries to tackle brain‑computer tasks like spelling and emotion detection, demonstrating typing, wheelchair and robotic arm applications, with the winning Chinese university team typing 81 characters in 285 seconds and results set for 5G deployment and publication in Brain Science Advances.

AI competitionDeep LearningHuman-Computer Interaction
0 likes · 8 min read
Brain-Computer Interface Competition Showcases AI-Powered Mind-Controlled Technology
Java Backend Technology
Java Backend Technology
Jul 6, 2021 · Operations

Why Do We Multitask in Remote Meetings? Insights from a Stanford‑Microsoft Study

Researchers from Stanford and Microsoft analyzed logs and surveys of 715 U.S. Microsoft employees during remote meetings, revealing that longer, larger meetings increase multitasking—such as emailing and document editing—by up to sixfold, negatively impacting focus, health, and meeting effectiveness, and they propose practical guidelines to curb this behavior.

Human-Computer InteractionMultitaskingproductivity
0 likes · 6 min read
Why Do We Multitask in Remote Meetings? Insights from a Stanford‑Microsoft Study
Tencent Mobility Industry Design Center
Tencent Mobility Industry Design Center
Sep 22, 2020 · Fundamentals

How Tencent’s Car‑Embedded Mini‑Programs Redefine In‑Vehicle UI Design

This guide explains how Tencent’s car‑embedded mini‑programs address the unique constraints of in‑vehicle infotainment by outlining experience highlights, design principles, interaction rules, visual standards, implementation steps, and future outlook for creating safe, efficient, and user‑friendly automotive interfaces.

Design GuidelinesHuman-Computer InteractionMini Program
0 likes · 15 min read
How Tencent’s Car‑Embedded Mini‑Programs Redefine In‑Vehicle UI Design
Programmer DD
Programmer DD
Apr 19, 2020 · Artificial Intelligence

How Gesture Recognition Transforms Mobile Gaming with Real‑Time AI Control

This article presents a gesture‑based human‑computer interaction system that uses Paddle Lite and MobileNet to enable real‑time control of games on Android phones, tablets, and embedded boards, detailing its architecture, data preparation, model training, and on‑device inference.

AndroidHuman-Computer InteractionMobile AI
0 likes · 11 min read
How Gesture Recognition Transforms Mobile Gaming with Real‑Time AI Control
Alibaba Cloud Developer
Alibaba Cloud Developer
Apr 7, 2020 · Artificial Intelligence

How Does Alibaba’s Tmall Genie Achieve Full‑Duplex Natural Dialogue?

This article explains the concept of full‑duplex natural dialogue for Alibaba’s Tmall Genie, illustrates interaction scenarios, and details the technical solution covering device‑side management, speech recognition, language understanding, synthesis, dialogue control, duration handling, and conversation flow.

ASRHuman-Computer InteractionNLU
0 likes · 8 min read
How Does Alibaba’s Tmall Genie Achieve Full‑Duplex Natural Dialogue?
DataFunTalk
DataFunTalk
Dec 9, 2019 · Artificial Intelligence

Automatic Construction of Knowledge Graphs: Methods, Challenges, and Applications

This article reviews the principles, techniques, and challenges of automatically building knowledge graphs, covering logical modeling, latent‑space analysis, human‑computer interaction, ontology support, and practical pipelines, and illustrates their use in network behavior analysis, intelligent Q&A, and recommendation systems.

Artificial IntelligenceHuman-Computer InteractionOntology
0 likes · 17 min read
Automatic Construction of Knowledge Graphs: Methods, Challenges, and Applications
Tencent Cloud Developer
Tencent Cloud Developer
Feb 26, 2019 · Artificial Intelligence

Tencent Cloud Intelligent Speech Technology: Development, Challenges and Practical Applications

Tencent Cloud's intelligent speech platform combines high‑accuracy ASR, advanced WaveNet‑based TTS, and solutions for noise, far‑field, and dialect challenges, enabling voice input, transcription, and customer‑service bots, with real‑world deployments in finance, museums, hotels, and other industry scenarios.

ASRHuman-Computer InteractionSpeech synthesis
0 likes · 8 min read
Tencent Cloud Intelligent Speech Technology: Development, Challenges and Practical Applications
High Availability Architecture
High Availability Architecture
May 28, 2018 · Artificial Intelligence

Interview with GIAC AI Forum Lecturer Long Mingkang on Building AI Platforms, Speech Recognition Challenges, and Future AI Trends

In this interview, Long Mingkang, Vice President of iFlytek's Cloud Computing Institute, shares his experience building large‑scale speech cloud services, discusses the technical hurdles of speech recognition and AI platform development, compares TensorFlow and MXNet, and offers insights on AutoML, industry trends, and how engineers can master AI.

AIAI PlatformsAutoML
0 likes · 13 min read
Interview with GIAC AI Forum Lecturer Long Mingkang on Building AI Platforms, Speech Recognition Challenges, and Future AI Trends
Hulu Beijing
Hulu Beijing
Apr 23, 2018 · Artificial Intelligence

How Intelligent Interaction Is Redefining Human‑Computer Interaction

This article explores the evolution of human‑computer interaction from its early interface concepts through multimodal and intelligent interaction stages, highlighting historical milestones, the rise of AI‑driven smart devices, emerging challenges such as bias, transparency, and the quest for universal interaction methods.

AIBiasDesign
0 likes · 12 min read
How Intelligent Interaction Is Redefining Human‑Computer Interaction
Alibaba Cloud Developer
Alibaba Cloud Developer
Apr 4, 2018 · Artificial Intelligence

How Tsinghua and Alibaba Are Shaping the Future of Human‑Machine Natural Interaction

In April 2018, Tsinghua University and Alibaba launched a joint Natural Interaction Experience Lab to explore next‑generation human‑machine interaction, aiming to give machines five senses and emotional understanding, and to apply this research across retail, smart homes, autonomous driving, and other emerging AI‑driven scenarios.

AIAlibabaHuman-Computer Interaction
0 likes · 8 min read
How Tsinghua and Alibaba Are Shaping the Future of Human‑Machine Natural Interaction
网易UEDC
网易UEDC
Sep 27, 2017 · Fundamentals

Beyond Screens: Rethinking Interfaces for Future Interaction Design

This article explores how interaction designers can expand the concept of interfaces beyond traditional screens, examining the evolution from CUI to NUI and showcasing diverse examples such as elevator panels, smart bikes, conversational robots, shape‑changing metal, and tactile displays for the blind.

Human-Computer InteractionInteraction DesignInterface
0 likes · 9 min read
Beyond Screens: Rethinking Interfaces for Future Interaction Design
Suning Design
Suning Design
May 4, 2017 · Artificial Intelligence

Can Voice Interaction Become the Next Main Human‑Machine Interface?

This article explores the evolution, current capabilities, design challenges, and future scenarios of intelligent voice interaction, arguing that voice will become one of the mainstream ways humans communicate with machines while highlighting technical limits, user experience principles, and suitable application domains.

AIDesignHuman-Computer Interaction
0 likes · 13 min read
Can Voice Interaction Become the Next Main Human‑Machine Interface?
Suning Technology
Suning Technology
Mar 17, 2017 · Artificial Intelligence

Will Intelligent Voice Interaction Become a Mainstream HCI Method?

This article explores the evolution of intelligent voice interaction—from its roots in natural language processing and early products like Siri to its potential to become a primary human-computer interface, discussing technical challenges, design principles, comparative advantages over graphical interfaces, and suitable application scenarios such as automotive, education, and customer service.

AIHuman-Computer Interactiondesign principles
0 likes · 14 min read
Will Intelligent Voice Interaction Become a Mainstream HCI Method?
58UXD
58UXD
Jan 21, 2016 · Fundamentals

Boosting UX with Anticipatory Interaction Design: Real‑World Examples

This article explores anticipatory interaction design, showing how platforms like Taobao, WeChat, browsers, and iOS predict user intentions to streamline sharing and navigation, and discusses two main strategies—aligning with user behavior and adapting to context—to create more efficient, respectful user experiences.

Human-Computer InteractionInteraction DesignProduct Design
0 likes · 8 min read
Boosting UX with Anticipatory Interaction Design: Real‑World Examples
ITPUB
ITPUB
Jan 17, 2016 · User Experience Design

Can Invisible Computers Redefine Everyday Interactions?

The article envisions a future where seamless, context‑aware invisible computers replace fragmented apps, using voice, gesture, AR projection and cognitive AI to simplify tasks like ordering pizza, selecting wine, and navigating, while discussing design challenges, trust, and the evolution of human‑computer interfaces.

Artificial IntelligenceHuman-Computer InteractionUser experience
0 likes · 22 min read
Can Invisible Computers Redefine Everyday Interactions?
Suning Design
Suning Design
Sep 16, 2014 · Fundamentals

Is Touch‑Screen Dominance the Real Future of Interaction Design?

This article reflects on the evolution of human‑computer interaction, questioning whether touch‑screen technology truly advances user experience, and explores alternative interaction concepts such as natural gestures, haptic feedback, and brain‑computer interfaces while emphasizing a human‑centered design approach.

HCIHuman-Computer InteractionInteraction Design
0 likes · 18 min read
Is Touch‑Screen Dominance the Real Future of Interaction Design?
Baidu Tech Salon
Baidu Tech Salon
Jun 23, 2014 · Artificial Intelligence

Algorithms as Evolving Entities: Lessons from Dog Domestication

Just as wolves gradually became dogs by learning human cues, modern algorithms must evolve to comprehend our intentions and values, turning from opaque decision‑makers into humane partners that enhance daily life without friction, lest their unchecked speed and logic create dangerous mismatches.

AlgorithmsHuman-Computer InteractionTechnology Evolution
0 likes · 13 min read
Algorithms as Evolving Entities: Lessons from Dog Domestication