Tagged articles
32 articles
Page 1 of 1
Alibaba Cloud Observability
Alibaba Cloud Observability
Mar 30, 2026 · Industry Insights

How RocketMQ LiteTopic Redesign Boosts High‑Concurrency AI Voice Interaction

This article analyzes the bottlenecks of real‑time AI voice agents in high‑concurrency scenarios and presents a cloud‑native messaging architecture built on Alibaba Cloud RocketMQ LiteTopic that ensures session stickiness, low latency, automatic channel management, and observable operations for scalable, reliable voice interactions.

LiteTopicMessage ArchitectureObservability
0 likes · 14 min read
How RocketMQ LiteTopic Redesign Boosts High‑Concurrency AI Voice Interaction
Alibaba Cloud Native
Alibaba Cloud Native
Mar 25, 2026 · Cloud Native

Building a Scalable Real‑Time Voice AI Agent with RocketMQ LiteTopic

This article analyzes the challenges of high‑concurrency voice AI agents—such as massive session management, tiny packet transmission, strict latency, and asynchronous result handling—and presents a detailed Cloud‑Native architecture using Alibaba Cloud RocketMQ LiteTopic to achieve stable, low‑latency, and automatically managed real‑time voice message pipelines.

AIMessage Queuevoice interaction
0 likes · 13 min read
Building a Scalable Real‑Time Voice AI Agent with RocketMQ LiteTopic
Weekly Large Model Application
Weekly Large Model Application
Mar 17, 2026 · Artificial Intelligence

Essential Features Every Voice Interaction System Must Support

The article provides a comprehensive analysis of core voice interaction system capabilities—including barge‑in, turn‑taking, multi‑turn dialogue, intent recognition, speaker identification, streaming latency, noise robustness, multilingual support, emotion handling, personalization, security, and deployment considerations—highlighting typical scenarios such as smart speakers, in‑car assistants, call centers, and meeting transcription.

ASRLatencyTTS
0 likes · 11 min read
Essential Features Every Voice Interaction System Must Support
BirdNest Tech Talk
BirdNest Tech Talk
Mar 2, 2026 · Artificial Intelligence

45 Powerful Claude Code Tips to Supercharge Your AI‑Powered Development

This comprehensive guide walks you through 45 practical Claude Code techniques—from customizing the status bar and mastering slash commands to using Git worktrees, managing context, automating tasks with containers, and leveraging plugins—providing concrete examples, code snippets, and step‑by‑step workflows that let you harness the full potential of Claude Code in real‑world software development.

AI DevelopmentAutomationClaude Code
0 likes · 65 min read
45 Powerful Claude Code Tips to Supercharge Your AI‑Powered Development
AI Engineering
AI Engineering
Jan 25, 2026 · Artificial Intelligence

ClawdBot Goes Viral: First AI Assistant Video Tutorial Inside

ClawdBot is a 24‑hour AI assistant that can clean your inbox, schedule meetings, analyze code, and execute voice‑controlled tasks; the guide explains its architecture, two deployment options (local or AWS), low cost, security pairing, quick tests, advanced features, and real‑world use cases.

AI AssistantAWSAutomation
0 likes · 8 min read
ClawdBot Goes Viral: First AI Assistant Video Tutorial Inside
DataFunTalk
DataFunTalk
Nov 5, 2025 · Artificial Intelligence

Why AI Agents Are Booming in 2025: Key Trends, Opportunities, and Market Insights

The 2025 AI Agent Bible report reveals that voice interaction, payment infrastructure, data competition, monitoring tools, and M&A activity are reshaping the AI Agent market, highlighting lucrative coding agents, high‑valuation customer‑service agents, cost pressures, and emerging vertical opportunities for entrepreneurs.

AI Startup FundingAI agentsAgent Monetization
0 likes · 21 min read
Why AI Agents Are Booming in 2025: Key Trends, Opportunities, and Market Insights
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Feb 18, 2025 · Artificial Intelligence

One-Click Deployment of Cutting-Edge Text-to-Video and Voice Interaction Models

This article introduces the state‑of‑the‑art Step‑Video‑T2V text‑to‑video model and the Step‑Audio‑Chat voice interaction model, outlines their technical specifications and benchmark results, and provides a detailed step‑by‑step guide for deploying both models with a single click using Alibaba Cloud's PAI Model Gallery.

AI Model DeploymentPAI Model Gallerystate-of-the-art
0 likes · 9 min read
One-Click Deployment of Cutting-Edge Text-to-Video and Voice Interaction Models
ZhongAn Tech Team
ZhongAn Tech Team
Dec 8, 2024 · Artificial Intelligence

Weekly AI Digest Issue 5: Voice Interaction Trends, End‑to‑End vs. Chain Integration, and Enterprise Solutions

This issue examines the growing importance of voice interaction in AI, highlights Justin Uberti’s move to OpenAI and the launch of GPT‑4o, compares end‑to‑end large‑model and chain‑integration approaches, and offers practical enterprise deployment scenarios for both weak and strong voice‑based interactions.

AIChain IntegrationEnd-to-End
0 likes · 14 min read
Weekly AI Digest Issue 5: Voice Interaction Trends, End‑to‑End vs. Chain Integration, and Enterprise Solutions
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
May 15, 2024 · Artificial Intelligence

OpenAI Unveils GPT‑4o: An Omni‑Capable Multimodal Model Offered Free to All Users

OpenAI introduced GPT‑4o, a free, omni‑capable multimodal model that processes text, audio, and images together, delivers near‑human response latency, showcases impressive live demos, and will soon be available via a discounted API, marking a significant step forward in end‑to‑end AI research.

AI researchGPT-4oMultimodal AI
0 likes · 7 min read
OpenAI Unveils GPT‑4o: An Omni‑Capable Multimodal Model Offered Free to All Users
Meituan Technology Team
Meituan Technology Team
Mar 9, 2023 · Artificial Intelligence

Implementation and Practice of MRCP in Meituan Voice Interaction

This article details Meituan’s adoption of the Media Resource Control Protocol (MRCP) to standardize ASR and TTS integration, describing its architecture, key components, high‑availability deployment, and measured performance gains such as up to 55% latency reduction and a 15% increase in outbound call success rates.

ASRMRCPMeituan
0 likes · 24 min read
Implementation and Practice of MRCP in Meituan Voice Interaction
Baidu Geek Talk
Baidu Geek Talk
Aug 1, 2022 · Artificial Intelligence

Sugar BI: AI-Powered Business Intelligence Platform Architecture and Intelligent Visualization

Sugar BI, Baidu Cloud’s AI‑powered business intelligence platform, lets users create professional, zero‑code dashboards in minutes by connecting to 30+ data sources, leveraging Apache ECharts, intelligent chart recommendation, and natural‑language voice interaction to deliver automated analysis, visualization, and predictive insights.

AI-Powered AnalyticsBig DataBusiness Intelligence
0 likes · 15 min read
Sugar BI: AI-Powered Business Intelligence Platform Architecture and Intelligent Visualization
DataFunSummit
DataFunSummit
Apr 1, 2022 · Artificial Intelligence

Detecting Invalid Queries in Voice Interaction: Non‑Human Interaction and Ambiguous Intent Recognition

This talk presents a comprehensive study of invalid query detection in voice assistants, covering the definition of effective and ineffective queries, challenges of non‑human interaction and ambiguous intent recognition, data collection, model design, experimental results, user‑feedback loops, and future research directions.

invalid query detectionmachine learningnatural language understanding
0 likes · 20 min read
Detecting Invalid Queries in Voice Interaction: Non‑Human Interaction and Ambiguous Intent Recognition
DataFunTalk
DataFunTalk
Mar 20, 2022 · Artificial Intelligence

Detecting Invalid Queries in Voice Interaction: Non‑Human Interaction and Ambiguous Intent Recognition

This talk presents a comprehensive study of invalid query detection in voice assistants, covering the definition and taxonomy of invalid queries, challenges of non‑human interaction and ambiguous intent recognition, data collection and labeling strategies, feature engineering, deep neural network modeling, experimental results, user‑feedback loops, and current performance limits.

AIdialogue systeminvalid query
0 likes · 17 min read
Detecting Invalid Queries in Voice Interaction: Non‑Human Interaction and Ambiguous Intent Recognition
Volcano Engine Developer Services
Volcano Engine Developer Services
Oct 12, 2021 · Artificial Intelligence

How ByteDance’s AI‑Powered Audio Signal Processing Elevates Voice, VR, and VoIP

This article reviews ByteDance’s intelligent audio signal processing technologies, covering foundational algorithms, multimodal audio scaling, sound‑field reconstruction, and high‑quality low‑latency VoIP, and explains how these advances improve audio capture, immersive media, and smart voice interaction across devices.

AR/VR audioMultimodal AIVoIP
0 likes · 13 min read
How ByteDance’s AI‑Powered Audio Signal Processing Elevates Voice, VR, and VoIP
DataFunTalk
DataFunTalk
Mar 19, 2020 · Artificial Intelligence

Advances in Voice Interaction: 360's Intelligent Dialogue System Architecture and Core Technologies

This article presents a comprehensive overview of 360's voice interaction platform, detailing dialogue system fundamentals, platform architecture, and core technologies such as semantic understanding, dialog management, and question answering, all driven by deep learning and multimodal innovations.

AIKnowledge Graphdialogue system
0 likes · 16 min read
Advances in Voice Interaction: 360's Intelligent Dialogue System Architecture and Core Technologies
iQIYI Technical Product Team
iQIYI Technical Product Team
Jan 17, 2020 · Artificial Intelligence

Voice and Language Technologies in Natural Interaction: iQIYI HomeAI Speech Interaction System

The talk introduced iQIYI’s HomeAI platform, which combines user profiling (including voiceprint and age detection) with automatic video semantic extraction to enable natural, multi‑turn voice‑based video search—addressing hot‑content updates, contextual awareness, device environments, and personalized recommendations for screen‑less or accessibility‑focused users.

AIContext-Awareentity extraction
0 likes · 19 min read
Voice and Language Technologies in Natural Interaction: iQIYI HomeAI Speech Interaction System
Tencent Cloud Developer
Tencent Cloud Developer
Feb 13, 2019 · Mobile Development

Tencent Car‑Mounted Mini Program Architecture and Voice Interaction

Tencent’s car‑mounted mini‑program platform layers a JavaScript runtime (TBS), an extended WeChat framework with TAIS voice interaction, and diverse applications, enabling developers to adapt existing mini‑programs for vehicle head‑units with hands‑free voice control, contextual recommendations, safety checks, and cross‑OS support.

Mobile DevelopmentTencentUI adaptation
0 likes · 16 min read
Tencent Car‑Mounted Mini Program Architecture and Voice Interaction
iQIYI Technical Product Team
iQIYI Technical Product Team
Sep 14, 2018 · Artificial Intelligence

Limitations of Language Models in Voice Interaction and HomeAI Solutions

iQIYI HomeAI tackles the bottleneck of static language models in voice assistants by separating phonetic and semantic processing, correcting ASR errors at the intent‑recognition layer with pinyin‑enhanced entity correction, thereby reducing error amplification in video‑on‑demand interactions and paving the way for adaptive, personalized voice experiences.

AILanguage Modelintent recognition
0 likes · 7 min read
Limitations of Language Models in Voice Interaction and HomeAI Solutions
AntTech
AntTech
Apr 3, 2018 · Artificial Intelligence

Intelligent IVR Voice Interaction: Architecture, Models, and Deployment at Ant Financial

The article explains how Ant Financial transformed traditional interactive voice response (IVR) into an AI‑driven, natural‑language service platform called MISA, detailing its architecture, machine‑learning models for guess‑question, problem identification, reverse questioning, and automated training, and reporting performance gains during high‑traffic events.

AIAnt FinancialIVR
0 likes · 12 min read
Intelligent IVR Voice Interaction: Architecture, Models, and Deployment at Ant Financial
Alibaba Cloud Developer
Alibaba Cloud Developer
Apr 3, 2018 · Artificial Intelligence

How Voice AI Is Powering Alibaba's IoT Revolution

In this keynote, Alibaba's chief scientist explains how voice AI serves as the natural interface for IoT, detailing the company's strategy to connect billions of devices through cloud infrastructure, AI-driven perception, and multimodal interaction across consumer and industrial applications.

AlibabaIoTartificial intelligence
0 likes · 14 min read
How Voice AI Is Powering Alibaba's IoT Revolution
Hujiang Design Center
Hujiang Design Center
Feb 1, 2018 · Fundamentals

What Are the 6 Interaction Design Trends Shaping 2018?

This article outlines six 2018 interaction design trends—including all‑sense experiences, screen‑less interfaces, emotionalized devices, natural voice interaction, AI‑driven personalization, cost‑effective interactions, and seamless online‑offline integration—backed by real‑world product examples and visual illustrations.

AIAR/VRInteraction Design
0 likes · 14 min read
What Are the 6 Interaction Design Trends Shaping 2018?
网易UEDC
网易UEDC
Oct 9, 2017 · Artificial Intelligence

What Does a Voice Interaction Designer Actually Do? A Practical Guide

This article explores the rise of voice interaction design, outlines the responsibilities and skills of a voice UI designer, reviews industry examples from major tech firms, and recommends essential resources for anyone interested in building conversational experiences.

Conversational AIProduct DesignVUI design
0 likes · 11 min read
What Does a Voice Interaction Designer Actually Do? A Practical Guide
21CTO
21CTO
Jul 15, 2017 · Artificial Intelligence

Build Your Own AI‑Powered Quiz Game with Google Assistant and Firebase

Google has open‑sourced a complete AI quiz‑game framework that lets developers create voice‑interactive trivia apps for Google Assistant, Android, iOS and smart speakers using API.AI, Cloud Functions, and Firebase with minimal effort.

AI Quiz GameCloud FunctionsDialogflow
0 likes · 6 min read
Build Your Own AI‑Powered Quiz Game with Google Assistant and Firebase
Suning Design
Suning Design
May 4, 2017 · Artificial Intelligence

Can Voice Interaction Become the Next Main Human‑Machine Interface?

This article explores the evolution, current capabilities, design challenges, and future scenarios of intelligent voice interaction, arguing that voice will become one of the mainstream ways humans communicate with machines while highlighting technical limits, user experience principles, and suitable application domains.

AIDesignHuman-Computer Interaction
0 likes · 13 min read
Can Voice Interaction Become the Next Main Human‑Machine Interface?
Suning Technology
Suning Technology
Mar 17, 2017 · Artificial Intelligence

Will Intelligent Voice Interaction Become a Mainstream HCI Method?

This article explores the evolution of intelligent voice interaction—from its roots in natural language processing and early products like Siri to its potential to become a primary human-computer interface, discussing technical challenges, design principles, comparative advantages over graphical interfaces, and suitable application scenarios such as automotive, education, and customer service.

AIHuman-Computer Interactiondesign principles
0 likes · 14 min read
Will Intelligent Voice Interaction Become a Mainstream HCI Method?
JD.com Experience Design Center
JD.com Experience Design Center
Oct 20, 2016 · Artificial Intelligence

Why Voice Interaction Outperforms Visual UI for Multitasking

Voice interaction offers scenario‑aware, hands‑free experiences that let users handle multiple tasks simultaneously, overcoming the visual focus of traditional GUIs, and its design benefits from Nielsen’s usability heuristics, cloud AI, and big‑data‑driven context awareness.

Nielsen heuristicsUX designartificial intelligence
0 likes · 10 min read
Why Voice Interaction Outperforms Visual UI for Multitasking