Tag

voice interaction


ZhongAn Tech Team
Dec 8, 2024 · Artificial Intelligence

Weekly AI Digest Issue 5: Voice Interaction Trends, End‑to‑End vs. Chain Integration, and Enterprise Solutions

This issue examines the growing importance of voice interaction in AI, highlights Justin Uberti’s move to OpenAI and the launch of GPT‑4o, compares end‑to‑end large‑model and chain‑integration approaches, and offers practical enterprise deployment scenarios for both weak and strong voice‑based interactions.

Chain Integration · Enterprise Solutions · ai
14 min read
Rare Earth Juejin Tech Community
May 15, 2024 · Artificial Intelligence

OpenAI Unveils GPT‑4o: An Omni‑Capable Multimodal Model Offered Free to All Users

OpenAI introduced GPT‑4o, a free, omni‑capable multimodal model that processes text, audio, and images together, delivers near‑human response latency, showcases impressive live demos, and will soon be available via a discounted API, marking a significant step forward in end‑to‑end AI research.

AI research · GPT-4o · OpenAI
7 min read
Baidu Geek Talk
Aug 1, 2022 · Artificial Intelligence

Sugar BI: AI-Powered Business Intelligence Platform Architecture and Intelligent Visualization

Sugar BI, Baidu Cloud’s AI‑powered business intelligence platform, lets users create professional, zero‑code dashboards in minutes by connecting to 30+ data sources, leveraging Apache ECharts, intelligent chart recommendation, and natural‑language voice interaction to deliver automated analysis, visualization, and predictive insights.

AI-Powered Analytics · Big Data · Chart Recommendation
15 min read
DataFunSummit
Apr 1, 2022 · Artificial Intelligence

Detecting Invalid Queries in Voice Interaction: Non‑Human Interaction and Ambiguous Intent Recognition

This talk presents a comprehensive study of invalid query detection in voice assistants, covering the definition of effective and ineffective queries, challenges of non‑human interaction and ambiguous intent recognition, data collection, model design, experimental results, user‑feedback loops, and future research directions.

Natural Language Understanding · Speech Recognition · invalid query detection
20 min read
DataFunTalk
Mar 20, 2022 · Artificial Intelligence

Detecting Invalid Queries in Voice Interaction: Non‑Human Interaction and Ambiguous Intent Recognition

This talk presents a comprehensive study of invalid query detection in voice assistants, covering the definition and taxonomy of invalid queries, challenges of non‑human interaction and ambiguous intent recognition, data collection and labeling strategies, feature engineering, deep neural network modeling, experimental results, user‑feedback loops, and current performance limits.

Speech Recognition · ai · dialogue system
17 min read
DataFunTalk
Mar 19, 2020 · Artificial Intelligence

Advances in Voice Interaction: 360's Intelligent Dialogue System Architecture and Core Technologies

This article presents a comprehensive overview of 360's voice interaction platform, detailing dialogue system fundamentals, platform architecture, and core technologies such as semantic understanding, dialog management, and question answering, all driven by deep learning and multimodal innovations.

Knowledge Graph · Natural Language Understanding · Speech Recognition
16 min read
iQIYI Technical Product Team
Jan 17, 2020 · Artificial Intelligence

Voice and Language Technologies in Natural Interaction: iQIYI HomeAI Speech Interaction System

The talk introduced iQIYI’s HomeAI platform, which combines user profiling (including voiceprint and age detection) with automatic video semantic extraction to enable natural, multi‑turn voice‑based video search, addressing hot‑content updates, contextual awareness, device environments, and personalized recommendations for screen‑less or accessibility‑focused users.

Natural Language Processing · Speech Recognition · ai
19 min read
360 Quality & Efficiency
May 10, 2019 · Artificial Intelligence

Smart Speaker Voice Interaction Platform: Concepts, Processes, and Testing Metrics

This article introduces the architecture of smart speaker voice interaction systems, covering wake‑word activation, automatic speech recognition (ASR), natural language understanding (NLU), skill processing, text‑to‑speech synthesis (TTS), and the key performance and testing metrics for each component.

ASR · NLU · TTS
11 min read
Tencent Cloud Developer
Feb 13, 2019 · Mobile Development

Tencent Car‑Mounted Mini Program Architecture and Voice Interaction

Tencent’s car‑mounted mini‑program platform layers a JavaScript runtime (TBS), an extended WeChat framework with TAIS voice interaction, and diverse applications, enabling developers to adapt existing mini‑programs for vehicle head‑units with hands‑free voice control, contextual recommendations, safety checks, and cross‑OS support.

Mobile Development · UI adaptation · car mini program
16 min read
DataFunTalk
Dec 13, 2018 · Artificial Intelligence

Machine Reading Comprehension: From Traditional QA Systems to End‑to‑End Models and Voice Interaction Trends

This article presents an overview of machine reading comprehension, covering the evolution from modular question‑answering systems to end‑to‑end neural models, discusses key datasets such as SQuAD and MS MARCO, and explores voice interaction technologies and future industry trends.

BERT · NLP · machine reading comprehension
13 min read
iQIYI Technical Product Team
Sep 14, 2018 · Artificial Intelligence

Limitations of Language Models in Voice Interaction and HomeAI Solutions

iQIYI HomeAI tackles the bottleneck of static language models in voice assistants by separating phonetic and semantic processing, correcting ASR errors at the intent‑recognition layer with pinyin‑enhanced entity correction, thereby reducing error amplification in video‑on‑demand interactions and paving the way for adaptive, personalized voice experiences.

Speech Recognition · ai · intent recognition
7 min read
AntTech
Apr 3, 2018 · Artificial Intelligence

Intelligent IVR Voice Interaction: Architecture, Models, and Deployment at Ant Financial

The article explains how Ant Financial transformed traditional interactive voice response (IVR) into an AI‑driven, natural‑language service platform called MISA, detailing its architecture, its machine‑learning models for question guessing, problem identification, and reverse questioning, and its automated training, and reporting performance gains during high‑traffic events.

IVR · Natural Language Understanding · ai
12 min read