Tagged articles
21 articles
Baidu Geek Talk
Sep 15, 2025 · Artificial Intelligence

How Baidu’s AI Navigation Turns Voice Commands into Precise Actions

This article explains how Baidu Map’s AI navigation system converts spoken queries into accurate map instructions by combining speech recognition, intent parsing, large‑language‑model reasoning, tool calling, and memory‑reflection techniques, showcasing the underlying technologies that enable instant, context‑aware responses.

AI · LLM · Map Services
0 likes · 13 min read
Baidu Maps Tech Team
Jul 31, 2025 · Artificial Intelligence

How Baidu’s AI Voice Assistant Turns Speech into Precise Navigation Commands

This article explains how Baidu Map’s AI voice assistant converts spoken commands into precise navigation actions by detailing the speech‑to‑text pipeline, intent parsing, template and generative approaches, tool‑calling mechanisms, memory and reflection capabilities, and future directions for intelligent agents.

AI · Intent Parsing · LLM
0 likes · 14 min read
Huolala Tech
Jul 9, 2024 · Artificial Intelligence

Building an In-Car Voice Assistant: From Wake‑Word to NLP

This article details the end‑to‑end development of an in‑vehicle voice assistant, covering motivation, functional design, technology stack selection, dialogue flow, privacy, third‑party integration, wake‑word detection, on‑device speech recognition, noise filtering, NLP processing, and deployment considerations.

Voice Assistant · in‑car technology · natural language processing
0 likes · 18 min read
DataFunTalk
Mar 15, 2024 · Artificial Intelligence

Application of Agent Technology in Voice Assistant Scenarios

Senior algorithm engineer Qi Jianwei from Xiaomi presents a comprehensive overview of building a large‑model‑centric Agent framework for voice assistants, covering prompt design, information retrieval, RAG processes, and future optimization directions to enhance performance and stability.

Agent · Prompt engineering · Voice Assistant
0 likes · 2 min read
DataFunSummit
Nov 20, 2022 · Artificial Intelligence

NLP Technology Applications and Research in Voice Assistants

This article presents an in‑depth overview of NLP techniques used in voice assistants, covering the end‑to‑end conversational AI pipeline, intent and slot modeling, multi‑turn dialog management, model deployment pipelines, quantization methods, and self‑learning strategies for continuous improvement.

Conversational AI · Model Quantization · NLP
0 likes · 30 min read
DataFunTalk
Dec 28, 2021 · Artificial Intelligence

Evaluation Framework and Methodology for OPPO XiaoBu AI Assistant

This article presents a comprehensive evaluation framework for OPPO's XiaoBu AI assistant, covering evaluation concepts, objectives, five key elements, sampling methods, dimension selection, annotation scoring, report generation, and a detailed Q&A that illustrates practical metrics and processes for voice and search services.

AI Evaluation · Metrics · OPPO
0 likes · 23 min read
DataFunSummit
Dec 27, 2021 · Artificial Intelligence

Evaluation Framework and Methodology for OPPO XiaoBu AI Assistant

This article presents a comprehensive evaluation framework for OPPO's XiaoBu AI assistant, covering the concept and purpose of evaluation, the five key evaluation elements, data sampling strategies, dimension and rule selection, annotation scoring, reporting guidelines, and detailed procedures for assessing wake‑up, ASR, NLU, and TTS performance.

AI Evaluation · Metrics · Reporting
0 likes · 20 min read
DataFunTalk
Nov 5, 2021 · Artificial Intelligence

End-to-End Entity Extraction for Tmall Genie: Speech2Slot Model and Unsupervised Pre‑Training

This article presents the business background of Tmall Genie’s voice‑driven content‑on‑demand service, critiques the traditional pipeline for entity extraction, and details an end‑to‑end speech‑semantic model—including the Speech2Slot architecture, knowledge‑enhanced encoding, and Phoneme‑BERT unsupervised pre‑training—demonstrating significant performance gains in both generation and classification tasks.

Voice Assistant · end-to-end model · entity extraction
0 likes · 14 min read
58 Tech
Jun 16, 2021 · Artificial Intelligence

Improving Text Matching Accuracy in Voice Assistants: Experiments with Siamese Networks, BERT Models, and Advanced Tricks

This article evaluates classic Siamese networks, various BERT‑based pretrained models, and several training tricks (adversarial training, k‑fold cross‑validation, and model ensembling) on two datasets: a public sentence‑similarity competition dataset and an internal standard‑question matching dataset for a voice assistant, ultimately raising accuracy from 97.23% to 99.5%.

BERT · Siamese Network · Voice Assistant
0 likes · 15 min read
Didi Tech
Apr 29, 2021 · Artificial Intelligence

Design and Architecture of DiDi Driver-side Intelligent Voice Assistant "XiaoDi"

The document details DiDi’s driver‑side intelligent voice assistant “XiaoDi,” describing its three‑layer architecture—audio source switching controller, semantic‑parsing core, and business API—along with conflict‑resolution mechanisms, multi‑turn dialogue handling, and a four‑region UI design that together enhance driver safety, convenience, and well‑being.

AI · Driver App · Mobile Development
0 likes · 30 min read
DataFunTalk
Feb 11, 2021 · Artificial Intelligence

How to Build Successful AI Products: Insights on AI Development, NLP, and Product Strategies

This article explores the current state of AI, the evolution of NLP and voice assistants, common pitfalls in AI product development, and practical product‑management methods—including user segmentation, metric design, and lifecycle planning—to help engineers and product managers deliver effective AI‑driven solutions.

AI · NLP · User experience
0 likes · 19 min read
Alibaba Cloud Developer
Apr 7, 2020 · Artificial Intelligence

How Does Alibaba’s Tmall Genie Achieve Full‑Duplex Natural Dialogue?

This article explains the concept of full‑duplex natural dialogue for Alibaba’s Tmall Genie, illustrates interaction scenarios, and details the technical solution covering device‑side management, speech recognition, language understanding, synthesis, dialogue control, duration handling, and conversation flow.

ASR · Human-Computer Interaction · NLU
0 likes · 8 min read
Hujiang Technology
May 17, 2018 · Artificial Intelligence

Technical Analysis of Google Duplex: Achieving Natural Conversational Interaction

The article provides a detailed technical breakdown of Google Duplex, explaining how its speech recognition, natural language understanding, dialogue management, and speech synthesis modules work together to produce task‑oriented, natural‑sounding conversations and discussing challenges such as handling refusals, conditional responses, context management, and future scalability and safety concerns.

Google Duplex · Voice Assistant · artificial intelligence
0 likes · 10 min read
AntTech
May 10, 2018 · Artificial Intelligence

MISA – Ant Financial’s AI Voice Service Assistant: Architecture, Deep‑Learning Models, and the AI Competition

The article introduces MISA, Ant Financial’s AI‑driven voice service assistant that uses deep‑learning models such as CNN and RNN for problem guessing, identification, and interactive clarification, details its system components and evaluation metrics, and describes the related AI competition focused on sentence‑similarity calculation.

AI · Deep Learning · Voice Assistant
0 likes · 14 min read
Meituan Technology Team
Mar 29, 2018 · Artificial Intelligence

AI-Powered Smart Assistant for Meituan Delivery Riders

Meituan’s AI‑powered Rider Smart Assistant uses voice‑based interaction, real‑time routing, ETA prediction and massive GPS data to solve NP‑hard dispatch problems, cut manual phone calls, shorten order‑acceptance latency and rider wait times, and deliver safer, faster, more efficient same‑city logistics for riders and customers.

AI · Logistics · Voice Assistant
0 likes · 22 min read