Tag

Speech AI

0 views collected around this technical thread.

DataFunTalk
DataFunTalk
Jun 3, 2024 · Artificial Intelligence

Deploying Speech AI Services Quickly with NVIDIA Riva

This article explains how to use NVIDIA Riva to rapidly deploy speech AI services, covering Riva's overview, Chinese ASR model updates, TTS capabilities, customization options, the Quickstart tool, and a Q&A session that clarifies deployment, model fine‑tuning, and integration with NeMo and Triton.

ASRGPU AccelerationNVIDIA Riva
0 likes · 13 min read
Deploying Speech AI Services Quickly with NVIDIA Riva
DataFunTalk
DataFunTalk
Feb 13, 2024 · Artificial Intelligence

An Overview of NVIDIA NeMo: Open‑Source Framework for Speech AI, ASR, TTS, NLP and Large Language Model Training

This article introduces NVIDIA’s open‑source NeMo framework, detailing its PyTorch‑based architecture for Speech AI, ASR and TTS training, NLP and LLM support, GPU‑optimized parallelism, pre‑trained model resources, fine‑tuning techniques, and the accompanying NeMo Aligner and Framework tools.

ASRNVIDIA NeMoPyTorch
0 likes · 18 min read
An Overview of NVIDIA NeMo: Open‑Source Framework for Speech AI, ASR, TTS, NLP and Large Language Model Training
DataFunTalk
DataFunTalk
Jan 26, 2024 · Artificial Intelligence

Efficient Deployment of Speech AI Models on GPUs

This article explains how to efficiently deploy speech AI models—including ASR and TTS—on GPUs using NVIDIA's Triton Inference Server and TensorRT, covering background challenges, GPU‑based solutions, decoding optimizations, Whisper acceleration with TensorRT‑LLM, streaming TTS improvements, voice‑cloning pipelines, future plans, and a Q&A session.

ASRGPUInference
0 likes · 20 min read
Efficient Deployment of Speech AI Models on GPUs
DataFunSummit
DataFunSummit
Apr 18, 2023 · Artificial Intelligence

Best Practices for Deploying Speech AI on GPUs with Triton and TensorRT

This article presents comprehensive best‑practice guidelines for deploying conversational speech AI—including ASR and TTS pipelines—on GPU servers using NVIDIA Triton Inference Server and TensorRT, covering workflow overview, performance optimizations, streaming inference, and real‑world deployment tips.

ASRGPU deploymentSpeech AI
0 likes · 14 min read
Best Practices for Deploying Speech AI on GPUs with Triton and TensorRT
DataFunSummit
DataFunSummit
Dec 3, 2021 · Artificial Intelligence

Real‑Time Voice Dialogue: Practices, Challenges, and Duplex Conversation

This article presents an in‑depth overview of Alibaba's real‑time voice dialogue system, covering the Hotline XiaoMi robot, the unique challenges of spoken interactions such as colloquialism, multimodality and duplex communication, and the research advances in ASR‑robust SLU, emotion detection, colloquial processing, and duplex conversation modeling.

ASRSLUSpeech AI
0 likes · 22 min read
Real‑Time Voice Dialogue: Practices, Challenges, and Duplex Conversation
58 Tech
58 Tech
May 28, 2019 · Artificial Intelligence

Architecture and Design of an AI‑Powered Voice Robot System

The article describes the design and implementation of a voice robot platform, covering its background, layered architecture, dialogue flow, intent recognition techniques, micro‑service backend, and future improvements, highlighting how AI models and telephony integration enable automated multi‑turn voice interactions for sales and service scenarios.

Speech AIVoice Botdialogue system
0 likes · 11 min read
Architecture and Design of an AI‑Powered Voice Robot System
iQIYI Technical Product Team
iQIYI Technical Product Team
Sep 7, 2018 · Artificial Intelligence

iQIYI Technical Salon – AI Technology Practice and Application (Chengdu Session)

On August 25, iQIYI’s Chengdu R&D Center hosted its second Technical Salon, featuring talks on AI-driven content understanding for short‑video feeds, speech synthesis and editing, industry‑standard speech recognition, semantic search ranking, anti‑spam UGC text analysis, and concluding with recruitment invites and a preview of the upcoming Shanghai salon.

AISpeech AIUGC Text Analysis
0 likes · 6 min read
iQIYI Technical Salon – AI Technology Practice and Application (Chengdu Session)