Tag

speech translation


DataFunTalk
Sep 19, 2023 · Artificial Intelligence

Simultaneous Speech Translation: Technical Background, System Architecture, and Key Challenges

This article reviews the technical background of simultaneous speech translation, compares offline and real‑time scenarios, details ASR and MT technologies, describes the system architecture and design strategies, and discusses the major challenges and solutions for deploying robust, low‑latency translation services.

ASR · Huawei · deep learning
16 min read
DataFunTalk
Jul 30, 2022 · Artificial Intelligence

Technical Analysis of Huawei’s Offline Speech‑to‑Text and Length‑Constrained Speech Translation Systems in IWSLT 2022

This article reviews the IWSLT 2022 competition tasks, explains Huawei’s cascade offline speech‑to‑text translation pipeline, details four major technical innovations (ensemble‑based ASR denoising, context‑aware re‑ranking, domain‑controlled training, and length‑control strategies), and presents experimental results that demonstrate Huawei’s leading performance across multiple language directions.

ASR · Huawei · IWSLT
18 min read
DataFunTalk
Jul 7, 2022 · Artificial Intelligence

Huawei Translation’s Achievements and Technical Solutions in IWSLT 2022 Speech Translation Tasks

This article reviews Huawei Translation’s top-ranking results in the IWSLT 2022 speech translation competition across the speech‑to‑speech, offline speech‑to‑text, and length‑controlled translation tasks, and details its cascade and end‑to‑end approaches, including domain‑controlled ASR, context‑aware MT re‑ranking, and VITS‑based TTS.

ASR · Huawei · IWSLT
13 min read
DataFunTalk
Dec 14, 2021 · Artificial Intelligence

Speech Translation: Enterprise Applications and Research

This article presents an overview of speech translation, discusses its motivations and applications at ByteDance, compares cascade and end‑to‑end modeling approaches, introduces advanced encoder and decoder designs such as LUT, Chimera, and COSTT, outlines progressive multi‑task training and data‑augmentation strategies, and shares experimental results and Q&A.

AI · Audio Processing · end-to-end models
16 min read
DataFunSummit
Nov 18, 2021 · Artificial Intelligence

Enterprise Applications and Research of Speech Translation

This article reviews recent advances in speech translation, discusses ByteDance's practical deployments, compares cascade and end‑to‑end modeling approaches, introduces improved encoder‑decoder architectures and training strategies, and reports state‑of‑the‑art results on the IWSLT 2021 benchmark.

AI · ByteDance · cascade model
15 min read
DataFunTalk
Mar 10, 2020 · Artificial Intelligence

Interspeech 2019 Highlights: End‑to‑End Speech AI Technologies and Key Paper Summaries

The article reviews Interspeech 2019, summarizing major trends and representative papers in end‑to‑end speech recognition, synthesis, natural language understanding, speaker recognition, and speech translation, while also highlighting best student papers and providing resources for further study.

AI · Interspeech 2019 · Natural Language Understanding
24 min read