Tagged articles
4 articles
Page 1 of 1
DataFunTalk
DataFunTalk
Sep 21, 2023 · Artificial Intelligence

2023 Chinese Continuous Visual Speech Recognition Challenge (CNVSRC) Overview

The 2023 Chinese Continuous Visual Speech Recognition Challenge (CNVSRC), organized by Tsinghua University and partners, introduces the large-scale CN-CVS dataset, defines single- and multi-speaker lip‑reading tasks, provides baseline Conformer models, outlines registration, data access, evaluation metrics, and competition schedule.

ChallengeConformerDataset
0 likes · 7 min read
2023 Chinese Continuous Visual Speech Recognition Challenge (CNVSRC) Overview
Zuoyebang Tech Team
Zuoyebang Tech Team
May 19, 2022 · Artificial Intelligence

How to Achieve High‑Quality TTS with Only Minutes of Data

This article reviews neural speech synthesis, explains why large high‑quality paired data are essential, and presents a range of low‑resource solutions—including semi‑supervised pre‑training, cross‑language transfer, speaker embedding, and Conformer‑based model upgrades—demonstrating how the Zuoyebang team built a robust TTS system with as little as 7‑minute speaker recordings.

ConformerFastspeech2Speech synthesis
0 likes · 15 min read
How to Achieve High‑Quality TTS with Only Minutes of Data