Tagged articles

DFSMN

5 articles

Nov 1, 2018 · Artificial Intelligence

How DFSMN Cuts Speech Synthesis Model Size by 75% and Quadruples Speed

Researchers propose a Deep Feedforward Sequential Memory Network (DFSMN) for speech synthesis that matches BLSTM quality while using only a quarter of the model size and achieving four times faster inference, making it ideal for memory‑constrained, real‑time edge devices.

DFSMN · Speech synthesis · deep learning

0 likes · 10 min read

Alibaba Cloud Developer

Oct 31, 2018 · Artificial Intelligence

How Deep‑FSMN and Low Frame Rate Accelerate Speech Recognition

This article introduces the Deep‑FSMN (DFSMN) architecture and its integration with low‑frame‑rate (LFR) processing, showing how the combined LFR‑DFSMN acoustic model achieves higher accuracy, smaller model size, faster training, and lower latency than traditional BLSTM‑based speech recognition systems on both English and Chinese large‑vocabulary tasks.

AI · DFSMN · acoustic modeling

0 likes · 12 min read

Alibaba Cloud Developer

Oct 23, 2018 · Artificial Intelligence

How DFSMN Cuts Speech Synthesis Model Size by 75% While Quadrupling Speed

This paper introduces a Deep Feedforward Sequential Memory Network (DFSMN) for statistical parametric speech synthesis that matches BLSTM quality with only a quarter of the model size and four times faster inference, making it ideal for memory‑constrained, real‑time IoT devices.

DFSMN · IoT devices · Real-time inference

0 likes · 10 min read

Alibaba Cloud Developer

Jun 8, 2018 · Artificial Intelligence

How Alibaba’s DFSMN Model Pushes Speech Recognition Accuracy to 96.04%

Alibaba’s DAMO Academy unveiled its DFSMN speech‑recognition model and open‑sourced it on GitHub. The model sets a record 96.04% accuracy on LibriSpeech, trains three times faster than LSTM, and powers real‑world demos such as AI cashiers and metro ticket machines.

AI · Alibaba · DFSMN

0 likes · 3 min read

Alibaba Cloud Developer

Jun 8, 2018 · Artificial Intelligence

How DFSMN Sets a New Record in Speech Recognition Accuracy and Speed

Alibaba’s DAMO Academy has open‑sourced the Deep Feedforward Sequential Memory Network (DFSMN), a next‑generation speech‑recognition model that achieves a world‑record 96.04% accuracy on LibriSpeech, trains three times faster than LSTM, halves model size, and dramatically speeds up real‑time decoding.

DFSMN · acoustic modeling · deep learning

0 likes · 17 min read
