Tagged articles

Mel Spectrogram

Dec 10, 2021 · Artificial Intelligence

Deep Learning for Automatic Speech Recognition (ASR): From Mel Spectrograms to CTC Decoding

This article explains the end-to-end deep-learning pipeline for speech-to-text: audio digitization, preprocessing with librosa, conversion to Mel spectrograms and MFCCs, data augmentation, a CNN-RNN architecture, CTC loss, decoding strategies, and evaluation with word error rate.

ASR · CTC · MFCC

Alibaba Cloud Developer

May 14, 2018 · Artificial Intelligence

How to Build Real-Time Voice Recognition on Mobile with TensorFlow Lite

This article explains how to implement client-side human voice recognition on mobile devices using TensorFlow Lite. It details mel-spectrogram feature extraction, optimizations such as the use of the ARM instruction set and multithreading, the choice of an Inception-v3 CNN as the model, training procedures, and deployment steps.

CNN · Mel Spectrogram · TensorFlow Lite
