Tag

FBank

0 views collected around this technical thread.

58 Tech
58 Tech
Dec 21, 2020 · Artificial Intelligence

Voice Robot Sound Classification: Feature Extraction, VGGish Model, and Optimization Experiments

This article describes the end‑to‑end pipeline of a voice robot, covering speech framing, feature extraction (FBank, MFCC), the VGGish embedding network, various model architectures, experimental results on accuracy and recall, and future directions for improving sound‑type classification.

Deep LearningFBankMFCC
0 likes · 11 min read
Voice Robot Sound Classification: Feature Extraction, VGGish Model, and Optimization Experiments