Tagged articles

FBank

1 articles · Page 1 of 1

Dec 21, 2020 · Artificial Intelligence

Voice Robot Sound Classification: Feature Extraction, VGGish Model, and Optimization Experiments

This article describes the end‑to‑end pipeline of a voice robot, covering speech framing, feature extraction (FBank, MFCC), the VGGish embedding network, various model architectures, experimental results on accuracy and recall, and future directions for improving sound‑type classification.

FBankMFCCVGGish

0 likes · 11 min read

Voice Robot Sound Classification: Feature Extraction, VGGish Model, and Optimization Experiments