Tagged articles
3 articles
Page 1 of 1
21CTO
21CTO
Jun 28, 2021 · Artificial Intelligence

How Multimodal AI Detects Pornographic Videos: Image & Audio Fusion Explained

This article outlines a multimodal AI framework for detecting pornographic video content by combining image and audio analysis, detailing the challenges of visual and speech-based recognition, describing the DCNet and RANet model architectures, fusion strategies, and reporting experimental accuracy of 93.4% on a 3k test set.

AIAudio ClassificationDeep Learning
0 likes · 5 min read
How Multimodal AI Detects Pornographic Videos: Image & Audio Fusion Explained
Sohu Tech Products
Sohu Tech Products
May 12, 2021 · Artificial Intelligence

Zero‑Basis Food Sound Recognition with ASR: Theory, Workflow, and Complete Python Code

This article introduces the fundamentals of automatic speech recognition (ASR) for food‑sound classification, explains key audio representations and modeling approaches, and provides a fully runnable Python implementation using librosa, TensorFlow/Keras, and classic machine‑learning tools to train and predict on the Tianchi competition dataset.

ASRAudio ClassificationCNN
0 likes · 11 min read
Zero‑Basis Food Sound Recognition with ASR: Theory, Workflow, and Complete Python Code
NetEase Smart Enterprise Tech+
NetEase Smart Enterprise Tech+
Feb 23, 2021 · Artificial Intelligence

How Deep Learning Detects Pornographic and ASMR Audio

This article explains a deep‑learning pipeline that preprocesses audio, extracts FBank features, applies SpecAugment, and uses a CNN‑BI‑LSTM‑Attention model to automatically identify pornographic and ASMR speech for content moderation.

ASMR detectionAudio ClassificationSpecAugment
0 likes · 8 min read
How Deep Learning Detects Pornographic and ASMR Audio