AI Engineering
AI Engineering
Apr 28, 2026 · Artificial Intelligence

Insanely Fast Whisper speeds audio transcription 19× with Flash Attention 2

The open‑source Insanely Fast Whisper CLI tool leverages Flash Attention 2 to accelerate OpenAI Whisper transcription by 19 times—cutting a 2.5‑hour audio from 31 minutes to just 98 seconds on an Nvidia A100—while preserving accuracy and adding multilingual, speaker‑diarization, and precise timestamp features.

CLI toolFlash Attention 2GPU acceleration
0 likes · 4 min read
Insanely Fast Whisper speeds audio transcription 19× with Flash Attention 2