Audio Processing App

Upload audio files (MP3 or M4A) for processing and transcription.
Intended use case: transcription of interviews.

Upload Audio Files

Default settings are working very well. Silence removal helps to reduce hallucination.

Remove Silence

Chunking reduces the load on the model. 10min chunks work really good.

Enable Chunking

tiny is the fastest, but the worst quality. Large-v3-turbo is the best, but slower.

Whisper Model Size

Language

Full Transcription

Segmented Transcription

Download Processed Files and Transcripts (ZIP)