Web application that converts audio and video to text using AI, supporting various formats and self-hosting.
-
Updated
Jan 12, 2025 - Python
Web application that converts audio and video to text using AI, supporting various formats and self-hosting.
A compact (offline) GUI media transcriber that enables you to search for local content based on its spoken words.
Takes audio (mp3) and text input (string) and force aligns the text to the audio. Uses stable-ts and whisperx.
Add a description, image, and links to the stable-ts topic page so that developers can more easily learn about it.
To associate your repository with the stable-ts topic, visit your repo's landing page and select "manage topics."