Whisper is a speech-to-text model built by OpenAI which can be easily run on local hardware, even with the largest model sizes. I will present a Streamlit-based tool to easily extract text snippets from video files using the Whisper model without being reliant on the command line.
Read More ›