Whisper-WebUI
This was a lab assignment I gave my students at KTH in Nov 2022: https://github.com/ID2223KTH/id2223kth.github.io/blob/master...
See some of their example solutions here (along with their projects in 2022): https://id2223kth.github.io/assignments/project/ServerlessML...
One of the student labs was gamified language learning: you say what the image shows in the language you are learning, and Whisper tells you whether you said it in an understandable way.
This Gradio implementation is a more polished version of their early efforts.
This says it can generate subtitle files, which is something I've wanted from Whisper for a while. But does anyone know of a way to do that with just a CLI tool that I can run locally? Like, ideally, just `whisper-make-subtitles ./*.mp4` to loop over every .mp4 in a directory and create matching subtitle files.
The whisper command at https://github.com/openai/whisper can generate .srt and .vtt subtitle files using --output_format.
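For the "loop over every .mp4" part, a rough sketch of a wrapper might look like the following. It assumes the `whisper` command from openai/whisper is installed and on your PATH; the function names and the `small` model choice are just illustrative, not anything the project ships.

```python
import subprocess
from pathlib import Path


def build_commands(directory, output_format="srt", model="small"):
    """Return one `whisper` command (as an argument list) per .mp4 in `directory`."""
    commands = []
    for video in sorted(Path(directory).glob("*.mp4")):
        commands.append([
            "whisper", str(video),
            "--model", model,                   # assumption: pick whichever model size suits you
            "--output_format", output_format,   # "srt" or "vtt" for subtitles
            "--output_dir", str(video.parent),  # write the subtitle file next to the video
        ])
    return commands


def make_subtitles(directory, output_format="srt"):
    """Run whisper once per video; raises if any invocation fails."""
    for command in build_commands(directory, output_format):
        subprocess.run(command, check=True)
```

Calling `make_subtitles(".")` from the directory with your videos should leave a subtitle file beside each .mp4, named after the video.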
whisper.cpp also has args for this
Well now I feel dumb :P In my defense, that requires finding that option (the README doesn't discuss options) and knowing that .srt/.vtt are subtitle formats. This is great, thanks!
--help does list the info btw, but that wouldn’t have helped if you didn’t know the file format extensions
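In case it helps anyone else who didn't recognise the extensions: .srt is a plain-text format made of numbered cues, each with a `start --> end` timestamp line followed by the text. Here is a minimal sketch of turning timed segments into that layout. The segment shape (dicts with `start`, `end` in seconds and `text`) is an assumption chosen for illustration, and both function names are made up.

```python
def srt_timestamp(seconds):
    """Format seconds as HH:MM:SS,mmm (SRT puts a comma before the milliseconds)."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments):
    """Join segments into SRT text: index, time range, text, blank line between cues."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(seg['start'])} --> {srt_timestamp(seg['end'])}\n"
            f"{seg['text'].strip()}\n"
        )
    return "\n".join(blocks)
```

.vtt (WebVTT) is similar in spirit but starts with a `WEBVTT` header line and uses a dot instead of a comma in the timestamps.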
On Windows I use whisper-standalone-win: https://github.com/Purfview/whisper-standalone-win
It has a few customization features that are nice: https://github.com/Purfview/whisper-standalone-win/discussio...
Works miles better than plain faster-whisper, in my experience. Not sure if there's wildcard support but that's easily scripted.
I've used https://github.com/tomchang25/whisper-auto-transcribe to generate subtitles and then translate them to English and it worked fairly well. It's not professional-level, but it was good enough to understand what they were saying and enjoy foreign TV.
Also check out https://whishper.net