I often want to read the contents of YouTube videos but I don’t want to watch them, and YouTube’s captioning leaves a lot to be desired.
So I wrote a shell script that takes a URL, downloads the video, transcribes it with whisper, and turns it into a clean HTML page for reading.
https://github.com/llimllib/yt-transcribe
---
Another use case is being able to quickly jump to a specific spot inside a video. Could you add timestamps with links that jump directly to that point in the video?
I accomplished something similar by modifying oTranscribe:
- https://otranscribe.netlify.app/?vsl=definedefine
- https://otranscribe.netlify.app/?vsl=letter
---
Finally, I'm a windows user so a whisper.cpp version would be nice~