Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> yt-fts is a simple python script that uses yt-dlp to scrape all of a youtube channels subtitles and load them into an sqlite database that is searchable from the command line.

Critically, this is per channel. I wonder if we can optionally configure this to share the downloaded transcripts to a central repository so eventually a good proportion of youtube's transcripts could be downloaded as one big text file.



> share the downloaded transcripts to a central repository

Sure, are you willing to host it and handle the absolutely inevitable legal issues?


I wish IPFS was better. It'd be an obvious solution to this. Content hash the YouTube ID and then distribute hosting.


There is something similar to a central repository at https://filmot.com/




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: