Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The VOSK library is good for transcription but there seems like a lack of good simple command line frontends for offline transcription of audio files.

mp4grep works and I've been using it but it has some unnecessary features if this is all you want to do (it's mainly designed to cache the transcriptions and let you search them rather than just write them to a text file) and hopefully someone will make a simpler command line transcription tool.



https://alphacephei.com/vosk/install#usage-examples demonstrates the bare-bones vosk-transcriber sample, and there's also https://www.assemblyai.com/blog/getting-started-with-espnet

I wasn't able to play with https://github.com/o-oconnell/mp4grep on ARM.


Ah, vosk-transcriber looks like it's decent, especially if you use srt output so you have timestamps. Probably no reason to use mp4grep for this purpose then.


I'm still hoping for a turn-key open source solution that includes speaker identification.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: