Hacker Newsnew | past | comments | ask | show | jobs | submit | braden-w's commentslogin

I totally agree with this hot take. Whispering is not there yet, but I eventually want it to store as many of the transcripts as plain text markdown, alongside your audio files, in a folder.

The idea is that as we add more local-first apps into the ecosystem (writing, etc.), they're share this context. Transcription would benefit immensely if you also had a writing app that you could trust to store your data. To execute that vision, we needed a transcription app where we have control over how data is stored, and the best solution was to build our own.


Thank you for the support! I'm glad to hear that it's been helping you since the start of the year. Totally agree on the transformation prompts. It's challenging to get the transformation model to not occasionally get short-circuited, especially when I end up having it format a dictated prompt. Instead of formatting, it executes the prompt.

Sorry to hear about the auto-paste feature and taskbar icons. We'll try to restore these in the future, and you can track taskbar here:

https://github.com/epicenter-so/epicenter/issues/607


Thanks for sharing a great alternative! It seems that that setup can go a long way for Linux users.


Thanks for shouting out some other great alternatives! The UI looks really clean.

Right now, the pricing is entirely free, and we are trying to expand our local model support to make it truly free. Subscriptions are up to the user right now.

Thanks for giving us a shot, and no pressure on using it! At the end of the day, I just want to build something that is open source and trustworthy, and hopefully will fit into the Epicenter ecosystem, the data layer that I talked about earlier in my post.


Thanks for flagging this, and sorry that this is happening! Does downloading the model manually work? I wonder if it's related to this:

https://github.com/epicenter-so/epicenter/issues/669


I don't think it's the same error, but without a good error message I don't know.

I did manually download the models and associated them, which are great but then the audio didn't work. On the browser version, it never asks me for permission for an audio device, and on the native version, it makes a file of 0 length and then complains it can't read the contents.

My read is that the project looks very interesting, and I'd love a FLOSS replacement for Aqua Voice, but this software isn't ready for everyday use yet, at least not on Linux.

I'd love to help somehow, whether that's a donation or experimenting if you point me to somewhere.


Thank you for the support! Sorry for the issues with FFmpeg. This is an active issue that we're tracking:

https://github.com/epicenter-so/epicenter/issues/674

We hope to fix notifications too thank you for the feedback and happy to hear you liked the system prompt!


Sorry for the delayed response, thank you for sharing these articles! I agree. I hope that we get a lot better open-source STT options in the future.


I really want to run it locally on a phone, but as a developer it's scary to think about making a native mobile app and having to work with the iOS toolchain I don't have bandwidth at the moment, but if anyone knows of any OSS mobile alternatives, feel free to drop them!


Thank you for the support, and agreed on OS-level integration. At least for me, I have trouble trusting any app unless they are open source and have a transparent codebase for audit :)


Awesome, thank you so much for bringing this to my attention and including it in the thread! Always cool to see other open source projects :)


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: