Right now the limit is 100MB because of Cloudflare, no length limit, costs $1.30/h to run this, that's enough for a 2x RTX A6000 on Vast.ai can you check out the specs there
I've got people to test and usually with the 'medium' and 'large' models it works really efficiently. Honestly, I just use the large model for everything, because might as well have the best quality if you're going to do the effort.
Well the code is open source, I don't know what I plan to do with this, depends on how people like it, but for the meantime you can use it to transcribe stuff for free which is a victory unto itself
I just rebooted the server, now if the websocket disconnects (person closes the browser) it will kill the processing and move the queue, so that should help unclog the queue. I'm going to add a couple more queue touchups and then it should be stable again (no reboots), but running well in the meantime
Yeah I apologize, the queue is a little messed up. It's not showing your progress properly and it's not stopping people's processing if they left (their websocket dies). I'm going to fix these and reboot the server and the experience will be a lot better, sorry. It is working properly though and transcribing all this stuff but the queue needs some TLC, brb.
Yeah someone submitted a PR for those to be fixed, I'm just wary about restarting the server because I haven't setup a way to be able to reboot without losing the websockets
Well those inputs should all error unless they are a valid value.
Yes if nothing is input it will automatically detect the language based on the first 30s of input
This was just a personal project a couple hours ago so it's not setup properly to do safe reboots and a lot of other things, I was just using it locally and now it's in in the wild, will take some time to get everything refined and professional