Hacker News new | past | comments | ask | show | jobs | submit login

I did exactly this for my finnish anki flashcards. you can see the implementation here: https://github.com/w3p706/anki-gen-fin/blob/main/ankigenfin/...

If you are looking to convert very short texts or words into speach, I had best result with eleven_multilingual_v2 with the following text for tts "Hän sanoo rauhallisesti ja hitaasti: <break time=\"1.0s\" /> '${text}'" An then i use a postprocessing to split at the silence.

This was nessesary as you cannot set the language explicitly and it is detected from the input.

With eleven_turbo_v2_5 you can set the language, but the results are not as good.




This is a cool repo. Interesting approach using uralicNLP for morphology, that's not one I've seen before. This repo's README.md is excellent and thorough too - I'll probably come back to this in March and give it a spin for myself, just to see what you're up to in a little more detail.




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: