Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Commercial TTS mostly sucks too.

There’s flashes of brilliance but most of it is noticeably computer generated.



The Gemini models and Eleven V3, and whatever internal audio model Sora 2 uses are about neck and neck in converging performance. They have some unexplainable flavor to them though. Especially Sora.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: