Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Do you mean images in the video like how the Google Translate app can do with the camera, or do you mean the audio within the video?


The audio within the video


Unfortunately, none that I'm aware of. For whatever reason, I find that speech to text is never as good as the accuracy scores claimed by those making the models.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: