Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Jeff you know what would be magical? Not just vanilla diarization "Speaker 1" and "2" but if the model can know from the conversation this speaker was referred to as "Jeff Harris" or "Jeff" so it uses that instead.



Or if we could even provide samples of what an example speaker sounds like in general so that it would always classify them the way we want.




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: