
Neat, but static ads are slowly becoming a thing of the past. Companies like Acast and Midroll are building dynamic ad insertion that injects ads into podcasts at stream/download time, so the same episode can carry different ads on each download.



Sounds like that's potentially solvable by breaking the podcast down into chunks by speaking voice, then flagging any section of ~30s whose voice differs from the rest of the episode. Guest speakers shouldn't get caught by this, as they'd show up in back-and-forth conversation rather than in a mostly unbroken 30s chunk.
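
A rough sketch of that heuristic, assuming some upstream diarization step has already produced (speaker, start, end) segments. The `Segment` type, the function name, and all thresholds here are made up for illustration and not tuned on real data:

```python
from collections import defaultdict
from typing import NamedTuple


class Segment(NamedTuple):
    speaker: str  # label from a diarization step (assumed to exist upstream)
    start: float  # seconds
    end: float    # seconds


def flag_ad_candidates(segments, min_len=20.0, max_len=45.0, max_share=0.1):
    """Flag unbroken ~30s turns spoken by a voice that is rare in the episode.

    min_len/max_len bracket the "~30s" window; max_share is the largest
    fraction of total speaking time a voice may have and still count as rare.
    All three are hypothetical defaults.
    """
    # Total speaking time per voice across the whole episode.
    total = defaultdict(float)
    for seg in segments:
        total[seg.speaker] += seg.end - seg.start

    episode_len = sum(total.values())
    if episode_len == 0:
        return []

    flagged = []
    for seg in segments:
        duration = seg.end - seg.start
        rare_voice = total[seg.speaker] / episode_len <= max_share
        if rare_voice and min_len <= duration <= max_len:
            flagged.append(seg)
    return flagged


# Toy episode: a host, one inserted ad read, and a guest with short turns.
episode = [
    Segment("host", 0.0, 300.0),
    Segment("ad_voice", 300.0, 330.0),
    Segment("host", 330.0, 600.0),
    Segment("guest", 600.0, 610.0),
    Segment("host", 610.0, 625.0),
    Segment("guest", 625.0, 637.0),
    Segment("host", 637.0, 900.0),
]
flagged = flag_ad_candidates(episode)
```

In the toy episode only the 30s `ad_voice` turn is flagged: the guest is also a rare voice, but their turns are too short to fall in the window. This would miss host-read ads, since those use the same voice as the rest of the show.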


Speaker recognition would be really awesome. Do you know of any techniques more general than fingerprinting specific audio sections?

Edit: https://github.com/ppwwyyxx/speaker-recognition/ looks like the first of a number of good starting points.


I will publish more on that topic.


sounds great! looking forward to it.



