The issue is that Jobs's training data is likely 99% his public "presentation voice" audio -- cadence, inflection, and emphasis from remarks at Apple events, commencement addresses, shareholder meetings, etc. -- which OF COURSE sounds unnatural in regular conversation.
Meanwhile, Rogan has a million hours of regular conversation audio to learn from.
Humans are expensive, though. If you have a lot of speech to record, it might be cheaper to use the human to train the AI and then let the AI handle the rest.
Would the fact that Joe's data is more standardized and produced the same way have an effect? Jobs's data is likely a mix of different volumes, echo levels, and processing.
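It probably would, which is why training pipelines typically normalize the audio before fine-tuning. A minimal sketch of that kind of preprocessing is below -- the file names, the 22.05 kHz target rate, and the thresholds are just illustrative assumptions, not anyone's actual pipeline:

    import numpy as np
    import librosa
    import soundfile as sf

    TARGET_SR = 22050  # a common sample rate for TTS training corpora

    def normalize_clip(in_path: str, out_path: str, peak: float = 0.95) -> None:
        # Load and resample so every clip shares the same sample rate.
        y, _ = librosa.load(in_path, sr=TARGET_SR, mono=True)
        # Peak-normalize to reduce volume differences between recordings.
        y = y * (peak / max(np.max(np.abs(y)), 1e-9))
        # Trim leading/trailing silence, which varies a lot across source material.
        y, _ = librosa.effects.trim(y, top_db=30)
        sf.write(out_path, y, TARGET_SR)

    # Hypothetical file names, for illustration only.
    normalize_clip("raw/jobs_keynote_01.wav", "clean/jobs_keynote_01.wav")

This only evens out loudness, sample rate, and silence; room echo and heavy broadcast processing are much harder to undo, so a corpus recorded the same way every time (like a podcast studio) still has a real advantage.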