Hello. I made this. I'm a bit shocked at how many people are getting things wron...

Zababa · on June 13, 2022

Thank you for this, it's great to have passionate people working on interesting projects like those and sharing them with everyone.

mrstone · on June 13, 2022

How do you have the money to support this project as an undergrad?

15ai · on June 13, 2022

I have not been with MIT in years. I had a successful exit not too long after graduation, and I've been spending most of my earnings on this project.

As an undergrad, I was completely broke. I figured that keeping the project free to use was the best thing I could possibly do with my research as I continued to work on it.

mixermachine · on June 13, 2022

Have you thought about other hosters? Maybe try contabo.com or hetzner.com Should be a lot cheaper if you mainly need CPU power.

15ai · on June 13, 2022

Unfortunately, this is mostly run on the GPU. Believe it or not, AWS was by far the cheapest.

ellieh · on June 13, 2022

Hetzner sometimes have GPU instances available on their auction site, and if you get in touch with them they can sometimes sort something out!

They used to sell them openly, but coin mining ruined that :(

funkycolemedina · on June 13, 2022

Lambda Labs offers both on-demand and reserved instances that are much cheaper than other providers - https://lambdalabs.com/service/gpu-cloud/pricing

zerr · on June 13, 2022

Why do you keep it up and burning the money? Can't you continue/do your research without a public website?

15ai · on June 13, 2022

>Why do you keep it up and burning the money?

I can afford it.

>Can't you continue/do your research without a public website?

Yes, but the website has multiple purposes. It serves as a proof of concept of a platform that allows anyone to create content, even if they can't hire someone to voice their projects.

It also demonstrates the progress of my research in a far more engaging manner - by being able to use the actual model, you can discover things about it that even I wasn't aware of (such as getting characters to make gasping noises or moans by placing commas in between certain phonemes).

It also doesn't let me get away with picking and choosing the best results and showing off only the ones that work (which I believe is a big problem endemic in ML today - it's disingenuous and misleading). Being able to interact with the model with no filter allows the user to judge exactly how good the current work is at face value.

junon · on June 13, 2022

Despite others here, I personally certainly think this is admirable. I've played with your models a long time ago with some colleagues of mine and we were all shocked how good it was, and that it was free.

I'm no stranger to passion projects, I have a lot of respect for people like you. This is great stuff.

15ai · on June 13, 2022

Thank you for the kind words. I know that HN is a tough crowd to please (I myself as well), so I hope that my next update will be well worth working for.

vsnf · on June 13, 2022

The Rise model in particular is amazingly good quality. I pranked a friend with some text from her a few minutes ago and he chastised me for wasting my money on hiring voice actors just to troll him.

So, excellent job.

_kuvn · on June 13, 2022

Do you cache results from this (especially the random samples provided)? It seems to be regenerating those, which might be expensive if lots of people are using the same prompts

Edit: also wanted to thank you for the Chell voice, it sounds completely true to life to me! (minus some jumping noises)

deeplearner1 · on June 13, 2022

Do you still plan to publish your research in a paper?

15ai · on June 13, 2022

Yes, I do. For the past three years, I have done nothing but work on this project nonstop. I've been working on massive improvements (that some have pointed out in this thread) that I've been stuck on for the past several months, but I'm getting close to finishing that up.

I don't feel comfortable publishing or releasing anything until I know for a fact that I can make no further improvements. It's not out of corporate greed or anything like that - I'm just really paranoid about getting out the best work possible.

maxander · on June 13, 2022

Respectfully, the perfect is the enemy of the good, and it’s entirely reasonable to publish what you have now. If later you make further improvements, you can simply publish again.

15ai · on June 13, 2022

You're completely correct, but I'm afraid this is more of a personal problem. I know I'll never be able to forgive myself if I figured out a solution to one of the more obvious problems with the model after I've already published it. I'd just be far more comfortable being happy with my own work before I release it to the wild. I know that this is selfish, and I apologize.

Dramatize · on June 13, 2022

Do you still ban users from using different AI voices together?

15ai · on June 13, 2022

I have no control over that. Exactly how do you expect me to enforce this?

imdsm · on June 13, 2022

This is a fantastic project but I wish you would expand Spongebob to include Patrick, Squidward, Plankton, and my favourite, Mr Krab$!