
Good job, buddy. You compared your first few prompt attempts on the RNG machine against other people's cherry-picked outputs. But also, I genuinely think you're pulling this argument away from what it was. Tell me if you can see a fidelity improvement over what the last few videogen products that came out could do. I can link pretty much two videos off-rip from that sub regarding this lipsync thing you seem to be homing in on.

https://www.reddit.com/r/aivideo/comments/1kp75j2/soul_rnb_i... Kling 2.0 output, a lot less overacted in the lipsync area.

https://www.reddit.com/r/aivideo/comments/1kls6gv/the_colorl... 2.0 output, multiple characters. Shows about the same consistency and ability to adapt to dynamic speech as Veo, which is to say it's far from perfect but passes the glance test.

https://www.reddit.com/r/aivideo/comments/1jerh56/worst_date... Kling 1.6 output. The lips are a lot less visually jarring. The eyes are wonky, but that's generally still a problem across the video genAI space.

The things you profess "will change the world" have already been here. It takes maybe one extra step, but the quality's been comparable. Yet they didn't change the world 6 months ago. Or a month ago. Why's that? Is it, perhaps, that people have a habit of overestimating how much use they can get out of these things in their current state, like you are?


