Hacker News

I got lucky and got in moments after it launched, managed to get a video of "A pelican riding a bicycle along a coastal path overlooking a harbor" and then the queue times jumped up (my second video has been in the queue for 20+ minutes already) and the https://sora.com site now says "account creation currently unavailable"

Here's my pelican video: https://simonwillison.net/2024/Dec/9/sora/




For those who can't try Sora out, Tencent's super recent HunYuan is 100% open source and outperforms Sora. It's compatible with fine tuning, ComfyUI development, and is getting all manner of ControlNets and plugins.

I don't see how Sora can stay in this race. The open source commoditization is going to hit hard, and OpenAI probably doesn't have the product DNA or focus to bark up this tree too.

Tencent isn't the only company releasing open weights. Genmo, Black Forest Labs, and Lightricks are also developing completely open source video models.

Even if there weren't open source competitors, there are a dozen closed source foundation video companies: Runway, Pika, Kling, Hailuo, etc.

I don't think OpenAI can afford to divert attention and win in this space. It'll be another Dall-E vs. Midjourney, Flux, Stable Diffusion.

https://github.com/Tencent/HunyuanVideo

https://x.com/kennethlynne/status/1865528133807386666

https://fal.ai/models/fal-ai/hunyuan-video


> The Pelican inexplicably morphs to cycle in the opposite direction half way through

It's pretty cool though, the kind of thing that'd be hard if it was what you actually wanted!


"The Pelican inexplicably morphs to cycle in the opposite direction half way through"

Oof, if Sora can't even maintain internal consistency of the world for a 5-second short, I can't imagine how much worse it'll get at longer video generation lengths.


That's an awful result. The pelican turning around has absolutely nothing to do with what was asked for. It's similar in nature to the chatbot in the recent and ongoing scandal telling the teen to come home to her, when it should have known the idea was nonsensical or could be taken to mean something horrendous. https://apnews.com/article/chatbot-ai-lawsuit-suicide-teen-a...

So you were lucky indeed to be able to run your prompt and share it, because the result was quite illuminating, but not in a way that looks good for Sora and OpenAI as a whole.


Image details: 9/10
Animation: 3/10
Temporal consistency: 2/10

Verdict: 4/10


Did you notice the frame rate (so to speak) of what's happening down on the lake is much lower than the pelican's bicycle animation?


I don't have a lot of mental model for how this works, but I was surprised to note that it seems to maintain continuity on the shapes of the bushes and brown spots on the grass that track out of frame on the left and then reappear as it pans back into frame.


That must be exactly it. The simulated scene extends beyond what the camera is currently capturing.


Thanks, would you mind elaborating on what you wrote below:

  Sora is built entirely around the idea of directly manipulating and editing and remixing the clips it generates, so the goal isn't to have it produce usable videos from a single prompt.


If you watch the OpenAI announcement they spend most of their time talking about the editing controls: https://www.youtube.com/watch?v=2jKVx2vyZOY


One of the highlights of any model release for me is checking your "pelican riding a bicycle" test.




