Maybe its a combination of both.

Sharlin · 2024-11-11T14:30:24 1731335424

It seems exceedingly unlikely to me that frames from random YouTube videos would have been used to train image generation models. First off, they're difficult to extract and second, the quality of individual video frames is very low, especially if we're talking about 15 year old phone videos at what, 480p at the very best!

jsemrau · 2024-11-11T17:07:16 1731344836

You are probably right. I approached it from a high-value dataset perspective but would agree that fuzzy frames probably don't help much.