Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

That was part of the "Winning Submission Documentation", not part of the rules. It's true that it was there, but to quote the authors:

We suspect that most competitors also did not realise these additional restrictions existed - we are unable to find any data posted in the External Data Thread which meets this threshold with a brief scan. During the competition, the rules on external data were repeatedly clarified, so this leaves us wondering why Kaggle never took the opportunity to clarify that external data must additionally follow the more restrictive rules for winning submission documentation.[1]

Here's a Kaggle competition admin saying:

he deadline to declare external data is on March 3rd. So you cannot add new external datasets after that deadline, but you can use any datasets that have been declared (which are not prohibited) on this thread.[2]

and clarrfying licensing:

So it is expected that competitors understand the external data they’re using and ensure it matches the requirements in the rules.

I’ve answered the question about BY-NC not being available for use by all (non-commercial use) and therefore violating the requirement that external data be available for use by all participants.[3]

Note nothing about there being extra "rules" in the "Winning Submission Documentation".

[1] https://www.kaggle.com/c/deepfake-detection-challenge/discus...

[2] https://www.kaggle.com/c/deepfake-detection-challenge/discus...

[3] https://www.kaggle.com/c/deepfake-detection-challenge/discus...



Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: