
Had the same thought. :) I wonder if anyone has done a challenge using that game, which I believe has simpler inputs?


Simplest RL algorithm (Q-learning) achieves 100m in QWOP: https://www.youtube.com/watch?v=e27TUmMkOA0

Although it found and exploited a local maximum, the "knee scraping" technique (which humans can replicate) :)
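
For anyone unfamiliar with Q-learning, here is a minimal tabular sketch of the update rule on a toy 1-D track. This is not QWOP and not the code from the video: the environment, the reward, the hyperparameters, and the names `step`, `train_q_learning` and `greedy_policy` are all made up for illustration.

```python
import random

N_STATES = 6      # positions 0..5 on a toy track; 5 is the finish line
ACTIONS = (0, 1)  # 0 = step back, 1 = step forward


def step(state, action):
    """Hypothetical deterministic environment (a stand-in, not QWOP)."""
    nxt = max(0, state - 1) if action == 0 else state + 1
    done = nxt == N_STATES - 1
    reward = 1.0 if done else 0.0  # reward only for crossing the finish line
    return nxt, reward, done


def train_q_learning(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]  # q[state][action]
    for _ in range(episodes):
        state = 0
        for _ in range(200):  # cap the episode length
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)  # explore
            else:
                best = max(q[state])  # exploit, breaking ties at random
                action = rng.choice([a for a in ACTIONS if q[state][a] == best])
            nxt, reward, done = step(state, action)
            # Q-learning target: reward plus discounted best value of the next state
            target = reward if done else reward + gamma * max(q[nxt])
            q[state][action] += alpha * (target - q[state][action])
            state = nxt
            if done:
                break
    return q


def greedy_policy(q):
    """Best-known action for each non-terminal position."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


if __name__ == "__main__":
    print(greedy_policy(train_q_learning()))
```

The toy track has only one way to score, so there is nothing like knee scraping to get stuck on here; the point is just the one-line update. In a richer environment the same rule reinforces whatever strategy earns reward first, which is how an agent can settle on a local maximum.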



