
Quite a good read! Impressive results, it seems. I still think it would be much more useful to research learning complex things without absurd compute, sample inefficiency, or various hacks, e.g. reward shaping (which, let's be honest, this seems to have a lot of), but these are still interesting results.

