Hacker Newsnew | past | comments | ask | show | jobs | submit | fromlogin
Evaluating Language-Model Agents on Realistic Autonomous Tasks (alignment.org)
1 point by todsacerdoti on Aug 15, 2023 | past
Prizes for matrix completion problems (alignment.org)
1 point by EvgeniyZh on May 8, 2023 | past
Can GPT-4 escape into the wild? (alignment.org)
2 points by p1esk on March 28, 2023 | past | 2 comments
Update on ARC's recent eval efforts (alignment.org)
2 points by todsacerdoti on March 20, 2023 | past

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: