Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
|
from
login
Evaluating Language-Model Agents on Realistic Autonomous Tasks
(
alignment.org
)
1 point
by
todsacerdoti
on Aug 15, 2023
|
past
Prizes for matrix completion problems
(
alignment.org
)
1 point
by
EvgeniyZh
on May 8, 2023
|
past
Can GPT-4 escape into the wild?
(
alignment.org
)
2 points
by
p1esk
on March 28, 2023
|
past
|
2 comments
Update on ARC's recent eval efforts
(
alignment.org
)
2 points
by
todsacerdoti
on March 20, 2023
|
past
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: