I can't speak to the content of the actual game being played, but it wouldn't surprise me if there were an in-game text prompt:

> "The house that looks like a ripe tomato!"

that was transformed into a "user prompt" in a more instructional format:

> "Go to the tomato house"

And both were used in the agent output. At least the Y-axes on the graphs look more reasonable than some other recent benchmarks.




