
At 0:52 in their demo video, there is a grammatical inconsistency in the agent's text output. This makes me suspect that the annotations in the video were created by humans after the fact. Is Google up to their old marketing/hyping tricks again?

> SIMA 2 Reasoning:

> The user wants me to go to the ‘tomato house’. Based on the description ‘ripe tomato’, I identify the red house down the street.





I can't speak to the content of the actual game being played, but it wouldn't surprise me if there was an in-game text prompt:

> "The house that looks like a ripe tomato!"

that was transformed into a "user prompt" in a more instructional format

> "Go to the tomato house"

And both were used in the agent output. At least the Y-axes on the graphs look more reasonable than some other recent benchmarks.


The scene just before the one you describe shows the user writing "ripe tomato" in the description; you can see it in the video. The summary elides it, but the "ripe tomato" instruction is also clearly part of the context.


