Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

That reminded me to try a version of the riddle that I had come up with that I had never seen an LLM successfully answer:

  Me: I'd like you to solve this riddle for me. A farmer has a cabbage, a goat, a wolf and a lion,
  and needs to cross a river on a boat. If the goat is left alone with the cabbage, it will eat it.
  If the wolf is left alone with the goat, it will eat it. If the lion is left alone with the goat
  or wolf, it will eat them. The boat can only carry the farmer and one other thing across. How can
  the farmer safely transport everything across the river?
O3-mini spent a very long time on it (over a minute), delineating its various strategies that it was trying, and finally, correctly, concluded that the puzzle is unsolvable.

Good job!



o1 and deepseek r1 managed to get this first try as well (o1 in about 30 seconds and r1 hilariously took a couple minutes). If anyone has set up API access already I'd be curious if o1-mini also got it or if it took more than "the jump to CoT" to avoid pattern matching this one.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: