
It scores 72.4 on NYT Connections, a significant improvement over o1-mini (42.2) and ahead of DeepSeek R1 (54.4), but it falls short of o1 (90.7).

(https://github.com/lechmazur/nyt-connections/)


