The parent comment was talking about coding specifically, not the average score. I see o1 at 69.69, and Claude 3.5 Sonnet at 67.13.



o1's score looks exactly like what I would expect Elon Musk to aim for with Grok's benchmarks.



