
You can say R1-604b to disambiguate, just like we have Llama 3 8B/70B, etc.



These models are not of the same nature either. They were trained in different ways. A uniform naming scheme (even with an explicit parameter count) would still be misleading.



