Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> Benchmarks suggest this model loses to Deepseek-R1 in every one-shot comparison.

That's not particularly surprising though as the Medium variant is likely close to ten times smaller than DeepSeek-R1 (granted it's a dense model and not an MoE, but still).



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: