> This is exactly why it is not “US vs China”, the battle is between heavily-cap... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		LunaSea 8 months ago \| parent \| context \| favorite \| on: Open-R1: an open reproduction of DeepSeek-R1 > This is exactly why it is not “US vs China”, the battle is between heavily-capitalized Silicon Valley companies versus open source. Ah yes the "open source" code that was not released by the DeepSeek team and the tens of thousands of professional grade GPUs that were contributed by the "community". DeepSeek is based on Llama which was produced by ... Meta.

Palmik 8 months ago [–]

DeepSeek v3/r1 isn't based on llama architecture. It uniquely combines and contributes several novel approaches.

Meta never released a mixture of expert model (they failed to train a good one, according to reliable rumors). And MoE is just one of few ingredients that make DeepSeek v3/R1 interesting and good.

Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact