Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> This is exactly why it is not “US vs China”, the battle is between heavily-capitalized Silicon Valley companies versus open source.

Ah yes the "open source" code that was not released by the DeepSeek team and the tens of thousands of professional grade GPUs that were contributed by the "community".

DeepSeek is based on Llama which was produced by ... Meta.



DeepSeek v3/r1 isn't based on llama architecture. It uniquely combines and contributes several novel approaches.

Meta never released a mixture of expert model (they failed to train a good one, according to reliable rumors). And MoE is just one of few ingredients that make DeepSeek v3/R1 interesting and good.




Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: