Hacker News new | past | comments | ask | show | jobs | submit login

Deepseek v3 required about 1tb of VRAM / RAM so 10 A100.

There are various ways to run it with lower vram if you're ok with way worse latency & throughput

Edit: sorry this is for v3, the distilled models can be ran on consumer-grade GPUs






Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: