Hacker News
bottled_poe, 8 months ago, on: Open-R1: an open reproduction of DeepSeek-R1
It will be very interesting to see if they can reproduce a similar model on the shoestring budget claimed by DeepSeek.
anshumankmr, 8 months ago
But DeepSeek hasn't claimed the figure touted by everyone for this particular R1 model, because that 5.6M was apparently for DeepSeek's coder model.
boroboro4, 8 months ago
The 5.6M figure is for the base DeepSeek V3 model. Both the instruction tuning and the reasoning tuning of it have negligible cost in comparison.