The R1 GitHub repo is way more exciting than I had thought. They aren't only ope...

roborovskis · 2025-01-20T15:46:32 1737387992

Where are you seeing this? On https://github.com/deepseek-ai/DeepSeek-R1/tree/main?tab=rea... I only see the paper and related figures.

ozgune · 2025-01-20T16:17:41 1737389861

I see it in the "2. Model Summary" section (for [2]). In the next section, I see links to Hugging Face to download the DeepSeek-R1 Distill Models (for [3]).

https://github.com/deepseek-ai/DeepSeek-R1?tab=readme-ov-fil...

scribu · 2025-01-20T16:27:46 1737390466

The repo contains only the PDF, not actual runnable code for the RL training pipeline.

Publishing a high-level description of the training algorithm is good, but it doesn't count as "open-sourcing", as commonly understood.

fabmilo · 2025-01-20T20:26:44 1737404804

was genuinely excited when I read this but the github repo does not have any code.

fsndz · 2025-01-20T16:32:04 1737390724

[flagged]

fsndz · 2025-01-20T17:35:56 1737394556

this means we are going to get o3 level open source models in a few months. So exciting !

torginus · 2025-01-20T18:01:17 1737396077

Is o3 that much better than o1? It can solve that Arc-AGI benchmark thing at huge compute cost, but even with o1, the main attraction (for me) seems to me that it can spit out giant blocks of code, following huge prompts.

I'm kinda ignorant, but I'm not sure in what way is o3 better.

bugglebeetle · 2025-01-20T18:34:48 1737398088

> It can solve that Arc-AGI benchmark thing at huge compute cost

Considering DeepSeek v3 trained for $5-6M and their R1 API pricing is 30x less than o1, I wouldn’t expect this to hold true for long. Also seems like OpenAI isn’t great at optimization.

Philpax · 2025-01-20T18:51:37 1737399097

OpenAI is great at optimisation - compare the cost of -4o to -4. They just haven't optimised o3 yet.

bugglebeetle · 2025-01-20T19:11:11 1737400271

4o is more expensive than DeepSeek-R1, so…? Even if we took your premise as true and we say they are as good as DeepSeek, this would just mean that OpenAI is wildly overcharging its users.

fsndz · 2025-01-20T21:22:13 1737408133

now openai has no other choice than shipping a cheaper version of o1 and o3. The alternative is everyone using r1 (self hosted or via openrouter, nebius AI, together AI and co)

fsndz · 2025-01-20T19:16:10 1737400570

yes o3 is better, but I would argue it is not yet clear for which cases it is absolutely crucial to use o3 instead of o1.

echelon · 2025-01-20T18:33:18 1737397998

This is how you do "Open" AI.

I don't see how OpenAI isn't cooked. Every single foundation model they have is under attack by open source.

Dall-E has Stable Diffusion and Flux.

Sora has Tencent's Hunyuan, Nvidia's Cosmos, LTX-1, Mochi, CogVideo.

GPT has Llama.

o1 has R1.

And like with R1, these are all extensible, fine tunable, programmable. They're getting huge ecosystems built up around them.

In the image/video space there are ComfyUI, ControlNets, HuggingFace finetrainers, LoRAs. People share weights and training data.

Open source is so much better to base a company on than a proprietary model and API.

...

It looks there is no moat.

parav · 2025-01-20T19:15:26 1737400526

The moat might be tiny at the frontier level. But the mainstream still only knows about ChatGpt. OpenAI won consumer before others even started.

meowface · 2025-01-20T19:29:49 1737401389

Which is funny because ChatGPT was sort of a random experiment and not like a planned attempt at a huge product launch.

fsndz · 2025-01-20T18:35:57 1737398157

indeed there is no moat. Open source will win !

ttul · 2025-01-20T18:52:56 1737399176

I think open source AI has a solid chance of winning if the Chinese keep funding it with great abandon as they have been. Not to mention Meta of course, whose enthusiasm for data center construction shows no signs of slowing down.