One reason to believe OpenAI here is that R1 will occasionally claim to be made ... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		int_19h 10 months ago \| parent \| context \| favorite \| on: Deepseek R1 Distill 8B Q40 on 4 x Raspberry Pi 5 One reason to believe OpenAI here is that R1 will occasionally claim to be made by OpenAI, which in e.g. LLaMA finetunes is indicative of using synthetic data generated by ChatGPT. Note that this isn't necessarily o1. While o1 is specifically trained to do CoT, you can also make 4o etc produce it with the appropriate prompts, and then train on that output.

HPsquared 10 months ago [–]

I suppose it might be hard to avoid encountering ChatGPT outputs "in the wild" now, even if they don't explicitly use it for training material.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact