Also, a different version of the same og Colab still couldn't get the 135M model to learn the XML tags, so do you think 8B should be the minimum for this?
Yep - bigger models should help! Definitely give Llama a try!
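One easy way to sanity-check this before committing to a bigger run is to measure how often sampled completions actually contain the expected tags. Here's a minimal sketch, assuming the format is `<reasoning>...</reasoning><answer>...</answer>` (swap in whatever tag names your Colab's reward functions use):

```python
import re

# Hypothetical tag names -- adjust to match your notebook's format.
XML_PATTERN = re.compile(
    r"<reasoning>.*?</reasoning>\s*<answer>.*?</answer>", re.DOTALL
)

def format_compliance(completions: list[str]) -> float:
    """Fraction of sampled completions that follow the XML format."""
    hits = sum(bool(XML_PATTERN.search(c)) for c in completions)
    return hits / max(len(completions), 1)

# e.g. compare format_compliance(samples_135m) vs format_compliance(samples_8b)
# on the same prompts to see whether the bigger model picks up the format.
```

If the 135M model sits near 0% compliance after training while an 8B model climbs quickly, that's a good sign the issue is capacity rather than your reward setup.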