
Great to see Unsloth here! How long did the training process take?

Also, a different version of the same original Colab couldn't get a 135M model to learn the XML tags, so do you think 8 billion parameters should be the minimum for using this?



Hey! The notebook takes 1 to 3 hours on a free GPU, though it's best to let it go for about 1 day for a full run - faster GPUs can be much quicker.

Yep - bigger models should help! Definitely give Llama a try!



