Hacker News new | past | comments | ask | show | jobs | submit login

Probably just trained on lots of GPT-4 output.



Apparently trained on lots of refusals too, speaks to the high competence of whoever was setting up the dataset. It's one string regex to filter them out and get more performance for fucks sake.


Oh, right, I remember hearing that was a technique to train LLMs. Interesting that it impacts it in such a way.




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: