Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

If you get the same result over and over again, it means the model is more overfit to a certain result. It does not mean the result is correct.


> model is more overfit to a certain result

From their communications, a massive amount of effort was put into making sure the model followed the system prompt. One might claim "overfit as a feature".


Thank you, this is one of the most understood 'facts', especially regarding "prompt hacking/jailbreaking"




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: