
I'm attempting to port a repetition penalty into the sampling code so that GPT-2-based models produce less repetitive output. Model suggestions welcome! I will be able to get one of the smaller LLaMA models loaded in here.
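
For readers unfamiliar with the idea, here is a minimal sketch of one common form of repetition penalty: the rule that divides positive logits and multiplies negative ones for tokens that have already been generated. It is not necessarily what the port described above uses. The function name, parameter names, and the default penalty of 1.2 are all assumptions for illustration.

```python
def apply_repetition_penalty(logits, generated_ids, penalty=1.2):
    """Return a copy of `logits` with already-generated tokens made less likely.

    logits:        list of floats, one score per vocabulary id (hypothetical layout)
    generated_ids: token ids emitted so far
    penalty:       values > 1.0 discourage repeats; 1.0 leaves logits unchanged
    """
    if penalty <= 0:
        raise ValueError("penalty must be positive")

    penalized = list(logits)  # copy so the caller's logits are untouched
    for token_id in set(generated_ids):  # penalize each seen token once
        score = penalized[token_id]
        # Dividing a positive score or multiplying a negative one
        # both push the score down when penalty > 1.
        if score > 0:
            penalized[token_id] = score / penalty
        else:
            penalized[token_id] = score * penalty
    return penalized


if __name__ == "__main__":
    example_logits = [2.0, -1.0, 0.5]
    print(apply_repetition_penalty(example_logits, [0, 1, 1], penalty=2.0))
```

The penalized logits would then go through the usual temperature, top-k, or top-p steps before sampling. Handling the sign separately matters because dividing a negative logit by the penalty would raise it rather than lower it.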

