Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

To train small gpt-like models, there's also aitextgen: https://github.com/minimaxir/aitextgen


As the creator of aitextgen, I'm mixed on continuing support since there doesn't seem to be as much demand as expected for small GPT models given the success and cost-effectiveness of GPT-3/ChatGPT, unfortunately.

I still have a few ideas there (including another secret approach at better text generation) but it's hard to determine ROI.


I think what you have created still has great demand. It give devs who do not have the budget or need for the gigantic models, something to train and use for their own specific language tasks.

Not everyone is trying to replicate CHATGPT results for certain tasks.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: