Hacker News new | past | comments | ask | show | jobs | submit login

Yes. Based on conversations I’ve had with OpenAI staff, Davinci started unexpectedly developing the ability to answer longer questions as they scaled up normal InstructGPT fine-tuning some time in the past year. They don’t take down old models when the default one updates so you can see the version history implicitly in the availability of old models.



Do they do regression tests, and how do they verify them?

How do they know that a new version is actually an improvement?


[flagged]


It’s not that implausible. It’s trained on many examples of instructions followed by answers, and it’s meant to (and does) generalize to unseen instructions. After enough training, it also generalized to instructions of previously unseen length.




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: