This is a strange document. It doesn't mention supervised instruction finetuning anywhere. Prompt engineering of this kind can (and need) only really be applied to a foundation model, which just completes text. An instruction-tuned model is no longer a text completer; it models an agent and understands what you ask it to do, so there is neither the need nor the possibility for prompt engineering of this kind. (The foundation model for GPT-4 was never made publicly available, by the way, and the one for GPT-3.5 was removed from the API a few weeks ago.)
It is worth mentioning that instruction-tuned models are not necessarily better, since they can exhibit "mode collapse", a loss of entropy, where they e.g. tend to produce content that is very similar in style.
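To make "loss of entropy" concrete, here is a toy sketch: Shannon entropy of two invented next-token distributions, one spread out (as a base model's often is) and one concentrated on a few stereotyped continuations (as after mode collapse). The numbers are made up purely for illustration, not measured from any actual model.

```python
import math

def entropy(probs):
    # Shannon entropy in bits of a discrete probability distribution.
    return -sum(p * math.log2(p) for p in probs if p > 0)

# Hypothetical next-token distributions over four continuations.
base_model = [0.25, 0.25, 0.25, 0.25]  # probability spread widely
collapsed  = [0.85, 0.05, 0.05, 0.05]  # probability piled on one mode

print(entropy(base_model))  # 2.0 bits
print(entropy(collapsed))   # noticeably lower: outputs cluster in style
```

The collapsed distribution still covers the same continuations, but samples from it will look much more uniform in style, which is the complaint being made here.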
No you're not. I too enjoy working with the text completion LLMs have been capable of for some time. The issue with text completion is that most people don't want to be forced to think up a plausible document header when all they want is an inferred answer.
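For anyone who hasn't worked with a base model: the point is that you have to wrap your question in a document whose natural continuation is the answer. A minimal sketch, with the FAQ framing and wording being my own invented example rather than anything from a specific guide:

```python
# Sketch: prompting an instruction-tuned model vs. a base (completion-only)
# model. The instruction-tuned model can be asked directly; the base model
# only continues text, so the question is dressed up as an FAQ document
# whose most likely continuation is an answer.

question = "Why is the sky blue?"

# Instruction-tuned model: the question itself is the whole prompt.
instruct_prompt = question

# Base model: invent a document header and structure around the question.
completion_prompt = (
    "Frequently Asked Questions\n"
    "\n"
    f"Q: {question}\n"
    "A:"
)

print(completion_prompt)
```

The "Frequently Asked Questions" header is exactly the kind of scaffolding the comment is talking about: it does nothing for the user except steer the completion.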
Another problem is that OpenAI no longer wants its customers to access them. They may be considered too dangerous, since they are not only not instruction tuned but also not censored (RLHF'd). So people have to use less powerful base models, which cancels out their increased flexibility.
I guess so; at least this is what people with a lot of experience with language models, like janus (see link in sibling), are reporting.
Though I should mention that mode collapse doesn't just come from supervised instruction tuning (which makes the model reply to requests instead of treating them as completion prompts), but also from things like RLHF, which biases the model toward giving certain replies rather than others.
What the commenters there didn't realize at the time is that code-davinci-002 has nothing to do with the "Codex API" specifically; it is simply the GPT-3.5 foundation model without any fine-tuning applied. See