
The OP is incorrect. Embeddings are still needed, since (1) context windows can't contain all the data and (2) memorizing data through continuous retraining is not yet viable.



But the common use case of using a vector DB to pull in augmentation context now appears to be handled by the Assistants API. I haven't dug into the details yet, but it appears you can upload files and their contents will be used (likely with some sort of vector search happening behind the scenes).
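For anyone unfamiliar with what that behind-the-scenes step probably looks like: retrieval over embeddings is just nearest-neighbor search by vector similarity. A minimal sketch below, with made-up 3-d vectors standing in for real model embeddings (which are typically hundreds or thousands of dimensions, produced by an embedding model):

```python
import math

# Toy corpus with hypothetical pre-computed embeddings; a real system
# would get these from an embedding model and store them in a vector DB.
corpus = {
    "doc_a": [0.9, 0.1, 0.0],
    "doc_b": [0.1, 0.8, 0.1],
    "doc_c": [0.0, 0.2, 0.9],
}

def cosine(u, v):
    # Cosine similarity: dot product of the vectors over the product
    # of their magnitudes.
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def retrieve(query_vec, k=2):
    # Rank documents by similarity to the query embedding; the top-k
    # texts would then be stuffed into the LLM's context window.
    ranked = sorted(corpus, key=lambda d: cosine(query_vec, corpus[d]),
                    reverse=True)
    return ranked[:k]

print(retrieve([0.85, 0.15, 0.0]))  # doc_a ranks first (closest vector)
```

Production systems replace the linear scan with an approximate nearest-neighbor index, but the principle is the same.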


"yet"


It's also much slower: LLMs generate text one token at a time, which is not very good for search.

Pre-search query processing, however, is probably a good fit for LLMs.



