Hacker Newsnew | past | comments | ask | show | jobs | submit | kaotown's commentslogin

Nice work. Any plans to add support for Pinecone Assistant?

The constBERT late-interaction model is a step forward in enabling practical implementation of multi-vector scoring in production search applications. Blog post shows how to easily integrate this technique into existing indexes to achieve near-LLM quality search results with negligible latency increase.

What are y'alls thoughts on this approach? I would be curious on people's experience with multi-vector retrieval in production. Are you using multi-stage pipelines for retrieval? How do you currently balance the tradeoffs between speed, accuracy, and cost?


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: