ColBERT: Contextualized Late Interaction over BERT. That is just the name. It can be fine-tuned for retrieval using any encoder-only model like the ones you mention.
Yes, thanks. Anyone looking to swap out ColBERTv2 should look at example.2 ("for any BERT/RoBERTa-like model") and then test against your use cases to your heart's content, or at least until you run out of time to optimize lol!
Actually, Vespa comes out of the same FAST company. Yahoo bought Overture/Altavista and a lot of other web search companies in 2003, including the web search division of FAST. The Enterprise search division of FAST was later acquired by Microsoft.
Hehe, it was a joke, we don't have polar bears on the mainland of Norway. But, it was fun to show the photo to visitors from different countries. "Be careful when you walk back to the hotel".
If you're looking for just pure vector similarity search, there are many alternatives. But Vespa's tensor support, multi-vector indexing, and the ability to express models like ColBERT (1) or cross-encoders make it stand out if you need to move beyond pure vector search.
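To make "expressing a model like ColBERT" concrete, here is a minimal pure-Python sketch of ColBERT-style late interaction (MaxSim): a document is scored by summing, over the query's token embeddings, the maximum dot product against any document token embedding. The function name and the toy 2-dimensional embeddings are illustrative, not Vespa's actual tensor expression or API.

```python
# Hedged sketch of ColBERT-style late interaction (MaxSim).
# query_vecs / doc_vecs: lists of equal-length float vectors, one per token.

def maxsim_score(query_vecs, doc_vecs):
    def dot(a, b):
        return sum(x * y for x, y in zip(a, b))
    # For each query token, take the best-matching document token,
    # then sum those per-token maxima into one relevance score.
    return sum(max(dot(q, d) for d in doc_vecs) for q in query_vecs)

query = [[1.0, 0.0], [0.0, 1.0]]               # two query token embeddings
doc = [[0.9, 0.1], [0.2, 0.8], [0.5, 0.5]]     # three document token embeddings
print(round(maxsim_score(query, doc), 3))      # prints 1.7 (0.9 + 0.8)
```

In a real engine the same computation runs over learned multi-vector representations stored per document, which is exactly the kind of tensor computation the comment is pointing at.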
Plus, for RAG use cases, it's a full-blown text search engine as well, allowing hybrid ranking combinations. Also, with many pure vector databases like Pinecone, you cannot describe an object with more than one vector: if you have different vector models for the object, you need different indexes, and then you have to duplicate metadata across those indexes (if you need filtering + vector search).
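A small sketch of what "more than one vector per object, plus hybrid ranking" means in practice. This is illustrative Python, not any vendor's API: one document record carries several named vectors and its metadata once, and a hybrid score mixes a keyword score with the best cosine similarity across the document's vectors.

```python
import math

def cosine(a, b):
    num = sum(x * y for x, y in zip(a, b))
    den = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b))
    return num / den if den else 0.0

# One object, several vectors (e.g. from different models), metadata stored once.
doc = {
    "id": "doc1",
    "metadata": {"lang": "en"},
    "vectors": {"title": [0.1, 0.9], "body": [0.7, 0.3]},
}

def hybrid_score(keyword_score, query_vec, doc, alpha=0.5):
    # Best similarity across the document's vectors, blended with the
    # keyword (e.g. BM25) score; alpha controls the mix.
    best = max(cosine(query_vec, v) for v in doc["vectors"].values())
    return alpha * keyword_score + (1 - alpha) * best
```

With single-vector stores, each of those named vectors would instead live in its own index, each duplicating the `metadata` needed for filtering.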
Off topic... looking forward to more engineers moving to Mastodon. I have Twitter/X blocked at DNS level and still fairly frequently encounter interesting accounts that I can't check out.
Nobody's saying it's anybody else's problem - they're just looking forward to more content being on Mastodon (or elsewhere) as people move away from X.
It's not that weird, you are reading too much into the subtext. The web is in a transitory state, platforms change, people move. Wishing for more content to be available on a specific platform without blaming the author is an acceptable comment in my view.
Is there a particular mastodon server or set of servers that the engineering community is favoring? I know it technically doesn’t matter in a federated network, but curious anyway.
Yeah, I worked with the Flickr team on that project. Scaling to billions of photos, with partial update support of popularity for ranking.
Back then, the properties had to stand up their own Vespa cluster(s), later on we created a managed service out of it. And, yes, the original plan for Vespa was to be a Vertical Search Platform, that is where the name Vespa comes from. More on the history in this blog post https://blog.vespa.ai/vespa-is-becoming-its-own-company/
> I don't see a ton of data that points towards us (vector DB ppl) building towards traditional TREC/BEIR #s.
This is highly accurate: most vector database companies don't talk about the shortcomings of vector representations for search.
> TLDR: - I think pure vector search platforms should be evaluated differently than traditional keyword search platforms - I think vector search is a tool -- simply one of many -- that search engineers should use to make their search engine results more relevant.
This is a contradiction. On one hand you say that you want to improve relevance; on the other hand you say that vector search as a tool cannot be evaluated like other models (tools).
We have plenty of open information retrieval datasets (both full retrieval and ranking) where you can compare different methods or tools and assess the relevance impact.
It's an opinionated blog post published on arXiv, masquerading as research.
IMHO, it's a gigantic self-own and doesn't promote Lucene in a good way. For example, it demonstrates how they get only 10 QPS out of a system with 1 TB of memory and 96 vCPUs (after 4 warmups).
The HNSW implementation in Lucene is fair, and within the same order of magnitude as others. But to get comparable performance you must merge all immutable segments into a single segment, which all Lucene-oriented benchmarks do, but which is not that realistic for many production workloads where docs are updated/added in near real time.
> but which is not that realistic for many production workloads where docs are updated/added in near real-time.
It really depends on how real-time you need the search to be, though.
What I've seen is a green/blue Lucene index. The updates happen on one (let's say the blue), while searches happen on the other (green). The segment merging happens periodically on the blue (or, even smarter, after some combined threshold of elapsed time and accumulated updates), and then the indexes are switched. Depending on how often new documents come in and how "real-time" you need to be, this may be sufficient.
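The green/blue pattern above can be sketched as follows. This is a generic in-memory stand-in, not Lucene code: writes land on the blue copy, searches read the green copy, and a periodic swap promotes a snapshot of blue to serve searches. Class and method names are illustrative; in real Lucene the "swap" corresponds to merging segments and reopening a reader.

```python
import threading

class BlueGreenIndex:
    def __init__(self):
        self._lock = threading.Lock()
        self._green = {}   # serving searches
        self._blue = {}    # receiving updates

    def update(self, doc_id, doc):
        # Writes go to the inactive (blue) copy only.
        with self._lock:
            self._blue[doc_id] = doc

    def search(self, predicate):
        # Reads see the last-promoted (green) snapshot.
        with self._lock:
            active = self._green
        return [d for d in active.values() if predicate(d)]

    def swap(self):
        # Periodically (or after N updates / T seconds), promote a
        # snapshot of blue to serve searches; blue keeps taking writes.
        with self._lock:
            self._green = dict(self._blue)
```

New documents are invisible until the next swap, which is exactly the staleness trade-off the comment describes: fine if your "real-time" budget is minutes, not milliseconds.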