jeadie · 2025-09-12T23:50:56

We’re building vector indexes into DataFusion for search (starting with S3 Vectors).

Open source at https://github.com/spiceai/spiceai

jeadie · 2025-05-23T02:15:01

This is one of the ideas behind using DuckDB in github.com/spiceai/spiceai

anentropic · 2025-05-23T11:41:56

That looks like an amazing "swiss army knife"...!

mrbungie · 2025-05-23T02:37:04

Looks very cool! I will take a look, tysm!

jeadie · 2025-05-06T01:00:45

There’s also https://github.com/spiceai/spiceai

jeadie · on Dec 4, 2024

This is a common feature now. If anything, for being so early to vector databases, Pinecone was rather late to integrating embeddings.

Timescale most recently added it, but yes, a bunch of others: Weaviate, Spice AI, Marqo, etc.

gdj0nes · on Dec 4, 2024

A difference between Pinecone and many of the others you listed is that we host both embedding and reranking models in a serverless fashion. You pay for what you use while we manage the entire stack.

jimminyx · on Dec 4, 2024

Do any of the others also handle reranking?

iosjunkie · on Dec 4, 2024

Qdrant does with its ‘Query API’.

https://qdrant.tech/documentation/concepts/hybrid-queries/

And handles embedding creation with its fastembed package.

https://github.com/qdrant/fastembed

tech2trees · on Dec 5, 2024

Marqo does: https://www.marqo.ai/

cess11 · on Dec 4, 2024

I don't know about them, but Manticore does.

https://manticoresearch.com/use-case/vector-search/

jeadie · on Oct 18, 2024

Why not just federate Postgres and Parquet files? That way the query planner can push down as much of the query as possible and reduce how much data has to move about.
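To make the pushdown point concrete, here is a toy sketch in plain Python. It is not how any real engine is implemented: `Source`, `query_without_pushdown`, and `query_with_pushdown` are hypothetical names, and the "sources" are in-memory lists standing in for a Postgres table and a Parquet file. It only counts how many rows each source has to hand over.

```python
# Toy illustration of filter pushdown in a federated query.
# All names here are hypothetical; no real database is involved.
from dataclasses import dataclass


@dataclass
class Source:
    """Stand-in for a remote table (e.g. Postgres or a Parquet file)."""
    name: str
    rows: list
    rows_transferred: int = 0

    def scan(self, predicate=None):
        # With a predicate, filtering happens "at the source",
        # so only matching rows are counted as transferred.
        out = [r for r in self.rows if predicate is None or predicate(r)]
        self.rows_transferred += len(out)
        return out


def query_without_pushdown(sources, predicate):
    # Pull every row from every source, then filter centrally.
    rows = [r for s in sources for r in s.scan()]
    return [r for r in rows if predicate(r)]


def query_with_pushdown(sources, predicate):
    # Hand the filter to each source; only matches are moved.
    return [r for s in sources for r in s.scan(predicate)]
```

Both functions return the same rows; the difference is in `rows_transferred`, which is the "data moving about" that pushdown cuts down.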

jeadie · on May 13, 2024

This looks functionally similar to using http://github.com/spiceai/spiceai with a PostgreSQL data accelerator.

jeadie · on April 1, 2024

Spice AI | Senior Software Engineer | GMT+10 (e.g. Australia) through GMT-7 (e.g. Seattle/SF/LA) | Remote | Full Time

Spice AI provides building blocks for data and AI-driven applications by composing real-time and historical time-series data, high-performance SQL query, machine learning training and inferencing, in a single, interconnected AI backend-as-a-service.

We just launched github.com/spiceai/spiceai, a unified SQL query interface and portable runtime to locally materialize, accelerate, and query data tables sourced from any database, data warehouse, or data lake.

We're hiring experienced software engineers, ideally with Rust and/or Golang production experience. We're focused on large data and distributed systems, so experience in these is important too. More details: https://spice.ai/careers#section-open-positions

svashish305 · on April 8, 2024

it says remote but the open positions are mostly hybrid

jeadie · on March 28, 2024

And yes, Iceberg is very high up on our list

jeadie · on March 28, 2024

Yes! It can connect to FlightSQL-compatible servers (see https://docs.spiceai.org/data-connectors/flightsql ) and it's also a FlightSQL-compatible server.

lukekim · on March 28, 2024

We also have a Grafana plugin we'll continue to improve to make it super easy to connect to Grafana, and Spice has a metrics endpoint and example Grafana dashboard for monitoring itself https://github.com/spiceai/spiceai/blob/trunk/monitoring/gra...

jeadie · on June 9, 2023

Have you seen github.com/marqo-ai/marqo? It does all this wrapping, and you don't even need to pay for OpenAI or Pinecone.

skelts · on June 9, 2023

+1 to Marqo. It's documents in, documents out rather than vectors in, vectors out. Much easier end to end.

adaro · on June 9, 2023

Sounds cool, will take a look.

Are there other embedding APIs that you like or have used?