Hi all! The team at Context Data has launched an open-source, no-code ETL framework for vector databases. The framework lets devs pull data from multiple sources, embed it, and write the results to multiple vector database/store targets using just a YAML configuration file.
We've open-sourced our no-code ETL framework for vector data processing so that AI engineers can seamlessly move data from multiple data sources into ALL MAJOR vector databases using just a config file.
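To make the "config in, vectors out" idea concrete, here is a rough sketch of a config-driven source, embed, and target flow. It is not VectorETL's actual API or config schema: the keys in `CONFIG`, the `toy_embed` function, and the in-memory "store" are all made up for illustration, and the config is written as a Python dict standing in for what a parsed YAML file might look like. Please check the documentation for the real format.

```python
from typing import Dict, List

# Hypothetical config, shaped like what a YAML file might parse into.
# These keys are illustrative only and are NOT VectorETL's real schema.
CONFIG: Dict = {
    "source": {
        "type": "in_memory",
        "rows": [
            {"id": "1", "text": "vector databases store embeddings"},
            {"id": "2", "text": "ETL moves data between systems"},
        ],
    },
    "embedding": {"model": "toy_hash", "dimensions": 8},
    "target": {"type": "in_memory_store"},
    "embed_columns": ["text"],
}


def toy_embed(text: str, dimensions: int) -> List[float]:
    """Stand-in for a real embedding model: count characters per bucket."""
    vector = [0.0] * dimensions
    for ch in text:
        vector[ord(ch) % dimensions] += 1.0
    return vector


def run_pipeline(config: Dict) -> List[Dict]:
    """Extract rows, embed the configured columns, and load into a list."""
    rows = config["source"]["rows"]
    dims = config["embedding"]["dimensions"]
    columns = config["embed_columns"]

    store: List[Dict] = []  # plays the role of the vector database target
    for row in rows:
        # Join the columns selected for embedding into one text string.
        text = " ".join(str(row[col]) for col in columns)
        store.append(
            {"id": row["id"], "vector": toy_embed(text, dims), "metadata": row}
        )
    return store


records = run_pipeline(CONFIG)
```

The point of the sketch is only that everything the pipeline needs (where to read, how to embed, where to write) lives in the config, so switching a source or target means editing the config rather than the code.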
About a month and a half ago, I was at the Netflix Data Engineering Open Forum where I presented some work that I've been doing on building a Data architect agent.
I was reviewing the slides last night and decided to share them publicly.
I would really appreciate your thoughts and feedback.
After launching our MVP late last year, one of the most consistent pieces of feedback we got was about our transformations function: users complained about its clunkiness and steep learning curve. After extensive interviews with our beta users, we realized that over 90% of users had some experience with DBT or were using DBT within their data organizations.
Building enterprise-level RAG solutions takes more than building the application; it also means building the data platform and infrastructure that the application depends on.
A few months ago, I taught a class on Vector Databases for a TGE Data private client and then decided to record it as a short course for a wider audience.
The course is a mix of theory and demos, covering some of the underlying concepts of Vectors, Vector Databases, Indexing, and Similarity Search, and ending with demos specifically for the Pinecone and Weaviate databases.
I already have a Udemy account, so when I went to check out and logged in, it said the promo was for new users only. But I went back to the course page and reloaded it so that I was logged in, and then I was able to buy the course with the coupon applied.
We'd love your feedback!
GitHub: https://github.com/ContextData/VectorETL
Documentation: https://vectoretl.contextdata.dev/index.html