Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

This is super interesting. Where do you store the ebooks and the metadata?


Ebooks in an S3 compatible storage bucket, metadata in a Postgres database. That has the huge advantage of being able to do full text search and kNN similarity right in the database, for example.

Colibri is built around a pretty solid data schema (I hope). Check out the migrations folder if you’re curious :-)


Not to push LLMs into everything, but does it make sense to also implement semantic search by the way I love what you’re doing.


Semantic search is planned as part of book content search, look here: https://github.com/colibri-hq/colibri/issues/45

LLMs might make sense to interact with your collection, so that could find its way into the app at some point. Plus, I've been experimenting with generating llms.txt for all routes to point your own LLM to.

On the other hand, I'm concerned with LLMs quite intensely at work, so it's nice to spend some time with plain, honest-to-god SQL for now!




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: