ruxudev's comments | Hacker News


All examples on this page assume manual prompt building; I didn't tackle this issue because I didn't want to get into the topic of automatically generating prompts via code.

But you are very right, this is an enormous issue right now for systems that create prompts programmatically. I am actively looking for solutions to this problem and would be very interested if anyone has a good one.


Very interesting use of ChatGPT and prompt engineering. For your problem, summarizing a large document, splitting the document into smaller parts is indeed the way to go. I ran into problems operating on large documents myself. In my case, I had an insurance policy that I wanted to extract information from.

My solution: use the OpenAI API to convert the document into OpenAI embeddings and save those embeddings to a vector database. Then use similarity search on the database to find chunks of the document that might be related to my query, and pass only those chunks to GPT in the information extraction prompt.
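In case it helps anyone, the flow looks roughly like this. A minimal offline sketch: `embed` is a toy bag-of-words stand-in for the real embeddings call (e.g. OpenAI's embeddings endpoint), and the similarity search is a plain cosine ranking instead of a vector database; the document text is made up.

```python
import math
import re
from collections import Counter

def embed(text):
    # Stand-in for the real embeddings call; a bag-of-words count
    # vector keeps this sketch runnable without an API key.
    return Counter(re.findall(r"\w+", text.lower()))

def cosine(a, b):
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def top_chunks(chunks, query, k=2):
    # Embed each chunk once (in practice these vectors live in a
    # vector database), then rank chunks against the embedded query.
    vectors = [(chunk, embed(chunk)) for chunk in chunks]
    query_vec = embed(query)
    ranked = sorted(vectors, key=lambda cv: cosine(query_vec, cv[1]), reverse=True)
    return [chunk for chunk, _ in ranked[:k]]

chunks = [
    "The policyholder must report a claim within 30 days.",
    "Premiums are due on the first of each month.",
    "Coverage excludes damage caused by flooding.",
]

# Only the best-matching chunk(s) get pasted into the extraction prompt for GPT.
context = top_chunks(chunks, "what is excluded from coverage?", k=1)
```

The point is that GPT only ever sees the retrieved chunks plus the question, so the token limit stops being a function of document size.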

I plan to create a guide on how to tackle these problems after I consolidate my findings.


Very cool solution!

My solution was to write a bit of code that writes a CSV, then use a langchain-based CSV agent on it. Since that agent calls out to pandas, it effectively has no token limit, but it also has no overview of the data, only what pandas tells it.
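Roughly, the agent side boils down to the LLM emitting a pandas expression that gets executed against the CSV. A toy sketch of that last step (the data and column names are made up):

```python
import pandas as pd

# Made-up stand-in for the CSV that the first step writes out.
df = pd.DataFrame({
    "region": ["north", "south", "north", "east"],
    "sales": [120, 80, 200, 150],
})

# For a question like "what are total sales per region?", the agent
# generates and runs pandas code along these lines, then feeds the
# result back to the model to phrase the answer:
result = df.groupby("region")["sales"].sum().to_dict()
```

Because only the question and the executed result pass through the model, the CSV itself can be arbitrarily large; the trade-off is that the model never sees rows it didn't explicitly query for.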


I'm increasingly seeing these kinds of solutions for similar tasks. I wonder if we are seeing the discovery of new abstractions from using LLMs.


Also doing this, works really well. Check out James Briggs on YouTube. Excellent tutorials on how to achieve this.


Can you refer to a specific video?


https://youtu.be/tBJ-CTKG2dM

Goes into the workings of retrieval augmentation, with an example.


This is how we enable 10 MB+ file ingestion and search with http://mixpeek.com

