Most online discussion doesn't contain the entire text of a document. You can pick almost any sentence from such a document and it'll be unique on the internet.

I was thinking it might be related to the difficulty of building a search engine over the huge training sets, but if you don't care about scaling or query performance, it shouldn't be too hard to set one up internally that's good enough for the job. Even sharded grep could work, or a filter applied at the time the dataset is loaded for model training.
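A minimal sketch of the load-time filter idea, assuming a hypothetical layout of newline-delimited JSON shards with a "text" field. Given the point above that almost any sentence is unique, a plain substring test against one or two distinctive sentences per document is usually enough to identify it exactly:

    import json
    from pathlib import Path

    # Hypothetical blocklist: one or two distinctive sentences sampled
    # from each document we want removed from the training set.
    BLOCKLIST = [
        "a distinctive sentence from the first document to remove",
        "a distinctive sentence from the second document to remove",
    ]

    def load_filtered(shard_dir):
        """Yield training texts, skipping any that contain a blocklisted sentence."""
        for shard in sorted(Path(shard_dir).glob("*.jsonl")):
            with shard.open() as f:
                for line in f:
                    text = json.loads(line)["text"]
                    if any(needle in text for needle in BLOCKLIST):
                        continue  # drop the matching document
                    yield text

There's no index to build or maintain: the cost is one linear scan per load, and the substring test is cheap relative to tokenization.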



Why use a search engine when you can use an LLM? ;)


Well, because the goal is to locate the exact documents in the training set and remove them, not answer a question...


So you stream the training set through the context window of the LLM, and ask it if it contains the requested document (also in the context window).

The advantage is that it can also detect variations of the document.
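A rough sketch of that, with query_llm standing in as a hypothetical wrapper around whatever chat-completion API is available; each chunk of the training stream is compared against the target document:

    def contains_document(training_texts, target_doc, query_llm, chunk_chars=8000):
        """Scan the training stream chunk by chunk, asking the LLM whether
        each chunk contains the target document or a variant of it."""
        hits = []
        for doc_id, text in enumerate(training_texts):
            for start in range(0, len(text), chunk_chars):
                chunk = text[start:start + chunk_chars]
                prompt = (
                    "Does the TRAINING TEXT contain the TARGET DOCUMENT, "
                    "or a paraphrase or near-duplicate of it? Answer YES or NO.\n\n"
                    f"TARGET DOCUMENT:\n{target_doc}\n\n"
                    f"TRAINING TEXT:\n{chunk}"
                )
                if query_llm(prompt).strip().upper().startswith("YES"):
                    hits.append((doc_id, start))  # flag this document for removal
        return hits

The trade-off versus the grep approach is cost: one LLM call per chunk per lookup instead of a one-time scan, in exchange for catching paraphrases that exact matching misses.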
