An important Cursor feature that no one else seems to have implemented yet is documentation indexing. You give it a base URL and it crawls and generates embeddings for API documentation, guides, tutorials, specifications, RFCs, etc. in a very language-agnostic way. That plus an agent tool to do fuzzy or full-text search on those same docs would also be nice. Referring to those @docs in the context works really well to ground the LLMs and eliminate API hallucinations.
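For concreteness, a minimal sketch of that crawl-embed-search loop might look like the following. It assumes the requests, beautifulsoup4, and sentence-transformers packages and a hypothetical docs URL; it is an illustration of the idea, not Cursor's implementation.

```python
from urllib.parse import urljoin, urlparse

import requests
from bs4 import BeautifulSoup
from sentence_transformers import SentenceTransformer, util

def crawl(base_url: str, max_pages: int = 50) -> dict[str, str]:
    """BFS over same-host links, collecting visible page text."""
    seen, queue, pages = set(), [base_url], {}
    host = urlparse(base_url).netloc
    while queue and len(pages) < max_pages:
        url = queue.pop(0)
        if url in seen:
            continue
        seen.add(url)
        try:
            soup = BeautifulSoup(requests.get(url, timeout=10).text, "html.parser")
        except requests.RequestException:
            continue
        pages[url] = soup.get_text(" ", strip=True)
        for a in soup.find_all("a", href=True):
            link = urljoin(url, a["href"]).split("#")[0]
            if urlparse(link).netloc == host:  # stay on the docs site
                queue.append(link)
    return pages

model = SentenceTransformer("all-MiniLM-L6-v2")
pages = crawl("https://docs.example.com/")  # hypothetical docs site
urls = list(pages)
embeddings = model.encode([pages[u] for u in urls])  # one vector per page

def search(query: str, k: int = 3) -> list[str]:
    """Return the k pages most semantically similar to the query."""
    scores = util.cos_sim(model.encode(query), embeddings)[0]
    return [urls[int(i)] for i in scores.argsort(descending=True)[:k]]
```

A real indexer would chunk pages rather than embed them whole, and would pair this with the full-text search tool mentioned above.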

Back in 2023 one of the Cursor devs mentioned [1] that they first convert the HTML to markdown, then do n-gram deduplication to remove nav, headers, and footers. The state of the art for chunking has probably gotten a lot better since then, though.

[1] https://forum.cursor.com/t/how-does-docs-crawling-work/264/3
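A rough sketch of what that n-gram dedup step could look like, operating on plain text for simplicity: shingles that recur across most pages are treated as nav/header/footer boilerplate, and lines made entirely of them are dropped. Again, an illustration, not Cursor's actual code.

```python
from collections import Counter

def ngrams(text: str, n: int = 5) -> set[tuple[str, ...]]:
    """All word-level n-grams in a piece of text."""
    words = text.split()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}

def strip_boilerplate(pages: list[str], threshold: float = 0.8) -> list[str]:
    """Drop lines whose n-grams all recur on >= threshold of the pages."""
    # Each n-gram is counted once per page it appears on.
    counts = Counter(g for page in pages for g in ngrams(page))
    common = {g for g, c in counts.items() if c >= threshold * len(pages)}
    cleaned = []
    for page in pages:
        kept = [line for line in page.splitlines()
                if not (ngrams(line) and ngrams(line) <= common)]
        cleaned.append("\n".join(kept))
    return cleaned
```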



The continue.dev plugin for Visual Studio Code provides documentation indexing. You provide a base URL and a tag; the plugin then scrapes the documentation and builds a RAG index. This allows you to use the documentation as context within chat. For example, you could ask "@godotengine what is a sprite?"
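If I remember the Continue config right, registering a docs source is roughly this in config.json (field names from memory, so check the current schema before relying on them):

```json
{
  "docs": [
    {
      "title": "godotengine",
      "startUrl": "https://docs.godotengine.org/en/stable/"
    }
  ]
}
```

The title becomes the @-tag you reference in chat.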


So this is why everything is going behind Anubis then?


Nah, Anubis combats systematic scraping of the web by data scrapers, not actual user agents.


In this case the scraper is the agent of the user. That doesn't make it not a scraper, and it can and will get trapped.


Cursor’s doc indexing is actually one of the few AI coding features that feels like it saves time. Embedding full doc sites, deduping nav/header junk, then letting me reference @docs inline genuinely improves context grounding instead of leaving the model to guess APIs.


Just use the Context7 MCP? Admittedly I'm assuming Void supports MCP.


Context7 is missing lots of information from the repos it indexes, and it's getting bloated with similar-sounding repos, which is becoming confusing for LLMs.


Can you elaborate on how Context7 handles document indexing or web crawling? If I connect to the MCP server, will it be able to crawl websites fed to it?


Agreed - this is one of the better solutions today.


This is a good point. We've stayed away from documentation, assuming that it's more of a browser-agent task, and I agree with other commenters that this would make a good MCP integration.

I wonder if the next round of models trained on tool-use will be good at looking at documentation. That might solve the problem completely, although OSS and offline models will need another solution. We're definitely open to trying things out here, and will likely add a browser-using docs scraper before exiting Beta.
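For illustration, the simplest version of such a docs tool is just fetch-and-convert. A hedged sketch, assuming the requests and html2text packages (a hypothetical tool name, and a real implementation would need a headless browser for JS-rendered sites):

```python
import requests
import html2text

def fetch_docs(url: str, max_chars: int = 8000) -> str:
    """Hypothetical agent tool: fetch a docs page and return it as markdown."""
    html = requests.get(url, timeout=10).text
    markdown = html2text.html2text(html)  # strip tags, keep structure
    return markdown[:max_chars]           # stay within the model's context
```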


I agree that on the face of it this is extremely useful. When I tried using it for multiple libraries it was a complete failure, though: it failed to crawl fairly standard MkDocs and Sphinx sites. I guess it's better for the 'built-in' ones that they've pre-indexed.


I use it mostly to index stuff like Rust docs on docs.rs and rendered mdbooks. The RAG is hit or miss but I haven’t had trouble getting things indexed.



