I’ve found it’s very useful for understanding snippets of code, like a 200-line ...

darkerside · on May 26, 2023

I can't think of a reason you couldn't specifically train an AI on your own large code base. After all, current LLMs are trained on effectively the entire internet.

zappchance · on May 26, 2023

Unless your documentation is 20~100x the size of your codebase and written in a conversational tone, the LLM won't be able to be asked any questions about it using English.

If your only aim is to use it like Copilot, sure, it's useful.

oceanplexian · on May 26, 2023

You might be able fine tune a model on pull requests if they have really high quality descriptions, high quality commit messages, and the code is well documented and organized.

still_grokking · on May 27, 2023

I'm really not sure this is irony or a serious comment.

sandos · on May 26, 2023

So the first step is to let the LLM write the documentation.... :)

still_grokking · on May 27, 2023

Sure. Because it understands the code so well.

blowski · on May 26, 2023

I haven't yet seen anything that can scan an entire codebase and build, say, a data lineage to understand how a value in the UI was calculated. I'm sure it's coming, though.

still_grokking · on May 27, 2023

It's coming, I'm sure.

Just right after we have invented AGI.