Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I’ve found it’s very useful for understanding snippets of code, like a 200-line function. But systems are more than “lots of 200-line functions” - there’s a lot of context hidden in flags, conditional blocks, data, Git histories.

Maybe one day, we’ll be able to run it over all the Git histories, Jira tickets, Confluence documentation, Slack conversations, emails, meeting transcripts, presentations, etc. Until then, the humans will need to stitch it all together as best they can.



I can't think of a reason you couldn't specifically train an AI on your own large code base. After all, current LLMs are trained on effectively the entire internet.


Unless your documentation is 20~100x the size of your codebase and written in a conversational tone, the LLM won't be able to be asked any questions about it using English.

If your only aim is to use it like Copilot, sure, it's useful.


You might be able fine tune a model on pull requests if they have really high quality descriptions, high quality commit messages, and the code is well documented and organized.


I'm really not sure this is irony or a serious comment.


So the first step is to let the LLM write the documentation.... :)


Sure. Because it understands the code so well.


I haven't yet seen anything that can scan an entire codebase and build, say, a data lineage to understand how a value in the UI was calculated. I'm sure it's coming, though.


It's coming, I'm sure.

Just right after we have invented AGI.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: