Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

When working with LLMs you can never exclude hallucinations entirely but we‘ve been carefully tuning the system over time and found it to be pretty high signal! The reason we display the code snippets is to make it easy to double check with the source


Can you "chunkify" output so that you can rate different elements independently? Like "This part of the answer is totally cool" and "Wait. This part right here includes a hallucination."

Then allow for feedback to be provided if an issue is spotted.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: