
I believe it depends on the inputs. For me, Claude 4 has consistently hallucinated, and it was especially confident when generating invalid JSON, for instance Grafana dashboards that were full of syntax errors.



