Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

SWE Bench doesn't even test bugfixing / feature dev properly after you achieve roughly 70% if you don't benchmaxx it .


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: