Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Developer Productivity AI Arena: Open Platform for Benchmarking AI Coding Agents (jetbrains.com)
2 points by janpio 24 days ago | hide | past | favorite | 1 comment


Very welcome for my work which is automatically generating Java code with JSON-compatible data types.

Codex CLI with the GPT-5-codex model is the current leader over Claude Code and Sonnet 4.5 in the current Java-oriented benchmarks.

The JetBrains contributed benchmarking harness will be soon available in a form that developers can use on their own.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: