Hacker Newsnew | past | comments | ask | show | jobs | submit | joshribakoff's commentslogin

Usually, the reporter inspects the document but does not get to take a copy

Completely broken.

Useless even.

There are far worse scandals about this company than advertising ethics.

You essentially need additional agents to implement the guardrails traditionally used to scale teams.

By not relying on AI to think for you.

I find claude models often use “tricks” like bash one liners, essentially excelling at surgical fixes. It does what i want more reliably, on smaller tasks.

GPT-5 can often be better at larger architectural changes, but i find that comes at the cost of instability/broken PRs. It often fails to capture intent or argues back, or just completely spirals out of control more often.

GPT-5 codex seemed to refuse valid requests like “make a change to break a test so we can test CI” (it over indexed on our agents.md and other instructions and then refused on the basis of “ethics” or some such)


More like “i tried what others claim extensively and it does not work for me, please let me know if im doing something wrong” — to which the response is often yours, reframing the observation as a fallacy.

Can you point me to some of those comments?

I genuinely haven't seen them.

I see many people insisting it didn't work when they tried it for some little thing, therefore it's broken and useless. And a few people saying, actually it works really well if you're willing to learn how to use it.

I'm not sure I've ever seen someone here saying it hasn't worked but they're open to learning how to use it right. It's definitely not common.


Its bad because basic stuff like “please commit these changes” can sometimes take 10+ minutes at times and causes it to spiral doing tangential stuff.

By running them in parallel you avoid sitting there watching paint dry for a task that takes 3 seconds by hand.

Its really not comparable to a junior, its more comparable to a salty maliciously compliant optimized to burn tokens and deceive you.


Counter argument: one is actually capable of reasoning, the other is predicting the next token and brute forcing until checks pass.

AI circumvents guardrails yes.

Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: