> This is the part where I simply don't understand the objections people have to coding agents
Because I have a coworker who is pushing slop at unsustainable levels, and proclaiming to management how much more productive he is. It’s now even more of a risk to my career to speak up about how awful his PRs are to review (and I’m not the only one on the team who wishes to speak up).
The internet is rife with people who claim to be living in the future where they are now a 10x dev. Making these claims costs almost nothing, but it is negatively affecting my day-to-day work and that of many others.
I’m not necessarily blaming these internet voices (I don’t blame a bear for killing a hiker), but the damage they’re doing is still real.
I don't think you read the sentence you're responding to carefully enough. The antecedent of "this" isn't "coding agents" generally: it's "the value of an agent getting you past the blank page stage to a point where the substantive core of your feature functions well enough to start iterating on". If you want to respond to the argument I made there, you have to respond to the actual argument, not a broader one that's easier (and much less interesting) to take swipes at.
I have to agree. My experience working on a team with mixed levels of seniority and coding experience is that everybody got some increase in productivity and some increase in quality.
The ones who spend more time developing their agentic coding as a skillset have gotten much better results.
In our team people are also more willing to respond to feedback because nitpicks and requests to restructure/rearchitect are evaluated on merit instead of how time-consuming or boring they would have been to take on.
> My experience working on a team with mixed levels of seniority and coding experience is that everybody got some increase in productivity and some increase in quality.
Is that true? There have been a couple of papers showing that people perceive themselves as more productive because the AI feels like motion (you're "stuck" less often), when in reality it has been a net negative.
Don't mention AI, just point out why the code is bad. I've had co-workers who were vim wizards and others who literally hunted and pecked to type. At no point did their tools ever come up when reviewing their code. AI is a tool like anything else; treat it that way. This also means that the OP's default can't be AI == bad; focus on the result.
I don't think it is a long term solution. More like training wheels. Ideally the engineers learn to use AI to produce better code the first time. You just have a quality gate.
Edit: Do I advocate for this? 1000%. This isn't crypto burning electricity to make a ledger. This objectively will make the life of the craftsmanship-focused engineer easier. Sloppy, execution-oriented engineers are not a new phenomenon, just magnified by the fire hose that an agentic AI can be.
The environmental cost of AI is mostly in training, afaik. The inference energy cost is similar to the Google searches and Reddit page loads you might do during handwritten dev, last I checked. This might be completely wrong though.
I hear this argument a lot, but it doesn’t hold water for me. Obviously the use of the AI is the thing that makes it worthwhile to do the training, so you obviously need to amortize the training cost over the inference. I don’t know whether or not doing so makes the environmental cost substantially higher, though.
> If you can describe why it is slop, an AI can probably identify the underlying issues automatically
I would argue against this. Most of the time, the things we find in review stem from extra considerations (business, architectural, etc.) that the AI doesn't have context for, and it is quite bothersome to provide that context.
I generally agree that results from vague one-shot prompting might vary.
I also feel all of those things can be captured over time in a compendium that is fed back in as input. For example, every time it is right or wrong, comment on it and add it to an .md file. Better yet, have the CLI AI tool append it.
We know that what is included as part of the prompt (like the above) gets attended to more reliably.
My intent isn't to make more work, it's just to make it easier to highlight the issues with code that's mindlessly generated, or is overly convoluted when a simple approach will do.
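The compendium idea above can be sketched mechanically. Here is a minimal, hypothetical helper that appends a dated, review-confirmed convention to a shared notes file; the filename `AGENTS.md` and the function name are illustrative assumptions, not from any particular tool:

```python
from datetime import date
from pathlib import Path

def log_convention(note: str, path: str = "AGENTS.md") -> None:
    """Append one reviewed-and-confirmed convention to the compendium file.

    The file is plain markdown, so a CLI agent that reads it as context
    will pick the note up on future runs. AGENTS.md is an assumed name.
    """
    entry = f"- {date.today().isoformat()}: {note}\n"
    with Path(path).open("a", encoding="utf-8") as f:
        f.write(entry)

# Example: record a nitpick from review so it isn't repeated.
log_convention("Prefer the existing retry helper over ad-hoc retry loops.")
```

The point is only that each review finding costs one line to record, after which it is part of every future prompt.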
> i.e. ones that can run offline and fake apis/databases
I can see a place for this, but these are no longer e2e tests. I guess that's what "hermetic" means? If so, it's almost sinister to still call them e2e tests. They're just frontend tests.
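For concreteness, the "hermetic" style under discussion usually means the app talks to an injected client and the test swaps in an in-memory fake for the real API or database. A minimal sketch, with entirely made-up names (`UserService`, `FakeUserApi`):

```python
class FakeUserApi:
    """In-memory stand-in for a real user API; no network involved."""
    def __init__(self):
        self._users = {}

    def create(self, name: str) -> int:
        uid = len(self._users) + 1
        self._users[uid] = name
        return uid

    def get(self, uid: int) -> str:
        return self._users[uid]

class UserService:
    """App logic under test; depends only on the client's interface."""
    def __init__(self, api):
        self.api = api

    def register(self, name: str) -> int:
        if not name.strip():
            raise ValueError("name required")
        return self.api.create(name)

def test_register_roundtrip():
    svc = UserService(FakeUserApi())
    uid = svc.register("ada")
    assert svc.api.get(uid) == "ada"

test_register_roundtrip()
```

Such a test runs offline and survives refactors of everything behind the interface, but, per the objection above, it never exercises the real integration point.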
> A) refactor pretty much everything underneath them without breaking the test
This should always be true of any type of test, unless it's behavior you want to keep from breaking.
> B) test realistically (an underrated quality)
Removing major integration points from a test is anything but realistic. You can do this, but don't pretend you're getting the same quality as colloquial e2e tests.
> C) write tests which more closely match requirements rather than implementation
If you’re ever testing implementation you’re doing it wrong. Tests should let you know when a requirement of your app breaks. This is why unit tests are often kinda harmful. They test contracts that might not exist.
> try to isolate the bugs to smaller units of code (or interactions between small pieces of code).
This is why unit tests before e2e tests.
It's higher risk to build on components without unit test coverage, even if the paltry smoke/e2e tests say it's fine per the customer's input examples.
Is it better to fuzz low-level components or high-level user-facing interfaces first?
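On fuzzing low-level components first: the cheapest version needs nothing beyond the standard library. A toy sketch, where `parse_csv_line` stands in for any small unit and the loop checks an invariant rather than specific cases, so a failure localizes the bug to that one unit:

```python
import random
import string

def parse_csv_line(line: str) -> list:
    """Toy low-level component: split a comma-separated line into fields."""
    return line.split(",")

random.seed(0)  # deterministic run for reproducibility
for _ in range(1000):
    # Generate fields with no commas, so the round trip must be exact.
    fields = ["".join(random.choices(string.ascii_letters, k=random.randint(0, 5)))
              for _ in range(random.randint(1, 4))]
    line = ",".join(fields)
    # Invariant: parsing the joined fields reproduces them exactly.
    assert parse_csv_line(line) == fields
```

Fuzzing the high-level UI instead finds bugs too, but attributing a failure to the responsible component then requires the very isolation work the parent comment describes.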
IIUC in relation to Formal Methods, tests and test coverage are not sufficient but are advisable.
Competency Story: The customer and product owner can write BDD tests in order to validate the app against the requirements
Prompt: Write playwright tests for #token_reference that run a named, factored-out login sequence, and then test, as a human user would, that clicking Home navigates to / (given browser MCP and, recently, the Gemini 2.5 Computer Operator model)