Developers do not in fact tend to read all the software they use. I have never once looked at the code for jq, nor would I ever want to (the worst thing I could learn about that contraption is that the code is beautiful, and then live out the rest of my days conflicted about my feelings about it). This "developers read code" thing is just special pleading.
You're a user of jq in the sense of the comment you're replying to, not a developer. The developer is the developer _of jq_, not developers in general.
Yes, that's exactly how I meant it. I might _rarely_ peruse some code if I'm really curious about it, but by and large I just trust the developers of the software I use and don't really care how it works. I care about what it does.
But you read your coworkers' PRs. I decided this week that I wouldn't read/correct the AI-generated docs and unit tests from 3 of my coworkers, because otherwise I would never be able to get my own work done. They produce twice as much poor output in 10 times the number of changed lines; that's too much.
Right, I'm not arguing developers don't read their own code or their teammates code or anything that merges to main in a repo they're responsible for. Just that the "it's only worth reading if someone took the time to actually write it" objection doesn't meaningfully apply to code in Show HN's --- there's no expectation that code gets read at all. That's why moderation is so at pains to ensure there's some way people can play with whatever it is being shown ("sign up pages can't be Show HN's").
The key part is *where reliability matters*; there are not that many cases where it does.
We tell stories about the Therac-25, but 90% of software out there doesn't kill people. It annoys people and wastes time, yes, but reliability doesn't matter as much.
E-mail, the internet and networking, operations on floating point numbers: all of these are only somewhat reliable. No one is saying they will not use email because it might not be delivered.
Reliability matters in lots of areas that aren't war. Ignoring obvious ones like medicine/healthcare and driving, I want my banking app to be reliable. If they charge me $100 instead of $1 because their LLM didn't realize their currency was stored in floating point dollars and not cents, then I may not die but I'd be pretty upset!
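To illustrate the kind of unit confusion I mean, here's a minimal sketch (the function and values are made up; assume the payment API takes dollars):

```python
def charge(amount_dollars: float) -> None:
    # Stand-in for a payment API that expects an amount in dollars.
    print(f"charging ${amount_dollars:.2f}")

stored_price = 1.0  # stored as floating-point *dollars*, i.e. $1.00

# Buggy generated code: assumes the stored value is in cents and "converts" it.
charge(stored_price * 100)  # charges $100.00

# Correct call: the value is already in dollars.
charge(stored_price)        # charges $1.00
```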
As we give more and more autonomy to agents, that % may change. Just yesterday I was looking at hexapods, and the first thing it tells you (with a disclaimer that it's for competitions only) is that it has a lot of space for a weapon install. I had to briefly look at the website to make sure I hadn't accidentally clicked on some satirical link.
The main point is that there are many more lines of code, and running instances, of CRUD business apps on AWS than of even non-autonomous car software, even though we do have lots of cars.
Because even if an organisation hasn't rolled out generative AI tools and policies centrally yet, individuals might just use their personal plans anyway (potentially in violation of their contract)? I believe that's called "shadow AI".
It wouldn't keep them from equipping _new_ models with additional sensors, spinning a story around how this helps them train the camera-only AI, or whatever.
It's vaporware and it's dollars and cents. Tesla EVs are already too expensive. He has no margin to spend thousands more on sensors, the alternative being the lawsuits that would follow if he admits it was all vaporware.
Fairly cynical indeed. Though I must admit that Anthropic's software - not the models, the software they build - seems to be generally plagued by quality issues. Even the dashboard is _somehow_ broken most of the time, at least whenever I try to do something.
> Anyway, it's not related to CoPilot, but because Notepad makes links clickable now...
True, not related to CoPilot, but if I understand your conclusion right (which I'm not sure about), it's not _just_ that links are clickable now, it's because Notepad actually does something with the links. Otherwise it'd be a browser vulnerability, and Notepad couldn't seriously be blamed.
It's in fact the opposite. Browsers show a popup that asks whether you really intended to click a link with a non-http/https handler; Notepad does not.
The actual RCE here would be in some other application that registers a URL handler. Java used to ship one that was literally designed to run arbitrary code.
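For anyone unfamiliar with the mechanism: a URL handler is just a program registered for a scheme, and the OS hands it the full URL when such a link is opened. A minimal sketch of what that registration looks like on Windows, with a purely hypothetical scheme name and command:

```python
# Windows-only sketch; "myapp" and the exe path are invented for illustration.
import winreg

SCHEME = "myapp"
COMMAND = r'"C:\Program Files\MyApp\myapp.exe" "%1"'  # %1 receives the full URL

# HKCU\Software\Classes\myapp marks the scheme and points it at a command line.
key = winreg.CreateKey(winreg.HKEY_CURRENT_USER, rf"Software\Classes\{SCHEME}")
winreg.SetValueEx(key, "", 0, winreg.REG_SZ, f"URL:{SCHEME} protocol")
winreg.SetValueEx(key, "URL Protocol", 0, winreg.REG_SZ, "")
cmd = winreg.CreateKey(key, r"shell\open\command")
winreg.SetValueEx(cmd, "", 0, winreg.REG_SZ, COMMAND)

# Anything that "opens" a myapp:// link now runs that command with the URL as
# an argument; if the handler executes part of the URL, that's where the RCE is.
```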
I think most of us - if not _all_ of us - don't know how to use these things well yet. And that's OK. It's an entirely new paradigm. We've honed our skills and intuition based on humans building software. Humans make mistakes, sure, but humans have a degree and style of learning and failure patterns we are very familiar with. Humans understand the systems they build to a high degree; this knowledge helps them predict outcomes, and even helps them achieve the goals of their organisation _outside_ writing software.
I kinda keep saying this, but in my experience:
1. You trade the time you'd take to understand the system for time spent testing it.
2. You trade the time you'd take to think about simplifying the system (so you have less code to type) for execution (so you build more in less time).
I really don't know if these are _good_ tradeoffs yet, but it's what I observe. I think it'll take a few years until we truly understand the net effects. The feedback cycles for decisions in software development and business can be really long, several years.
I think the net effects will be positive, not negative. I also think they won't be 10x. But that's just me believing stuff, and it is relatively pointless to argue about beliefs.
What kills me personally is that I'm constantly 80% there, but the remaining 20% can be just insurmountable. It's really like gambling: Just one more round and it'll be useful, OK, not quite, just one more, for hours.
Do you mean in terms of adding one more feature or in terms of how a feature you're adding almost works but not quite right?
I find the latter a lot more challenging; it's hard to cut my losses when it's on a good run (and often even when I know I could just write the thing by hand), especially because there's as much if not more intrigue about whether the tool can accomplish it at all. Those are the moments where my mind has drifted to thinking about it in exactly the way you describe here.
And let's get real: AI companies will not be satisfied with you paying $20 or even $200 a month if you can actually develop your product in a few days with their agents. They are either going to charge a lot more or string you along chasing that 20%.
That's an interesting business model actually: "Oh hey there, I see you've almost finished your project and are ready to launch; watch these adverts and participate in this survey to get the last 10% of your app completed."
No, I kind of see this too, but the 80% is very much the simpler stuff. AI genuinely saves me some time, but I always notice that when I try to "finish" a relatively complex task that's a bit unique in some regard, something slightly domain-related maybe, I start prompting and prompting and banging my head against the terminal window trying to make it understand the issue, yet somehow it still doesn't turn out well at all, and I end up throwing out most of the work done from that point on.
Sometimes it looks like some of that comes from the AI generally being very, very sure of its initial idea ("The issue is actually very simple, it's because...") and then running around in circles once it tries and fails. You can pull it out with a bit more prompting, but it's tough. The thing is, it sometimes actually is right from the very beginning, but if it isn't...
This is just my own perspective after working with these agents for some time, I've definitely heard of people having different experiences.
Previously, I'd have an idea and sit on it for a while. In most cases, I'd conclude it wasn't worth investing in. If I decided to invest, I'd think of a proper strategy to approach it.
With agentic development, I have an idea, waste a few hours chasing it, then switch to other work, often abandoning the thing entirely.
I still need to figure out how to deal with that, for now I just time box these sessions.
But I feel I'm trading thinking time for execution time, and understanding time for testing time. I'm not yet convinced I like those tradeoffs.
Edit: Just a clarification: I currently work in two modes, depending on the project. In some, I use agentic development. In most, I still do it "old school". That's what makes the side effects I'm noticing so surprising. Agentic development pulls me down rabbit holes and makes me lose the plot and focus. Traditional development doesn't; its side effects apparently keep me focused and in control.
That's weird, I'm the opposite. Previously I would start coding immediately, because writing the code helps me figure out the what and the how, and because I'd end up with modular/reusable bits that will be helpful later anyway.
Now I sit on an idea for a long time, writing documentation/specs/requirements because I know that the code generation side of things is automated and effortlessly follows from exhaustive requirements.
I used to do this, but found scoping my agent usage down to smaller chunks got better results than trying to do it all from the get-go. And looking back it makes sense - code is the most expressive form we have to tell the computer what to do, not English.
The size of the chunk varies heavily with what I'm doing, ofc.
How to do that for a complex greenfield project? Do you scope out just one module and let the agent code only that, with the intention of reusing that module later in the full project?
>With agentic development, I have an idea, waste a few hours chasing it, then switch to other work, often abandoning the thing entirely.
How much of this is because you don't trust the result?
I've found this same pattern in myself, and I think the lack of faith that the output is worth asking others to believe in is why it ends up being a throwaway for me. Just yesterday, in a meeting, someone mentioned a project underway that I had ostensibly solved six months ago, but I didn't even demo my solution because I didn't have any real confidence in it.
I do find that's changing for myself. I actually did demo something last week that I 'orchestrated into existence' with these tools. In part because the goal of the demo was to share a vision of a target state rather than the product itself. But also because I'm much more confident in the output. In part because the tools are better, but also because I've started to take a more active role in understanding how it works.
Even if the LLMs come to a standstill in their ability to generate code, I think the practice of software development with them will continue to mature to a point where many (including myself) will start to have more confidence in the products.
If you do not know what you want to build, how to ask the AI for it, or what the correct requirements are, then it becomes a waste of time and money.
More importantly, as the problem becomes more complex, it matters more whether you know where the AI falls short.
Case study: Security researchers were having a great time finding vulnerabilities and security holes in Openclaw.
The Openclaw creators had a very limited background in security even though the AI built Openclaw entirely, and the authors had to collaborate with security experts to secure the whole project.
That describes the majority of cases actually worth working on as a programmer in the traditional sense of the word. You build something to begin to discover the correct requirements and to picture the real problem domain in question.
> You build something to begin to discover the correct requirements and to picture the real problem domain in question.
That's one way; another way is to keep the idea in your head (both actively and "in the background") for days/weeks, and then eventually you sit down and write a document, and you'll get 99% of the requirements down perfectly. Then implementation can start.
Personally I prefer this hammock-style development, and to me it seems better at building software that makes sense and solves real problems. Meanwhile, "build something to discover" usually works best when you're working with people who need to see something to believe there is progress, but the results are often worse and less well thought out.
It's better to first have a solid, concrete idea of the entire system you want to build written down, one that has ironed out the limitations, requirements, and constraints, before jumping into the code implementation or getting the agent to write it for you.
The build-something-to-discover approach is not for building robust solutions in the long run. Starting with the code without knowing what you are solving, or getting the AI to generate something half-working that breaks easily and then changing it yet again so it becomes even more complicated, just wastes more time and tokens.
Someone still has to read the code and understand why the project was built on a horrible foundation and needs to know how to untangle the AI vibe-coded mess.
> You build something to begin to discover the correct requirements and to picture the real problem domain in question.
You lose that if the agent builds it for you, though; there is no iteration cycle for you, only for the agent. This means you are missing out on a bunch of learning that you would previously have gotten from actually writing something.
Prior to agents, more than once a week I'd be writing some code and pick up some new trick/technique/similar. I expect that if you feel there are no programming skills or tricks left for you to learn, then sure, you aren't missing out on anything.
OTOH, I've been doing this a long time, and I still learn new things (for implementation, not design) on each new non-trivial project.
It might depend on how you word it. I specifically asked about a CalDAV and a Firefox Sync solution, explaining how difficulty-averse I was, and I was berated both times.
Sitting on an idea doesn’t have to mean literally sitting and staring at the ceiling, thinking about it. It means you have an idea and let it stew for a while, your mind coming back to it on its own while you’re taking a shower, doing the dishes, going for a walk… The idea which never comes back is the one you abandon and would’ve been a waste of time to pursue. The idea which continues to be interesting and popping into your head is the worthwhile one.
When you jump straight into execution because it’s easy to do so, you lose the distinction.
Sitting on an idea doesn't necessarily mean being inactive. You can think at the same time as doing something else. "Shower thoughts" are often born of that process.
I know, and letting an agent/LLM "think" about some ideas does not waste your time either. Yes, it "wastes" energy, and you need to read and think about the results afterwards; we don't have neural interfaces to computers, so the inner thinking feedback loop will always be faster. But I keep thinking the GP comment was unfair: you can just as well keep the idea in the background to check whether it is good or not, and after that time "discuss" it with an LLM, or ask it to implement the idea because you think it's solid enough. It's a false dichotomy.
I understand you, and I felt the same for a few days: the dopamine rush was hitting hard. You just need to control it (with a very big "just"), like any other dopamine rush.
With agentic development, I've finally considered doing open source work for no reason aside from the utility existing.
Before, I would narrow things down to only the most potentially economically viable ideas, and laugh at idea guys who were married to the one single idea of their life as if it were their only chance, seemingly not realizing they were competing with people who get multiple ideas a day.
Back to the aforementioned epiphany: it reminds me of the world of Star Trek, where everything was developed for curiosity and utility instead of money.
> Ads do not influence the answers ChatGPT gives you.
I wonder if this is a don't-break-product-value thing, or just compliance (ads need to be clearly labeled, but OpenAI seems like it has the risk appetite to ignore that kind of thing).
OpenAI has to do this if it wants to get big advertisers.
Ads need to be clearly marked per the FTC.
> According to guidelines from the Federal Trade Commission (FTC) in the U.S. and similar regulatory bodies worldwide, online advertisements—including sponsored content, native advertising, and influencer posts—must be readily identifiable as paid content to prevent deceiving consumers.
(1) is not something the typical employee can do, in my experience. They're expected to work eight hours a day. Though I suppose the breaks could be replaced with low-effort / low-brainpower work to implement a version of that.
Work for a smaller company with more reasonable expectations of a knowledge worker.
You're an engineer, not a manager, or a chef, or anything else. Nothing you do needs to be done Monday-Friday between the hours of 8 and 5 (except for meetings). Sometimes it's better if you don't do that, actually. If your work doesn't understand that, they suck and you should leave.
If it's not worth reading something the writer didn't take the time to write, by extension that means nobody read the code.
Which means nobody understands it, beyond the external behaviour they've tested.
I'd have some issues with using such software, at least where reliability matters. Black-box testing only gets you so far.
But I guess as opposed to other types of writing, developers _do_ read generated code. At least as soon as something goes wrong.