Hacker Newsnew | past | comments | ask | show | jobs | submit | MadnessASAP's commentslogin

The threat model doesn't really change for agents that already have "web fetch" (or equivalent) enabled. The agent is free to communicate with untrusted websites[1]. As before, the firewall remains at what private information the agent is allowed to have.

[1] If anything the threat gets somewhat reduced by the ability to point directly at a trusted domain and say "use this site and it's (presumably) trusted tools."


Fair point about web fetch already being a trust boundary. The difference I see is that web fetch returns data, but WebMCP tools can define actions. A tool called "add_to_cart" is a lot more dangerous than fetching a product page. The agent trusts the tool's name and description to decide whether to call it, and that metadata comes from the site.

But yeah, if you're already letting agents browse freely, the incremental risk might be smaller than I'm imagining.


It's genuinely amazing how good they are at reverse engineering.

I have a silly side project that ended up involving the decompilation of a toaster ovens firmware, the firmware of the programmer for said toaster ovens MCU, and the host side programming software. They were able to rip through them without a problem, didn't even have ghidra setup, they just made their own tools in python.


But now your asking the compiler to also be a daemon. The compiler devs to add and maintain a not insignificant feature. The compiler to keep everything it caches for queries (whoops, look at that RAM usage climb!), and to cache data suitable for answering LSP queries (gosh! It's climbing a lot!)

Why does it need to be a daemon? Why isn't the on-disk cache enough? Running a compiler without making changes is instant. Many operations of LSPs feel slower than compiler's incremental compilation.

What cache is needed that the compiler doesn't already have?


I personally mostly enjoyed using https://pcb.etaras.com/studio.html

The big feature for me was being able to have multiple photos of the PCB side by side with perspective corrections so locations were correlated across the board.


Anybody who has worked a service/retail job can tell you that the person literally shitting on the floor rarely is the one to clean it up.

And unfortunately that extends to the metaphor as well. Society would like to see those responsible for the mess to also be responsible for the cleanup. However society expects that everybody but the mess maker will be left cleaning up.


Yeah, tax payers will pay the refund, and the interest accrued on the refund -- when the makaes it's wats through the courts in 3 years

As I tell my friends

You dont see those "coincidental" ads because your phone is listening to you, you see them because your freind showed interest in the product and theirs enough information to infer they talked to you about it. The good news is, your phone isn't listening to you without your consent. The bad news is, because it doesnt need to.


It's pretty typical these days for software, particularly games of the DX9-11 eras to perform better on Wine/Proton then they do under native Windows on the same hardware.


They rarely are IO constrained.


I wish I could believe in more conspiracy theories. At least then I might believe there was some sort of master plan, that some individual or group had some image of a better world (to them) and that the world was being steered somewhere.

Unfortunately no, it just seems to be greed, incompetence, and incompetent greed. At least when a tank drives over a protestor somebody gets to be on the side of the tank. When the bus goes off a cliff because the driver sold the steering wheel everybody dies.


Absolutely. It’s not some grand replacement theory. It’s not an intellectual master plan. It’s mostly plain greed and cynicism from the powerful, plus ignorance or a resigned belief that people cannot be changed from everyone else.

I’m in the second group. When a majority of people miss the basics, when a large chunk treat internet content as daily reality rather than algorithmically served rage bait, it feels like there’s nothing you can do.

A friend once told me, “I wish I were more schizo like before, it was much more fun,” and in a bleak way, I get it. I’d almost prefer it if there really were a coherent plan, some deliberate attempt by the mighty to steer civilization. But right now it mostly looks like greed and cynicism. These days, a lot of it seems to be coming out of Silicon Valley but it will change as it always does like it did before.


The owner of 4chan met with an Epstein associate 3 days before reinstating /pol/ which lead to the destruction of America.

Epstein was trying to remove tax on banker bonuses in the UK for some reason.

There might not be a single master plan but holy hell is this stuff intertwined with everything that happens.


Schizos would be schizos anywhere else. Widely available access to information which are biased towards your own bias mostly did that. Most of the people don't understand technology in general nor the algoritmic content suggestion. That is what the real problem is.


>Schizos would be schizos anywhere else.

May I introduce you to https://en.wikipedia.org/wiki/Sluggish_schizophrenia

>Most of the people don't understand technology in general nor the algoritmic content suggestion. That is what the real problem is.

There isn't a common understanding of these mechanisms, because the first thing they were used for, was to brand as "defective" anyone pursuing such understanding on their own terms.

Of course you could always do it by the book i.e. go in blind and debt-enslave yourself until loss of capacity for disentanglement. A small number of such functionaries are indeed required to maintain a colony; and then some surplus ones to keep the first one in their place.

Is that a "conspiracy"? In the sense that you're stuck breathing in sync with a lot of strangers, sure. In the sense of secret master plan? Nah bruh, it's all been out in the open all along. Just mindkillingly terrifying to most of yall. Hence all the phatics.


>I wish I could believe in more conspiracy theories.

Username checks out... well, I can help ya.

You start out easy, like "who invented all those damn conspiracy theories and introduced them into the public culture, anyway?"


My experience is that Codex follows directions better but Claude writes better code.

ChatGPT-5.2-Codex follows directions to ensure a task [bead](https://github.com/steveyegge/beads) is opened before starting a task and to keep it updated almost to a fault. Claude-Opus-4.5 with the exact same directions, forgets about it within a round or two. Similarly, I had a project that required very specific behaviour from a couple functions, it was documented in a few places including comments at the top and bottom of the function. Codex was very careful in ensuring the function worked as was documented. Claude decided it was easier to do the exact opposite, rewrote the function, the comments, and the documentation to saynit now did the opposite of what was previously there.

If I believed a LLM could be spiteful, I would've believed it on that second one. I certainly felt some after I realised what it had done. The comment literally said:

  // Invariant regardless of the value of X, this function cannot return Y
And it turned it into:

  // Returns Y if X is true


That's so strange. I found GPT to be abysmal at following instructions to the point of unusability for any direction-heavy role. I have a common workflow that involves an orchestrator that pretty much does nothing but follow some simple directions [1]. GPT flat-out cannot do this most basic task.

[1]: https://github.com/Vibecodelicious/llm-conductor/blob/main/O...


Strange behaviour and LLMs are the iconic duo of the decade. They've definitley multiplied my productivity, since now instead of putting off writing boring code or getting stuck on details till I get frustrated and give up I just give it to an agent to figure out.

I don't thing my ability read, understand, and write code is going anywhere though.

Neat tool BTW, I'm in the market for something like that.


I've found this orchestrator+reviewer+judge setup to yield much better results than anything else I've tried. And it's such a simple setup - a few markdown files.

I'm also creating one that is similar, but purpose-built for making the plans that this setup can orchestrate. It still needs some tweaking to get agents to follow it better - it still takes additional prompting to nudge it down the proper path. But I've had similar benefits - sending plans through this adversarial review loop has yielded significant improvements in final output.

https://github.com/Vibecodelicious/llm-conductor/blob/main/p...


Unrelated but this just happened and I thought of you ;-)

  › commit this, **SKIP BEADS**


  • Ran git status -sb
    └ ## main...origin/main [ahead 4]
       M containers/frigate.nix
      ?? .beads/bd.sock.startlock

  • I see an untracked .beads/bd.sock.startlock file that might be normal but needs clarification. I also note the requirement to include a bead and confirm bead readiness
    before proceeding, so I'll ask whether to create the bead and if the untracked file should be included in the commit.

  ───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────

  • I can’t skip beads for this repo. The AGENTS.md rules are explicit: no work (including commits) without an open bead. Please confirm you want me to create a bead for
    this commit.
I don't know what's wrong with your Codex, but mine can't bring itself to break the rules.


From my experience, OpenAI Codex loves reverse engineering work. In one case it did a very thorough job of disassembling a 8051 MCUs firmware and how it spoke to its attached LCD controller.

Another (semi-related) project, given the manufacturers of above MCUs proprietary flashing SDK, it found the programmers firmware, extracted the decryption key from the updating utility, decrypted the firmware and accompanying flashing software and is currently tracing the necessary signals to use an Arduino as a programmer.

So not only is it willing, it's actually quite good at it. My thinking is that reverse engineering is a lot of pattern recognition and not a lot of "original thinking". I.e. the agent doesn't need to come up with anything new, just recognise what already exists.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: