More

jaytaylor · 2026-02-08T06:23:41 1770531821

(DTU creator here)

I did have an initial key insight which led to a repeatable strategy to ensure a high level of fidelity between DTU vs. the official canonical SaaS services:

Use the top popular publicly available reference SDK client libraries as compatibility targets, with the goal always being 100% compatibility.

You've also zeroed in on how challenging this was: I started this back in August 2025 (as one of many projects, at any time we're each juggling 3-8 projects) with only Sonnet 3.5. Much of the work was still very unglamorous, but feasible. Especially Slack, in some ways Slack was more challenging to get right than all of G-Suite (!).

Now I'm part way through reimplementing the entire DTU in Rust (v1 was in Go) and with gpt-5.2 for planning and gpt-5.3-codex for execution it's significantly less human effort.

IMO the most novel part to this story is Navan's Attractor and corresponding NLSpec. Feed in a good Definition-of-Done and it'll bounce around between nodes until it gets it right. There are already several working implementations in less than 24 hours since it was released, one of which is even open source [0].

[0] https://github.com/danshapiro/kilroy

ukuina · 2026-02-08T07:46:51 1770536811

Been toying around with DTs myself for a few months. Until December, LLMs couldn't correctly hold large amounts of modeled behavior internally.

Why the switch from Go to Rust?

jaytaylor · 2026-02-08T19:48:15 1770580095

I'm testing a theory that large-scale (LoC) generated projects in Rust tend to have fewer functional bugs compared to e.g. Go or Java because Rust as a language is a little stricter.

I've not yet formed a full opinion or conclusion, but in general I'm starting to prefer Rust.

Re: generalizing mocks, it sounds interesting but after getting full-fidelity clones of so many multi-billion dollar SaaS offerings, I really like it and am hooked. It pays nice dividends for developing using agentic coders at high scale. In a few more model releases having your own exhaustive DTU could become trivial.

knuckleheads · 2026-02-08T07:45:33 1770536733

Are the digital twins open source anywhere, or available as a service somehow? They sound useful to use!

2026-02-08T07:53:12 1770537192

[dead]

sincerely · 2026-02-08T09:49:27 1770544167

Am I growing too paranoid, or are you using AI to generate the comments posted on this account?

rob · 2026-02-08T10:16:08 1770545768

It's 100% another bot account:

https://news.ycombinator.com/threads?id=Zakodiac

This one's a bit clever in that it actually comments back.

I feel like I've been pointing them out too much lately so I wanted to wait until somebody else did first.

They all seem to take advantage of accounts that are a few years old with zero posts and then suddenly make a bunch of AI-generated comments on a single day, like this one did (account from 2023, no posts until today.)

The last bot I pointed out that did the same thing ended up having its "owner" make a post about it that didn't get any attention:

https://news.ycombinator.com/item?id=46901199

ealexhudson · 2026-02-08T11:15:01 1770549301

What would be great, and I don't know if @dang / the mods would take on requests like this, would be for bot participants to be allowed but the account flagged. So e.g. the user name just says "[bot] Zakodiac" or something.

As well as being an ethical approach - I think it's wrong to try to impersonate humans and/or not announce AI output as AI - it would also be handy for new filter options: all bot posts are OK, hide bot leaf comments, or hide all threads with bot comments. etc.

[edited as my robot unicode/emoji char didn't come through]

hmcamp · 2026-02-08T12:02:22 1770552142

How can you tell?

hmcamp · 2026-02-08T12:03:21 1770552201

What are the signals or tells?

simonw · 2026-02-08T12:57:42 1770555462

Comments like "X is the right track [...] Then finish with a question?" do have a bit of an LLM smell to them.

The finishing with a question thing is prevalent with both accounts on Twitter, presumably because it "drives engagement" with the accounts.

It's particularly frustrating because it amplifies how much time is wasted - people don't just waste time reading comments by bots, they then invest effort in thinking about and replying to them.

jaytaylor · 2026-02-08T08:06:30 1770537990

> The Go to Rust rewrite is interesting - was that driven by performance or more about the ecosystem/tooling for this kind of work?

I'm testing a theory that large-scale (LoC) generated projects in Rust tend to have fewer functional bugs compared to e.g. Go or Java because Rust as a language is a little stricter.

I've not yet formed a full opinion or conclusion, but in general I'm starting to prefer Rust.

Re: generalizing mocks, it sounds interesting but after getting full-fidelity clones of so many multi-billion dollar SaaS offerings, I really like it and am hooked. It pays nice dividends for developing using agentic coders at high scale. In a few more model releases having your own exhaustive DTU could become trivial.

2026-02-08T08:22:12 1770538932

[dead]

throwup238 · 2026-02-08T12:48:39 1770554919

> The tradeoff is LLMs still struggle to produce good idiomatic Rust consistently so it takes more iteration cycles to get there (good agent tooling helps, linting/checks/etc.) The compile times on those iterations can be brutal sometimes depending on the project size which adds up for sure. The crafty agents can still find ways to satisfy the compiler without actually solving the problem correctly too, so the cheating risk of course doesn't fully go away.

I’ve gone ahead and completely banned ‘unwrap_or_default’ and a bunch of other helpful functions because LLMs just cannot be trusted to use them properly.

jaytaylor · 2026-02-08T02:37:41 1770518261

Can this work for Claude? I think it might be raw API only.

spondyl · 2026-02-08T04:27:10 1770524830

I'm not sure I understand the question? Are you perhaps asking if messages can be batched via Claude Code and/or the Claude web UI?

jaytaylor · 2026-02-08T04:37:05 1770525425

Yes, Claude code.

HumanOstrich · 2026-02-08T07:16:19 1770534979

jaytaylor · 2026-02-07T23:18:37 1770506317

(StrongDM AI team member here)

This is great feedback, appreciate you taking the time to post it. I will set some agents loose on optimization / purification passes over CXDB and see which of these gaps they are able to discover and address.

We only chose to open source this over the past few days so it hasn't received the full potential of technical optimization and correction. Human expertise can currently beat the models in general, though the gap seems to be shrinking with each new provider release.

nmilo · 2026-02-07T23:33:25 1770507205

Hey! That sounds an awful lot like code being reviewed by humans

jaytaylor · 2026-02-07T22:31:17 1770503477

I'm one of the StrongDM trio behind this tenet. The core claim is simple: it's easy to spend $1k/day on tokens, but hard (even with three people) to do it in a way that stays reliably productive.

jaytaylor · 2026-01-14T19:50:23 1768420223

https://jaytaylor.com (personal site)

All of my web properties have been ad-free since the beginning, going on 25 years. Cheers.

jaytaylor · 2025-11-24T21:12:28 1764018748

(Co-creator here) This is one of the use cases for Leash.

https://github.com/strongdm/leash

Check it out, feedback is welcome!

Previously posted description: https://news.ycombinator.com/item?id=45883210

jaytaylor · 2025-11-11T01:49:35 1762825775

This is a really neat project .

At my company (StrongDM) we recently open-sourced a tool in this space called Leash: https://github.com/strongdm/leash

By default it runs in docker, and also includes an extra sophisticated macOS-native --darwin mode which goes beyond the capabilities and guarantees of the likes of sandbox-exe, bubblewrap, and in some ways docker. Leash provides visibility into and control over every command and network request attempted by the coder agent. Would appreciate any feedback, and will try to get in touch with the author (Gordon).

Now I'll definitely look into automatically supporting pass-through auth for at least gh cli in Leash - always looking for what folks will find useful.

corv · 2025-11-11T02:34:47 1762828487

Interesting! The sandboxing space definitely deserves more attention.

On the other side of the spectrum, we're working on a lightweight approach that augments user namespaces with libseccomp to filter syscalls via BPF.

https://github.com/corv89/shannot

jaytaylor · 2025-11-11T05:47:15 1762840035

Leash does it via eBPF today. Are you open to a collab?

corv · 2025-11-11T06:04:37 1762841077

Absolutely. I’ll send you an email

jaytaylor · 2025-11-10T16:19:41 1762791581

Have you seen Leash?

https://github.com/strongdm/leash

It even has a --darwin macOS-native mode which goes beyond the capabilities and guarantees of sandbox-exec and bubblewrap.

Full-disclosure: I am one of the authors.

jaytaylor · 2025-10-09T23:33:41 1760052821

https://jaytaylor.github.io/hn-live2

Enjoy!

jaytaylor · on Aug 25, 2024

X-Mouse Button Control (XMBC for short) is a handy Windows app to override mouse buttons to do arbitrary other actions on a program/App -specific basis.

https://www.highrez.co.uk/downloads/XMouseButtonControl.htm

However the docs (which are excellent!) were published by the developer as PDF-only, so I did a nice transform to HTML and posted it on my personal website. I will do my best to host it until I'm dead, and also submitted it to TIA:

https://www.highrez.co.uk/downloads/X-Mouse%20Button%20Contr...

https://jaytaylor.com/x-mouse-button-docs/

Forever TIA link: https://web.archive.org/web/20240825203006/https://jaytaylor...

Cheers,

Jay