More

kvdveer · 2026-03-04T07:56:23 1772610983

Two things are holding back current LLM-style AI of being of value here:

* Latency. LLM responses are measured in order of 1000s of milliseconds, where this project targets 10s of milliseconds, that's off by almost two orders of magnitute.

* Determinism. LLMs are inherently non-deterministic. Even with temperature=0, slight variations of the input lead to major changes in output. You really don't want your DB to be non-deterministic, ever.

qeternity · 2026-03-04T09:49:19 1772617759

> LLMs are inherently non-deterministic.

This isn't true, and certainly not inherently so.

Changes to input leading to changes in output does not violate determinism.

magicalhippo · 2026-03-04T11:35:34 1772624134

> This isn't true

From what I understand, in practice it often is true[1]:

Matrix multiplication should be “independent” along every element in the batch — neither the other elements in the batch nor how large the batch is should affect the computation results of a specific element in the batch. However, as we can observe empirically, this isn’t true.

In other words, the primary reason nearly all LLM inference endpoints are nondeterministic is that the load (and thus batch-size) nondeterministically varies! This nondeterminism is not unique to GPUs — LLM inference endpoints served from CPUs or TPUs will also have this source of nondeterminism.

[1]: https://thinkingmachines.ai/blog/defeating-nondeterminism-in...

qeternity · 2026-03-04T18:15:46 1772648146

Yes, lots of things can create indeterminism. But nothing is inherent.

yomismoaqui · 2026-03-04T11:37:58 1772624278

Quoting:

"But why aren’t LLM inference engines deterministic? One common hypothesis is that some combination of floating-point non-associativity and concurrent execution leads to nondeterminism based on which concurrent core finishes first."

From https://thinkingmachines.ai/blog/defeating-nondeterminism-in...

qeternity · 2026-03-04T18:15:52 1772648152

Yes, lots of things can create indeterminism. But nothing is inherent.

simonask · 2026-03-04T08:00:42 1772611242

> 1000s of milliseconds

Better known as "seconds"...

olau · 2026-03-04T08:09:28 1772611768

The suggestion was not to use an LLM to compile the expression, but to use an LLM to build the compiler.

kvdveer · 2026-02-28T18:55:36 1772304936

How do you propose we measure signal? Lines of code is renowned for being a very bad measure of anything, and I really can't come up with anything better.

politician · 2026-02-28T20:23:29 1772310209

The OP said that they kept what they liked and discarded the rest. I think that's a reasonable definition for signal; so, the signal-to-token ratio would be a simple ratio of (tokens committed)/(tokens purchased). You could argue that any tokens spent exploring options or refining things could be signal and I would agree, but that's harder to measure after the fact. We could give them a flat 10x multiplier to capture this part if you want.

mirekrusin · 2026-02-28T21:56:36 1772315796

I'm going to call it out as bullshit, you can't dig out "what you like" from "hundreds agents running all the time".

_pdp_ · 2026-02-28T22:33:16 1772317996

One of our projects has 1.2K open pull requests.

https://i.postimg.cc/Jnfk9b8g/Xnapper-2026-02-28-22-25-42.pn...

We probably accept 1-2 per day.

I personally discard code for the tiniest of reasons. If something feels off moments after I open the PR, it gets deleted. The reason we still have 1.2K open PRs is because we can't review all of them in time.

The most likely solution is to delete all of them after a month or two. By that time the open PRs on this project alone will be at least 10-20 more.

mirekrusin · 2026-03-01T04:42:16 1772340136

Doesn't seem like too efficient process, no? Seems to me like investment in better quality of the output is exactly what is needed here, wouldn't you agree?

larusso · 2026-03-01T07:04:46 1772348686

I feel they sit of on the opposite end of the OP here. One wants to write out specs to control the agent implementation to achieve a one shot execution. Other side says: let’s won’t waste time of humans writing anything.

I’m personally torn. A lot of the spec talk and now here in combination with TDD etc feels like the pipe dreams of the mid 2000. There was this idea of the Architect role who writes UML and specs. And a normal engineer just fills in the gaps. Then there was TDD. Nothing against it personally. But trying to write code in test first approach when you don’t really have a clue how a specific platform/system/library works had tons of overhead. Also the side effect of code written in the most convenient way to be tested and not to be executed. All in all to throw this ideas together for AI now… But throwing tokens out of the window and hoping for the token lottery to generate the best PR is also not the right direction in my book. But somebody needs to investigate in both extremes I say.

mirekrusin · 2026-03-01T10:24:13 1772360653

Actually, nobody said the spec needs to be written by humans.

My personal opinion: with today's LLMs, the spec should be steered by a human because its quality is proportional to result quality. Human interaction is much cheaper at that stage — it's all natural language that makes sense. Later, reasoning about the code itself will be harder.

In general, any non-trivial, valuable output must be based on some verification loop. A spec is just one way to express verification (natural language — a bit fuzzy, but still counts). Others are typecheckers, tests, and linters (especially when linter rules relate to correctness, not just cosmetics).

Personally, on non-trivial tasks, I see very good results with iterative, interactive, verifiable loops:

- Start with a task

- Write spec in e.g. SPEC.md → "ask question" until answer is "ok"/proceed

- Write implementation PLAN.md — topologically sorted list of steps, possibly with substeps → ask question

- For each step: implement, write tests, verify (step isn't done until tests pass, typecheck passes, etc.); update SPEC/PLAN as needed → ask question

- When done, convert SPEC.md and PLAN.md into PR description (summary) and discard

("Ask question" means an interactive prompt that appears for the user. Each step is gated by this prompt — it holds off further progress, giving you a chance to review and modify the result in small bits you can actually reason about.) The workflow: you accept all changes before confirming the next step. This way you get code deltas that make sense. You can review and understand them, and if something's wrong you can modify by hand (especially renames, which editors like VS Code handle nicely) or prompt for a change. The LLM is instructed to proceed only when the re-asked answer is "ok".

This works with systems like VSCode Copilot, not so much with CC cli.

I'm looking forward to an automated setup where the "human" is replaced by an "LLM judge" — I think you could already design a fairly efficient system like this, but for my work LLMs aren't quite there yet.

That said, there's an aspect that shouldn't be forgotten: this interactive approach keeps you in the driving seat and you know what's happening with the codebase, especially if you're running many of these loops per day. Fully automated solutions leave you outside the picture. You'll quickly get disconnected from what's going on — it'll feel more like a project run by another team where you kind of know what it does on the surface but have no idea how. IMO this is dangerous for long-term, sustainable development.

kvdveer · 2026-02-20T07:40:09 1771573209

From my experience, LLMs understand prompt just fine, even if there are substantial typos or severe grammatical errors.

I feel that prompting them with poor language will make them respond more casually. That might be confirmation bias on my end, but research does show that prompt language affects LLM behavior, even if the prompt message doesn't change/

kvdveer · 2026-02-14T13:15:31 1771074931

Infinitief scrolling is only mentioned in the title. The actual legislation focuses on addictive patterns of which infinite scroll is just one. The exact formulation will of course matter a lot, but it will not simply be banning infinite scroll, as that would be trivial to circumvent.

gunapologist99 · 2026-02-14T13:25:55 1771075555

Fair point, but such regulatory ambiguity in the U.S. would likely be unconstitutionally vague:

https://en.wikipedia.org/wiki/Vagueness_doctrine#Unconstitut...

The article reads as if the EC chose one or a few apps that they didn't like, and then wrote a regulation based on that app's key features.

kvdveer · 2026-02-11T17:51:13 1770832273

It is not immune, but it limits #1 and #2.

The "lethal trifecta" is a limited view on security, as it's mostly concerned with leaking data. This solution focuses on a different aspect: the ability of rogue actions (instead of rogue communications per #3).

kvdveer · 2026-02-09T19:48:40 1770666520

None of those documents reliably state my city of residence. At best they document where I once lived, but not even that is guaranteed.

xnyan · 2026-02-10T00:09:56 1770682196

Not updating your DL after changing your address is a crime* in all US states. I'm not as familiar with law elsewhere, but would be surprised if that's not true most other places.

*There are exceptions for active duty military personal and other limited exceptions.

VohuMana · 2026-02-10T08:29:08 1770712148

It is a law but rarely enforced, also some places like Washington are primarily digital meaning you update your DL address online but they don’t print a new ID unless you request it or your DL is expired

tosti · 2026-02-10T03:10:27 1770693027

That's pathetic. It would mean you can't live anywhere without a street address, such as a camp site or a ship. You also can't be a nomad.

So much for "land of the free".

mcmcmc · 2026-02-10T03:45:34 1770695134

Unless you’re wild camping, campsites have addresses. So do marinas where a ship would need to be docked more or less regularly to establish residency.

As for being a nomad, you don’t need a driver’s license or any kind of ID to wander if you’re willing to sleep rough. If you want to drive on public roadways though, you better have a primary address where the courts can send someone if you kill someone in a traffic accident and bail.

tosti · 2026-02-10T04:38:43 1770698323

Docking is expensive, so no. It's also only needed once per 5 years or so for maintenance.

Government fining you a ticket doesn't mean your address has to be on the drivers license. They could register the number plate to an SSN for instance.

mcmcmc · 2026-02-10T17:16:40 1770743800

Did you skip my last sentence? A traffic ticket is not the worst thing you can do in an automobile. And not everyone eligible for a drivers license will have an SSN.

tosti · 2026-02-11T05:32:11 1770787931

Bailing from a traffic incident is a crime itself. Good luck getting away with that.

And why would someone not get their SSN if they're old enough to drive?

smcin · 2026-02-13T04:17:04 1770956224

- some resident aliens (if not authorized to work), B-1/B-2 Visitors, WB/WT (Visa Waiver Program), nonresident aliens

- their spouses and dependents e.g. F-2 Dependents, J-2 Dependents, H-4 Dependents: of H-1B, H-2A, H-2B, or H-3 visa holders.

- Undocumented immigrants: Individuals without lawful status who have a US tax filing requirement.

Read "Who's eligible for an ITIN" https://www.irs.gov/tin/itin/individual-taxpayer-identificat...

AlotOfReading · 2026-02-10T06:55:18 1770706518

For most situations where you don't have a permanent address, your address is either the place where you receive mail or the courthouse.

direwolf20 · 2026-02-10T10:41:40 1770720100

And if you don't have a place to receive mail?

1718627440 · 2026-02-10T16:15:21 1770740121

Receiving mail is a requirement for participating in the legal system, which is a requirement for citizens.

direwolf20 · 2026-02-10T16:55:30 1770742530

Laws of the government can't override laws of physics. If you don't have a place where you can receive mail, do they just arrest you or what? Do they assign a PO box to you?

1718627440 · 2026-02-11T08:14:43 1770797683

If you fail to comply with legally delivered court messages (to your registered mail address), then yes the police is going to fetch you.

jcgrillo · 2026-02-10T04:14:32 1770696872

In some places in the U.S. it is (or at least was until recently) illegal to be healthy and unemployed.

direwolf20 · 2026-02-10T10:41:17 1770720077

America is one of the least free countries — they think they're free because the guns=free principle is drilled into them since birth, but it's a lie.

malfist · 2026-02-09T22:19:39 1770675579

You are legally required to update those within 10 days of moving.

kvdveer · 2026-01-27T17:09:01 1769533741

This is especially true if the marketing team claims that humans were validating every step, but the actual humans did not exist or did no such thing.

If a marketer claims something, it is safe to assume the claim is at best 'technically true'. Only if an actual engineer backs the claim it can start to mean something.

kvdveer · 2026-01-26T14:28:27 1769437707

There's a daily token limit. While I've never run into that limit while operating Claude as a human, I have received warnings that I'm getting close. I imagine that an unattended setup will blow through the token limit in not too much time.

kvdveer · 2026-01-14T08:19:07 1768378747

> How close are we to smart dust I wonder? How small can we make wireless communications?

There's two limiting factors for 'smart dust': power (batteries are the majority weight and volume of this vape), and antennae (minimum size determined by wavelength of carrier wave).

I believe you can fit an NFC module in a 5x5mm package, but that does externalize the power supply.

cruffle_duffle · 2026-01-14T15:49:45 1768405785

We are going to have to rethink power for smart dust. Like consider that no creature out there is powered by batteries. From the biggest land animal to the smallest microbe it’s all chemistry.

Maybe the smart dust will have to eat microbes and stuff to stay active.

As for communication, we can’t go shoving antennas in them as then they’d be larger than dust. And you can’t use the optical part of the spectrum because of interference with basically everything. You can’t use wavelengths smaller either as you get into UV and high radiation. There is the terahertz radio spectrum [0] between 3mm and 30um that is pretty open and not utilized at all because we haven’t figured out how to make good transmitters. Plus the spectrum isn’t very useful as it isn’t very penetrating and water vapor absorbs it… and it requires lots of power.

Smart dust might have to be more of a distributed computer or something. Or a micro machine that uses chemistry and mechanical magic to do its operations.

[0] https://en.wikipedia.org/wiki/Terahertz_radiation

TeMPOraL · 2026-01-14T20:51:19 1768423879

> From the biggest land animal to the smallest microbe it’s all chemistry.

Batteries are chemistry. ATP is a chemical battery.

The difference between living things and our machines is primarily in manufacturing methods: we do things in bulk, because we reach from the top with crude, meter-scale tools; nature glues things up from lots of tiny biomolecular nanomachines, and each of those tiny machines has to carry its own power source!

Still, it's highly likely that any form of "smart dust" will resemble living cells as much as, or even more so, it will resemble miniature devices we build today, simply because that's the kind of chemistry that's efficient at smaller scales.

slow_typist · 2026-01-14T09:19:48 1768382388

RFID tags are powered wirelessly, one could imagine powering smaller particles when operating on higher frequencies (RFID is on 13.something MHz requiring relatively large coils). A directional antenna could send a pulsed beam to power a subset of the particles in the area and afterwards receive their signals.

regularfry · 2026-01-14T10:21:23 1768386083

It needs to be in the infrared spectrum at least to be useful for smart dust, otherwise the package size is still dominated by the size of the antenna. Even mm-wave radar is marginal here.

volemo · 2026-01-14T11:51:51 1768391511

So... smart dust powered by the sun? Cool!

slow_typist · 2026-01-14T12:48:27 1768394907

Okay if you take dust literally. The important part is that the particles fly. Like dandelion seeds.

DANmode · 2026-01-14T17:23:42 1768411422

> There's two limiting factors for 'smart dust': power

RFID is historically powered by one of three methods,

one of which is completely wireless/battery-free.

kvdveer · 2025-12-02T08:28:12 1764664092

Linux requires root for raw sockets, which _can_ be used to send pings, but also numerous other things.

The trick used here only allows pings. This trick is gated behind other ACLs.