
> Blake embodies the “bits to atoms” shift underway in America. Before founding Boom, he was designing internet coupons for Groupon.

What is this? I can't easily find the meaning of "bits to atoms." Does this mean the US is moving away from digital "exports"?


No, the meaning is that there's a wave of hardware innovators who started as software innovators. Musk (PayPal) is the most obvious example. The Boom CEO featured here is another. There are a fair bunch more; I agree with the author that a weirdly high percentage of founders of cool hardware / deeptech companies have a software engineering background. Like, you'd expect that space to be dominated by mechanical engineers, electrical engineers, physicists, etc., but somehow it isn't.


Because people who led software companies (which have insanely high margins) have access to the vast capital needed for serious heavy hardware development. They then hire the engineers they need to succeed.


Yep. I suspect they're also better at getting warm intros to and selling their vision to Silicon Valley CEOs than your average aerospace engineer with a dream.


From virtual goods to physical goods I assume.

Never heard it before either, so you're not alone.

EDIT: the reference to America is, again I assume, about the trend of bringing manufacturing back to the US, mainly from China.


It's cool, but I genuinely cannot fathom why they are targeting natural language proofs instead of a proof assistant.


Well they do that too: https://huggingface.co/deepseek-ai/DeepSeek-Prover-V2-671B

But I suppose the bigger goal remains improving their language model, and this was an experiment born from that. These works are symbiotic; the original DeepSeekMath resulted in GRPO, which eventually formed the backbone of their R1 model: https://arxiv.org/abs/2402.03300


The "obvious" thing to try, which presumably some people are trying pretty hard right now[1], is to (1) use a mathematically-tuned LLM like this one to propose informal Next Things To Try, (2) use an LLM (possibly the same LLM) to convert those into proof assistant formalism, (3) use the proof assistant to check whether what the LLM has suggested is valid, and (4) hook the whole thing together to make a proof-finding-and-verifying machine that never falsely claims to have proved something (because everything goes through that proof assistant) and therefore can tolerate confabulations from LLM #1 and errors from LLM #2 because all those do is waste some work.

[1] IIRC, AlphaProof is a bit like this. But I bet that either there's a whole lot of effort on this sort of thing in the major AI labs, or else there's some good reason to expect it not to work that I haven't thought of. (Maybe just the "bitter lesson", I guess.)
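
In case it helps make that shape concrete, here's a very rough Python sketch of the loop. The propose_step/formalize helpers are hypothetical stand-ins for the two LLM calls (not any real API), and the only trusted component is the proof assistant check, which here just shells out to the lean CLI and looks at the exit code (assuming Lean 4 is on PATH and a failed proof exits non-zero):

    import subprocess, tempfile, pathlib

    def propose_step(goal: str, history: list[str]) -> str:
        """Hypothetical LLM #1: suggest an informal Next Thing To Try."""
        raise NotImplementedError

    def formalize(informal_step: str) -> str:
        """Hypothetical LLM #2: translate the suggestion into Lean source."""
        raise NotImplementedError

    def lean_accepts(lean_source: str) -> bool:
        # The only trusted component: accept iff the proof assistant accepts.
        with tempfile.TemporaryDirectory() as d:
            path = pathlib.Path(d) / "Attempt.lean"
            path.write_text(lean_source)
            return subprocess.run(["lean", str(path)]).returncode == 0

    def search(goal: str, budget: int = 100) -> str | None:
        history: list[str] = []
        for _ in range(budget):
            step = propose_step(goal, history)   # may confabulate -- fine
            candidate = formalize(step)          # may mistranslate -- also fine
            if lean_accepts(candidate):
                return candidate                 # checked, so never a false claim
            history.append(step)                 # wasted work, nothing worse
        return None

The structure is exactly the point above: errors from either LLM only cost wasted attempts, never a false "proved".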

It would doubtless be challenging to get such a system to find large difficult proofs, because it's not so easy to tell what's making progress and what isn't. Maybe you need LLM #3, which again might or might not be the same as the other two LLMs, to assess what parts of the attempt so far seem like they're useful, and scrub the rest from the context or at least stash it somewhere less visible.

It is, of course, also challenging for human mathematicians to find large difficult proofs, and one of the reasons for them is that it's not so easy to tell what's making progress and what isn't. Another major reason, though, is that sometimes you need a genuinely new idea, and so far LLMs aren't particularly good at coming up with those. But a lot of new-enough-ideas[2] are things like "try a version of this technique that worked well in an apparently unrelated field", which is the kind of thing LLMs aren't so bad at.

[2] Also a lot of the new-enough-ideas that mathematicians get really happy about. One of the cool things about mathematics is the way that superficially-unrelated things can turn out to share some of their structure. If LLMs get good at finding that sort of thing but never manage any deeper creativity than that, it could still be enough to produce things that human mathematicians find beautiful.


Natural language is a lot more, well, readable than, say, Lean. With Lean output you get a lot less intuition and understanding of what the model is attempting to do in the first place.


More training data on advanced math. Lean is cool, but it's mostly about formalizing stuff we already know.


Ok I guess I could have told you that. What I really meant is that in the future where LLMs are doing new math (which I'm skeptical of, but I digress) I would not trust any of it unless it was formally verified.


If you read the paper, that is the intention: to guide stuff like Lean.

I don't think an LLM is a great fit for pure RLVR.


I think there's a lot of baggage in doing it in Lean: where the libraries are at currently, how things are implemented, which things are not implemented, etc. But it still remains to be seen what wins (my money would be on informal).


Does anyone have a similar article with more detail? I don't quite want to read the datasheet of your favorite microprocessor, but I would like a decent amount more detail than what's provided. Especially before UEFI/BIOS.


At a very high level, the processor initialization procedure is similar to that of most microcontrollers:

- processor is powered up with RESET pin asserted, making sure that all circuits are in a known-safe state.

- when RESET is deasserted, all autonomous circuits (e.g. core control units, cache controllers, ALUs) go through a hardware setup phase.

- the processor accesses the boot firmware according to the platform design. There are many options here, but the three most common are 1) the boot ROM is permanently mapped to a dedicated hardware address, and the processor uses its memory controller to access that memory (see the toy sketch after this list); 2) the boot ROM is directly connected to the processor via dedicated I/O pins; 3) the initial boot code is located on the processor itself (factory-programmed EEPROM).

- if the boot ROM is not mapped to a memory address, the data is loaded into CPU cache (external RAM needs to be initialized first, and that task is usually performed by the firmware).

- the primary CPU core starts executing boot code (UEFI/BIOS/etc) from said predefined hardware address. Other cores (and processors) are usually left uninitialized throughout the startup process, and it is left to the operating system to bring up the remaining parts of the system.
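
A toy sketch of option 1, hedged: the reset vector address is the conventional x86 one, but the window size and the "decoder" below are made up for illustration, standing in for chipset logic:

    # Boot ROM mapped to a fixed window at the top of the physical address
    # space, so the very first instruction fetch (x86's reset vector at
    # 0xFFFFFFF0) hits ROM before any RAM has been initialized.
    BOOT_ROM_BASE = 0xFFFF_0000              # hypothetical 64 KiB ROM window
    boot_rom = bytes(0x1_0000)               # stand-in for the flash contents

    def read_physical(addr: int, ram: bytearray) -> int:
        if addr >= BOOT_ROM_BASE:            # decoded by the chipset, not RAM
            return boot_rom[addr - BOOT_ROM_BASE]
        return ram[addr]                     # only meaningful once DRAM is up

    RESET_VECTOR = 0xFFFF_FFF0
    first_byte = read_physical(RESET_VECTOR, bytearray())  # served from ROM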

Specifically for Intel, see https://binarydebt.wordpress.com/2018/10/06/how-does-an-x86-... . Or see https://electronics.stackexchange.com/a/726918 and its comments to get an idea of the differences between platforms.


Rice's theorem is about decidability, not difficulty. But you are right that, assuming P != NP, there is no worst-case-efficient algorithm for SAT (and other NP-hard constraint) solving.


I'm surprised to hear this. Modern SAT solvers can easily handle many problems with hundreds of thousands of variables and clauses. Of course, there are adversarial problems where CDCL solvers fail, but I would be fascinated if you can find industrial (e.g. human written for a specific purpose) formulas with "dozens of variables" that a solver can't solve fairly quickly.
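
For a sense of what that looks like in practice, here's a tiny toy example using the python-sat (pysat) bindings, assuming pip install python-sat; the formula is trivial, but the same add_clause/solve interface is what you feed formulas with hundreds of thousands of clauses through:

    # (x1 or not x2) and (not x1 or x2) and (x1 or x2)
    from pysat.solvers import Glucose3

    with Glucose3() as solver:
        solver.add_clause([1, -2])
        solver.add_clause([-1, 2])
        solver.add_clause([1, 2])
        print(solver.solve())      # True  -> satisfiable
        print(solver.get_model())  # e.g. [1, 2], i.e. x1 = x2 = True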


One thing that I spent a particularly long time trying to get working was learning near-minimum-size exact (non-probabilistic) DFAs from positive and negative samples. DFAMiner [0] has a relatively concise formulation for this in terms of variables and clauses, though I have no way to know if some other reformulation would be better suited for SAT solvers (it uses CaDiCaL by default).

It usually starts taking a few seconds around the ~150-variable mark, and hits the absolute limit of practicality by 350–800 variables; the number of clauses is only an order of magnitude higher. Perhaps something about the many dependencies in a DFA graph puts this problem near the worst case.

The annoying thing is, there do seem to be heuristics people have written for this stuff (e.g., in FlexFringe [1]), but they're all geared toward probabilistic automata for anomaly detection and similar fuzzy ML stuff, and I could never figure out how to get them to work for ordinary automata.

In any case, I eventually figured out that I could get a rough lower bound on the minimum solution size, by constructing a graph of indistinguishable strings, generating a bunch of random maximal independent sets, and taking the best of those. That gave me an easy way to filter out the totally hopeless instances, which turned out to be most of them.

[0] https://github.com/liyong31/DFAMiner

[1] https://github.com/tudelft-cda-lab/FlexFringe
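
For what it's worth, the lower-bound trick from the last paragraph is only a few lines with networkx; the sketch below assumes you have already built the graph whose edges connect indistinguishable string pairs:

    # Vertices are sample strings; an edge means the pair is indistinguishable
    # given the data. An independent set is then a set of pairwise
    # distinguishable strings, each needing its own state, so its size is a
    # lower bound on the minimum DFA size.
    import networkx as nx

    def dfa_size_lower_bound(indist_graph: nx.Graph, trials: int = 200) -> int:
        best = 0
        for seed in range(trials):
            mis = nx.maximal_independent_set(indist_graph, seed=seed)
            best = max(best, len(mis))
        return best

Any maximal independent set is a valid witness, so a handful of random restarts is enough to get a usable bound.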


I'm fascinated that Epic Games are the authors. Does anyone know what their motivation for this research would be?


The original claim was to create a new programming language for developing games.

Everything that has come out of Verse so far has been "oh look, I designed this language I wanted to design and the only relation it has to gaming is the title of the paper"

We don't even know if it's a good programming language for anything, as it doesn't exist outside Unreal Editor for Fortnite.


Reinventing the programming paradigm has been Tim Sweeney's hobby horse since before 2006: https://groups.csail.mit.edu/cag/crg/papers/sweeney06games.p...

After Epic's wild success, he's now got a bit of cash to splash on making his dream a reality :-)


Yeah, Sweeney was really active on the programming language theory site http://lambda-the-ultimate.org/

It looks like it's currently down (not sure if a glitch or what).


Tim Sweeney & Epic are creating the language Verse, whose functional logic underpinnings are meant to make concurrency & correctness easier in Fortnite. https://youtu.be/ub-dls31Pyw

