More

hgl · 2025-06-19T04:46:42 1750308402

It’s fascinating to think about what true GUI for LLM could be like.

It immediately makes me think a LLM that can generate a customized GUI for the topic at hand where you can interact with in a non-linear way.

karpathy · 2025-06-19T04:57:57 1750309077

Fun demo of an early idea was posted by Oriol just yesterday :)

https://x.com/OriolVinyalsML/status/1935005985070084197

spamfilter247 · 2025-06-19T17:54:03 1750355643

My takeaway from the demo is less that "it's different each time", but more a "it can be different for different users and their styles of operating" - a poweruser can now see a different Settings UI than a basic user, and it can be generated realtime based on the persona context of the user.

Example use case (chosen specifically for tech): An IDE UI that starts basic, and exposes functionality over time as the human developer's skills grow.

superfrank · 2025-06-19T08:01:36 1750320096

On one hand, I'm incredibly impressed by the technology behind that demo. On the other hand, I can't think of many things that would piss me off more than a non-deterministic operating system.

I like my tools to be predictable. Google search trying to predict that I want the image or shopping tag based on my query already drives me crazy. If my entire operating system did that, I'm pretty sure I'd throw my computer out a window.

iLoveOncall · 2025-06-19T11:03:15 1750330995

> incredibly impressed by the technology behind that demo

An LLM generating some HTML?

superfrank · 2025-06-19T16:00:39 1750348839

At a speed that feels completely seamless to navigate through. Yeah, I'm pretty impressed by that.

iLoveOncall · 2025-06-20T16:59:00 1750438740

Read the code that is actually being generated. It's only the content of the page, which itself is loaded progressively.

It takes 2 seconds to generate an extremely basic 300 characters page of content. Again, what is impressive here?

It's not fast, it gives the illusion of being fast.

superfrank · 2025-06-21T04:56:21 1750481781

I know what it's doing and I'm impressed. If you understand what it's doing and aren't impressed, that's cool too. I think we just see things differently and I doubt either of us will convince the other one to change their mind on this

hackernewds · 2025-06-19T06:49:37 1750315777

it's impressive but it seems like a crappier UX? that none of the patterns can really be memorized

asterisk_ · 2025-06-20T06:04:19 1750399459

I feel like one quickly hits a similar partial observability problem as with e.g. light sensors. How often do you wave around annoyed because the light turned off.

To get _truly_ self driving UIs you need to read the mind of your users. It's some heavy tailed distribution all the way down. Interesting research problem on its own.

We already have adaptive UIs (profiles in VSC anyone? Vim, Emacs?) they're mostly under-utilized because takes time to setup + most people are not better at designing their own workflow relative to the sane default.

aprilthird2021 · 2025-06-19T06:23:45 1750314225

This is crazy cool, even if not necessarily the best use case for this idea

throwaway314155 · 2025-06-19T21:26:35 1750368395

I would bet good money that many of the functions they chose not to drill down into (such as settings -> volume) do nothing at all or cause an error.

It's a fronted generator. It's fast. That's cool. But is being pitched as a functioning OS generator and I can't help but think it isn't given the failure rates for those sorts of tasks. Further, the success rates for HTML generation probably _are_ good enough for a Holmes-esque (perhaps too harsh) rugpull (again, too harsh) demo.

A cool glimpse into what the future might look like in any case.

superconduct123 · 2025-06-19T19:17:49 1750360669

That looks both cool and infuriating

suddenlybananas · 2025-06-19T07:13:10 1750317190

Having different documents come up every time you go into the documents directory seems hellishly terrible.

falcor84 · 2025-06-19T08:24:06 1750321446

It's a brand of terribleness I've somewhat gotten used to, opening Google Drive every time, when it takes me to the "Suggested" tab. I can't recall a single time when it had the document I care about anywhere close to the top.

There's still nothing that beats the UX of Norton Commander.

sensanaty · 2025-06-19T07:23:23 1750317803

Ah yes, my operating system, most definitely a place I want to stick the Hallucinotron-3000 so that every click I make yields a completely different UI that has absolutely 0 bearing to reality. We're truly entering the "Software 3.0" days (can't wait for the imbeciles shoving AI everywhere to start overusing that dogshit, made-up marketing term incessantly)

dang · 2025-06-19T19:15:51 1750360551

"Please don't fulminate."

"Don't be curmudgeonly. Thoughtful criticism is fine, but please don't be rigidly or generically negative."

"Please don't post shallow dismissals, especially of other people's work. A good critical comment teaches us something."

"Please respond to the strongest plausible interpretation of what someone says, not a weaker one that's easier to criticize."

https://news.ycombinator.com/newsguidelines.html

danielbln · 2025-06-19T07:36:14 1750318574

Maybe we can collect all of this salt and operate a Thorium reactor with it, this in turn can then power AI.

sensanaty · 2025-06-19T07:40:28 1750318828

We'll need to boil a few more lakes before we get to that stage I'm afraid, who needs water when you can have your AI hallucinate some for you after all?

TeMPOraL · 2025-06-19T12:21:49 1750335709

Who needs water when all these hot takes come from sources so dense, they're about to collapse into black holes.

sensanaty · 2025-06-19T13:39:48 1750340388

Is me not wanting the UI of my OS to shift with every mouse click a hot take? If me wanting to have the consistent "When I click here, X happens" behavior instead of the "I click here and I'm Feeling Lucky happens" behavior is equal to me being dense, so be it I guess.

TeMPOraL · 2025-06-19T17:17:16 1750353436

No. But you interpreting and evaluating the demo in question as suggesting the things you described - frankly, yes. It takes a deep gravity well to miss a point this clear from this close.

It's a tech demo. It shows you it's possible to do these things live, in real time (and to back Karpathy's point about tech spread patterns, it's accessible to you and me right now). It's not saying it's a good idea - but there are obvious seeds of good ideas there. For one, it shows you a vision of an OS or software you can trivially extend yourself on the fly. "I wish it did X", bam, it does. And no one says it has to be non-deterministic each time you press some button. It can just fill what's missing and make additions permanent, fully deterministic after creation.

cjcenizal · 2025-06-19T05:00:01 1750309201

My friend Eric Pelz started a company called Malleable to do this very thing: https://www.linkedin.com/posts/epelz_every-piece-of-software...

whatarethembits · 2025-06-20T01:07:36 1750381656

I'm curious where this ends up going.

Personally I think its a mistake; at least at "team" level. One of the most valuable things about a software or framework dictating how things are done is to give a group of people a common language to communicate with and enforce rules. This is why we generally prefer to use a well documented framework, rather than letting a "rockstar engineer" roll their own. Only they will understand its edge cases and ways of thinking, everyone else will pay a price to adapt to that, dragging everyone's productivity down.

Secondly, most people don't know what they want or how they want to work with a specific piece of software. Its simply not important enough, in the hierarchy of other things they care about, to form opinions about how a specific piece of software ought to work. What they want, is the easiest and fastest way to get something done and move on. It takes insight, research and testing to figure out what that is in a specific domain. This is what "product people" are supposed to figure out; not farm it out to individual users.

swader999 · 2025-06-20T04:08:11 1750392491

You bake those rules into the folders in a Claude.md file and it becomes it's guide when building or changing anything. Ubiquitous language and all that Jazz.

jonny_eh · 2025-06-19T06:22:25 1750314145

An ever-shifting UI sounds unlearnable, and therefore unusable.

dang · 2025-06-19T06:24:18 1750314258

It wouldn't be unlearnable if it fits the way the user is already thinking.

guappa · 2025-06-19T09:20:32 1750324832

AI is not mind reading.

dang · 2025-06-19T18:16:39 1750356999

Behavioral patterns are not unpredictable. Who knows how far an LLM could get by pattern-matching what a user is doing and generating a UI to make it easier. Since the user could immediately say whether they liked it or not, this could turn into a rapid and creative feedback loop.

kevinventullo · 2025-06-20T06:56:15 1750402575

So, if the user likes UI’s that don’t change, the LLM will figure out that it should do nothing?

One problem LLM’s don’t fix is the misalignment between app developers’ incentives and users’ incentives. Since the developer controls the LLM, I imagine that a “smart” shifting UI would quickly devolve into automated dark patterns.

dang · 2025-06-20T22:26:44 1750458404

A user who doesn't want such changes shouldn't be subjected to them in the first place, so there should be nothing for an LLM to figure out.

I'm with you on disliking dark patterns but it seems to me a separate issue.

NitpickLawyer · 2025-06-19T14:07:34 1750342054

A sufficiently advanced prediction engine is indistinguishable from mind reading :D

OtherShrezzing · 2025-06-19T07:19:28 1750317568

A mixed ever-shifting UI can be excellent though. So you've got some tools which consistently interact with UI components, but the UI itself is altered frequently.

Take for example world-building video games like Cities Skylines / Sim City or procedural sandboxes like Minecraft. There are 20-30 consistent buttons (tools) in the game's UX, while the rest of the game is an unbounded ever-shifting UI.

skydhash · 2025-06-19T13:08:01 1750338481

The rest of the game is very deterministic where its state is controlled by the buttons. The slight variation is caused by the simulation engine and follows consistent patterns (you can’t have building on fire if there’s no building yet).

9rx · 2025-06-19T13:53:00 1750341180

Tools like v0 are a primitive example of what the above is talking about. The UI maintains familiar conventions, but is laid out dynamically based on surrounding context. I'm sure there are still weird edge cases, but for the most part people have no trouble figuring out how to use the output of such tools already.

sotix · 2025-06-19T11:56:44 1750334204

Like Spotify ugh

dpkirchner · 2025-06-19T04:51:50 1750308710

Like a HyperCard application?

necrodome · 2025-06-19T05:31:02 1750311062

We (https://vibes.diy/) are betting on this

diggan · 2025-06-19T11:06:19 1750331179

Border-line off-topic, but since you're flagrantly self-promoting, might as well add some more rule breakage to it.

You know websites/apps who let you enter text/details and then not displaying sign in/up screen until you submit it, so you feel like "Oh but I already filled it out, might as well sign up"?

They really suck, big time! It's disingenuous, misleading and wastes people's time. I had no interest in using your thing for real, but thought I'd try it out, potentially leave some feedback, but this bait-and-switch just made the whole thing feel sour and I'll probably try to actively avoid this and anything else I feel is related to it.

necrodome · 2025-06-19T16:24:23 1750350263

Thanks for the benefit of the doubt. I typed that in a hurry, and it didn’t come out the way I intended.

We had the idea that there’s a class of apps [1] that could really benefit from our tooling - mainly Fireproof, our local-first database, along with embedded LLM calling and image generation support. The app itself is open source, and the hosted version is free.

Initially, there was no login or signup - you could just generate an app right away. We knew that came with risks, but we wanted to explore what a truly frictionless experience could look like. Unfortunately, it didn’t take long for our LLM keys to start getting scraped, so the next best step was to implement rate limiting in the hosted version.

[1] https://tools.simonwillison.net/

diggan · 2025-06-19T17:24:13 1750353853

My complaint isn't about that you need to protect it with a login/signup, but where in the process you put that login/signup.

Put it before letting people enter text, rather than once they've entered text and pressed the button, and people won't feel mislead anymore.

jchrisa · 2025-06-19T23:27:11 1750375631

The generation is running while you login, so this appreciable decreases wait time from idea to app, because by the time you click through the login, your app is ready. (Vibes DIY CEO here.)

If login takes 30 seconds, and app gen 90, we think this is better for users (but clearly not everyone agrees.) Thanks for the feedback!

stoisesky · 2025-06-19T09:24:31 1750325071

This talk https://www.youtube.com/watch?v=MbWgRuM-7X8 explores the idea of generative / malleable personal user interfaces where LLMs can serve as the gateway to program how we want our UI to be rendered.

nbbaier · 2025-06-19T04:50:32 1750308632

I love this concept and would love to know where to look for people working on this type of thing!

stuartmemo · 2025-06-19T15:15:54 1750346154

It's probably Jira. https://medium.com/question-park/all-aboard-the-ai-train-b03...

semi-extrinsic · 2025-06-19T09:04:16 1750323856

Humans are shit at interacting with systems in a non-linear way. Just look at Jupyter notebooks and the absolute mess that arises when you execute code blocks in arbitrary order.

bicepjai · 2025-06-22T01:57:48 1750557468

What is the mess you are referring with regards to Jupyter notebooks ?

semi-extrinsic · 2025-06-22T18:42:00 1750617720

If you run cells out of order, you get weird results. Thus you have efforts like marimo which replace jupyter with something that reruns all dependent cells.

hgl · 2025-06-01T08:39:13 1748767153

Not to criticize the article, which is very well written, just some extra info:

It seems for the author, the custom installer is mainly used for accepting user SSH public key, terminfo, and maybe also locale.

Almost none of the packages the author listed get used, including zsh. Since NixOS is installed via nixos-anywhere, it runs a bash script to do everything, and all the script's dependencies will be pulled by nix.

For people who don't want to build a custom installer, or their cloud environment doesn't allow one, you can simply host a script somewhere and download and run it on the remote machine to add your SSH public key and other customizations, including partitioning the disk.

Note that the author used disko to partition the disk declaratively. Disko won't work for a machine with very limited ram, because disko runs in the installer, and needs to install tools to the ram to do the partition.

I wrote a nix configuration library[1] that also does NixOS installation (uses nixes-anywhere under the hood), where you can choose between using disko, a default script[2] that handles 90% of the use cases (using only the default tools available on a vanilla NixOS installer, so nothing gets installed to the ram), or your own script.

[1] https://github.com/hgl/nixverse

[2] https://github.com/hgl/nixverse/blob/main/load/partitionScri...

secure · 2025-06-01T08:46:49 1748767609

> Almost none of the packages the author listed get used, including zsh

Just to clarify: the point of having packages like lshw and zsh available is not for the case of performing the automated installation (where, yes, they are not used), but for the case where I want to interactively poke around in a booted installer to inspect the target system.

hgl · 2025-06-01T08:55:10 1748768110

That's fair, having a remote shell environment that you feel comfortable to poke around is pretty great.

For git, you commented "for checking out github.com/stapelberg/configfiles". I wonder if you sometimes install NixOS locally from the installer? If so, I can understand having those packages around can be very useful.

secure · 2025-06-01T09:09:05 1748768945

Yes, I use this same config snippet throughout my installs and haven’t gotten around to managing my home with Nix yet.

Later, I refactored this config snippet into a Flake that I include: https://github.com/stapelberg/nix/ …but that’s for a follow-up blog post :)

dicytea · 2025-06-01T08:58:05 1748768285

> Note that the author used disko to partition the disk declaratively. Disko won't work for a machine with very limited ram, because disko run in the installer, and needs to install tools to the ram to do the partition.

This is only true if you use the disko-install tool, which is a horrible footgun[^1]. The safest approach is to just use the default disko command, then nixos-install.

[^1]: https://github.com/nix-community/disko/issues/947

hgl · 2025-06-01T09:26:36 1748769996

Thanks for bringing the disko command to my attention.

However, since we are talking about installing NixOS declaratively, and it's done through nixos-anywhere, which will install it[0] to the ram unfortunately.

[0]: https://github.com/nix-community/nixos-anywhere/blob/abb0d72...

eptcyka · 2025-06-01T10:02:56 1748772176

The beauty of nix is that you can trivially build a custom installer - I do it all the time, for each different host. Since it is simple to do, you can choose to make trivial changes. You do not have to, but you definitely can.

hgl · 2025-05-09T06:26:50 1746772010

Do you happen to have links to such discussions and analyses?

hgl · 2025-05-09T06:01:46 1746770506

It seems Reddit also punishes ban avoiding. If they find a way to associate my banned accounts with a preprovisioned account, that account can also get banned?

I don't know how they do it, but I have tried using a completely new IP and private browsing, getting a new account, and posting in r/CatAdvice that r/NewToReddit says is newcomer friendly, but still got shadow banned.

hgl · on July 12, 2023

> ... teenager asks why they need to learn calculus

> But if we avoid hard things

I don't see how you can justify the former by arguing the latter. These two are orthogonal. If I were that teenager, I think what I really would want to ask is that why it has to be calculus instead of some other things that is also hard but with obvious real world application like writing a small 3D game engine.

And my answer to that question is you probably shouldn't if your were in an ideal education system. You would be taught what interesting interactions you could have with the physical world, and be induced to discover calculus or some other math tools that helps you understand how the interactions really work and demonstrates you really need such tools. You're more likely to grasp them when you're driven by curiosity.

hgl · on July 3, 2023

I wonder if Google is Xerox of our era, developing promising technologies only to be made practical and commercialized by other entities? Transformer is one example. Not sure if recent advancement in quantum computing is another.

hgl · on June 5, 2023

This seems to be another iPhone moment, but I wonder what’s its killer feature? iPhone had the killer feature of phone calls, so everyone has a reason to buy one, I can’t come up with any for AR.

Don’t get me wrong, I’m actually incredibly excited about AR, I just can’t imagine how it becomes mainstream. It can of course be mainstream if it’s just like glasses and has all day battery, but it still seems pretty far away.

hgl · on May 27, 2023

Many people seem to be unenthusiastic about it because the limitation on query strings are usually large enough.

I personally like this addition, because it no longer requires all queries be shoehorned into query strings. You can use any syntax you like, be it SQL or GraphQL etc.

hgl · on May 30, 2021

This is akin to 2D beings saying time is the third dimension and being satisfied with the conclusion and happily living in their 2D world.

hgl · on April 28, 2020

This seems like a pretty comprehensive book. Will check out. Thanks.