More

thinkling · 2026-02-17T19:11:04 1771355464

For comparisonI think the current leader in pelican drawing is Gemini 3 Deep Think:

https://bsky.app/profile/simonwillison.net/post/3meolxx5s722...

konart · 2026-02-17T19:34:12 1771356852

My take (also Gemini 3 Deep Think): https://gemini.google.com/share/12e672dd39b7

Somehow it's much better now.

jazzyjackson · 2026-02-17T19:51:38 1771357898

I’m not familiar with Gemini, isn’t this just a diffusion model output? The Pelican test is for the llm to produce SVG markup.

konart · 2026-02-17T20:01:04 1771358464

Yeah, I was so amazed by the result I didn't even realize Gemini used Nano Banana while producing the result.

kingbob000 · 2026-02-17T23:38:38 1771371518

Is that actually better? That pelican has arms sprouting out of its wings

thinkling · 2026-02-16T19:08:29 1771268909

You can see up-thread that the same model will produce different answers for different people or even from run to run.

That seems problematic for a very basic question.

Yes, models can be harnessed with structures that run queries 100x and take the "best" answer, and we can claim that if the best answer gets it right, models therefore "can solve" the problem. But for practical end-user AI use, high error rates are a problem and greatly undermine confidence.

thinkling · 2026-02-09T21:10:39 1770671439

Most importantly, Slack limits the amount of message history you get to keep if you’re not paying. And the payment plans are per-user fees which quickly becomes non-viable for non-commercial use.

thinkling · 2026-02-04T22:10:26 1770243026

Ideally, ethical buyers would cause the market to line up behind ethical products. For that to be possible, we have to have choices available to us. Seems to me Anthropic is making such a choice available to see if buyers will line up behind it.

fogzen · 2026-02-05T00:46:48 1770252408

“Ideally” is doing a lot of heavy lifting here.

thinkling · 2026-01-28T17:33:56 1769621636

The WF store I frequent has lousy cell reception, so add th step “open Settings app and get on store’s wifi” (and who knows what all that lets them track).

thinkling · 2025-12-16T22:20:35 1765923635

Yes, cement absorbs CO2 as it sets, there are reams of "green cement" startups based on that premise like CarbonBuilt. This paper presents new estimates on how much is actually taken up and what factors matter, but the abstract does not mention whether there is any actionable information. Yawn.

thinkling · 2025-12-14T23:25:36 1765754736

See discussion elsewhere in this thread on updating to 15.7.3:

https://news.ycombinator.com/item?id=46264741

thinkling · 2025-12-11T18:47:20 1765478840

The #1 problem I have typing on my iPhone is that I hit letter keys (mostly 'n') instead of the space bar and the phone just doesn't anticipate this as a possible typo and doesn't offer the right corrections. (I have AutoCorrect off.) It doesn't seem able to learn that this is a common typo, either.

____tom____ · 2025-12-11T22:10:11 1765491011

Hah! I have exactly the opposite problem, I hit the space bar, instead of N, and the iPhone doesn't understand this a possible typo, so all the suggestions and auto-corrects are wrong.

rezonant · 2025-12-11T21:12:31 1765487551

Interesting. Just tried this out on Pixel's gboard and it does seem to correct this sort of issue

thinkling · 2025-12-08T18:30:59 1765218659

Claude Code usage probably isn't counted as "chatbot" use. Also, I think you're overestimating how many people program vs. how many people are using AI chatbots as the new websearch. Orders of magnitude more of the latter.

HarHarVeryFunny · 2025-12-08T18:41:46 1765219306

Sure - US has 1M developers vs 300M pop. At least the Claude Code developers are paying for it though, vs only 5% of users paying for ChatGPT.

thinkling · 2025-12-03T23:50:25 1764805825

The one current paying user of the app I've seen in this discussion called it "Wanderlog". FYI on the stickiness of the current name.

richiebful1 · 2025-12-04T00:05:41 1764806741

wanderlog is a separate web service

https://wanderlog.com/