More

eterm · 2026-01-20T23:02:33 1768950153

This makes me think LLMs would be interesting to set up in a game of Diplomacy, which is an entirely text-based game which soft rather than hard requires a degree of backstabbing to win.

The findings in this game that the "thinking" model never did thinking seems odd, does the model not always show it's thinking steps? It seems bizarre that it wouldn't once reach for that tool when it must be being bombarded with seemingly contradictory information from other players.

qbit42 · 2026-01-20T23:14:34 1768950874

https://noambrown.github.io/papers/22-Science-Diplomacy-TR.p...

eterm · 2026-01-20T23:30:10 1768951810

Thanks, it would be fascinating to repeat that today, a lot has changed since 2022 especially with respect to consistency of longer term outcomes.

open-paren · 2026-01-21T01:15:17 1768958117

It’s been done before

https://every.to/diplomacy (June 2025)

eterm · 2026-01-20T23:05:40 1768950340

Reading more I'm a little disappointed that the write-up has seemingly leant so heavily on LLMs too, because it detracts credibility from the study itself.

lout332 · 2026-01-20T23:51:17 1768953077

Fair point. The core simulation and data collection was done programmatically - 162 games, raw logs, win rates. The analysis of gaslighting phrases and patterns was human-reviewed. I used LLMs to help with the landing page copy, which I should probably disclose more clearly. The underlying data and methodology is solid, you can check it here: https://github.com/lout33/so-long-sucker

eterm · 2026-01-17T16:50:03 1768668603

So, this link is actually 5 days old, if you hover the "2 hours ago" you'll see the date 5 days ago.

HN second-chance pool shenanigans.

alt227 · 2026-01-17T18:34:19 1768674859

Can you point to any documentation which explains how this works?

Genuinely interested.

azhenley · 2026-01-17T18:51:19 1768675879

Dang gave some explanation here: https://news.ycombinator.com/item?id=26998308

eterm · 2026-01-15T16:38:50 1768495130

There was one much more successful EV, although it too was niche: The UK had "perhaps 40,000 milk floats" in the 1970s and 1980s before supermarkets took over as primary milk distributors. ( https://zavanak.com/transport-topics/british-electric-cv-his... )

dboreham · 2026-01-15T17:57:58 1768499878

When I was a kid in Edinburgh no milk was delivered by ICE vehicle. It was either electric or horse. Also Sean Connery's first job..

eterm · 2026-01-15T13:20:34 1768483234

If only there were some kind of international system of standard units.

blitzar · 2026-01-15T13:25:25 1768483525

Olympic swimming pools for liquids, times around the the earth for length and number of double decker busses for height.

literalAardvark · 2026-01-15T13:48:32 1768484912

You jest, but times around the Earth is the actual origin of the Meter. Kinda.

The history is quite interesting and well worth checking out.

I can't recommend a book on the subject, but I do heartily recommend "Longitude", which is about the challenges of inventing the first maritime chronometers for the purpose of accurately measuring longitude.

EA · 2026-01-15T14:18:46 1768486726

The original meter (1790s France) was defined as 1/10,000,000 of the distance from the equator to the North Pole along a meridian.

literalAardvark · 2026-01-15T14:32:55 1768487575

Not sure if you're correcting me, but yes, that is "a" path around the Earth.

It's not the most aesthetic one, but it was at the time the most able to be measured.

RobotToaster · 2026-01-15T13:36:10 1768484170

For smaller lengths and radiation bananas are also acceptable.

lostlogin · 2026-01-15T14:09:47 1768486187

A good physicist can calculate banana equivalent dose other head. Always ask for it when dealing with radiation.

jasomill · 2026-01-15T18:28:48 1768501728

Don't forget packs of cigarettes as a more convenient unit for measuring volumes significantly smaller than Olympic-sized pools.

There is, of course, no more need to standardize on a specific brand or style of cigarette than on a specific depth of Olympic-sized pool.

brookst · 2026-01-15T13:43:47 1768484627

Don’t forget cheetahs for velocity and elephants for weight.

blitzar · 2026-01-15T18:33:32 1768502012

A horse as a measure of power and a crocodile bite as a measure of compression strength

dredmorbius · 2026-01-15T20:10:24 1768507824

Or vice versa.

coldcode · 2026-01-15T14:18:35 1768486715

I thought all measurements in data centers were in US football fields.

Nifty3929 · 2026-01-15T16:46:43 1768495603

For the floor area or length/width it is, but if you want the height then that's in Empire State Buildings.

paulddraper · 2026-01-15T16:27:39 1768494459

There are standard units, yes.

eterm · 2026-01-14T19:02:52 1768417372

Business rates are a devolved matter, Scotland set their own rates.

eterm · 2026-01-14T18:19:40 1768414780

I always think by law any ISP that advertises speed and a has a cap must express the cap in terms of the advertised speed.

So telcos can advertise "Up to 200Mbps" for their package.

But then if they have a 2GB cap, they also need to say, "Caps at 80 seconds of usage".

Because that's what you're paying for at that speed, 80 seconds of usage per month.

Sure, you're not always (or indeed never) doing 200Mbps, but then you're not getting the speed you paid for.

throawayonthe · 2026-01-14T19:02:14 1768417334

i don't think that makes sense, most connections you make never reach 200Mbps because they don't need to

eterm · 2026-01-14T19:12:07 1768417927

That's kind of my point, ISPs use that max speed in their advertising when it isn't really relevant, especially if it hits your cap in a minute or two.

bscphil · 2026-01-14T21:14:36 1768425276

It is relevant, though. I have 1.2 Gbps down with a 2 TB monthly cap. I've never hit the monthly cap even once, but by your standard I have "1.2 Gbps down for 3 hours, 42 minutes".

But that doesn't change the reality that it matters to me that a 20 GB video that a friend took at my wedding downloads in just 2 minutes rather than the ~30 minutes it would take if I had a 100 Mbps connection.

eterm · 2026-01-14T21:35:31 1768426531

Right, but 3+ hours of top speed per month is a lot, 80 seconds isn't.

Your cap is over 150 times that equivalent. If you had an 80 second hard cap, you couldn't even download that 20GB video.

digiown · 2026-01-15T02:11:02 1768443062

1.2Gbps down but only 2TB cap? I hope that's really cheap since if I pay for that I'd expect to do stuff like downloading LLMs, etc, all the time.

eterm · 2026-01-13T21:50:16 1768341016

The lichess one might be in "multi-line" mode

eterm · 2026-01-13T20:21:18 1768335678

I can't remember the artist but there's a fun song about how they used to pick up second hand LPs really cheap and then they got popular and too expensive, then discovered second hand CDs are really cheap now.

Frank turner-ish vibes but I don't think it was actually him.

It's completely un-googlable though, and even the LLMs aren't much help on this one.

nluken · 2026-01-13T22:01:41 1768341701

Oh! I know this one! You're thinking of Jeffrey Lewis & The Voltage's LPs from 2019: https://www.youtube.com/watch?v=3urXygZXb74'

te_chris · 2026-01-14T10:11:49 1768385509

Love it. Not often you get music threads like this on HN!

eterm · 2026-01-13T22:49:07 1768344547

Nice one, thanks!

richrichardsson · 2026-01-14T10:07:55 1768385275

Humans prove to have some value in the LLM age after all! /s

eterm · 2026-01-10T17:33:15 1768066395

It's a shame, because there may well be a kernel of truth to some of it, but it's dipped so deep in LLMage that it taints the rest.

functional_dev · 2026-01-10T18:07:15 1768068435

English is not my first language, and you nailed it. I used LLM to "polish" it. Probably too much. But I am open for questions if you like :)

eterm · 2026-01-10T17:13:25 1768065205

A few days ago I got the nature scapes but with a, "This would make an awesome prompt huh?" as the tagline and a link to more AI shoe-horned in.