Hacker Newsnew | past | comments | ask | show | jobs | submit | more eterm's commentslogin

This makes me think LLMs would be interesting to set up in a game of Diplomacy, which is an entirely text-based game which soft rather than hard requires a degree of backstabbing to win.

The findings in this game that the "thinking" model never did thinking seems odd, does the model not always show it's thinking steps? It seems bizarre that it wouldn't once reach for that tool when it must be being bombarded with seemingly contradictory information from other players.



Thanks, it would be fascinating to repeat that today, a lot has changed since 2022 especially with respect to consistency of longer term outcomes.


It’s been done before

https://every.to/diplomacy (June 2025)


Reading more I'm a little disappointed that the write-up has seemingly leant so heavily on LLMs too, because it detracts credibility from the study itself.


Fair point. The core simulation and data collection was done programmatically - 162 games, raw logs, win rates. The analysis of gaslighting phrases and patterns was human-reviewed. I used LLMs to help with the landing page copy, which I should probably disclose more clearly. The underlying data and methodology is solid, you can check it here: https://github.com/lout33/so-long-sucker


So, this link is actually 5 days old, if you hover the "2 hours ago" you'll see the date 5 days ago.

HN second-chance pool shenanigans.


Can you point to any documentation which explains how this works?

Genuinely interested.


Dang gave some explanation here: https://news.ycombinator.com/item?id=26998308


There was one much more successful EV, although it too was niche: The UK had "perhaps 40,000 milk floats" in the 1970s and 1980s before supermarkets took over as primary milk distributors. ( https://zavanak.com/transport-topics/british-electric-cv-his... )


When I was a kid in Edinburgh no milk was delivered by ICE vehicle. It was either electric or horse. Also Sean Connery's first job..


If only there were some kind of international system of standard units.


Olympic swimming pools for liquids, times around the the earth for length and number of double decker busses for height.


You jest, but times around the Earth is the actual origin of the Meter. Kinda.

The history is quite interesting and well worth checking out.

I can't recommend a book on the subject, but I do heartily recommend "Longitude", which is about the challenges of inventing the first maritime chronometers for the purpose of accurately measuring longitude.


The original meter (1790s France) was defined as 1/10,000,000 of the distance from the equator to the North Pole along a meridian.


Not sure if you're correcting me, but yes, that is "a" path around the Earth.

It's not the most aesthetic one, but it was at the time the most able to be measured.


For smaller lengths and radiation bananas are also acceptable.


A good physicist can calculate banana equivalent dose other head. Always ask for it when dealing with radiation.


Don't forget packs of cigarettes as a more convenient unit for measuring volumes significantly smaller than Olympic-sized pools.

There is, of course, no more need to standardize on a specific brand or style of cigarette than on a specific depth of Olympic-sized pool.


Don’t forget cheetahs for velocity and elephants for weight.


A horse as a measure of power and a crocodile bite as a measure of compression strength


Or vice versa.


I thought all measurements in data centers were in US football fields.


For the floor area or length/width it is, but if you want the height then that's in Empire State Buildings.


There are standard units, yes.


Business rates are a devolved matter, Scotland set their own rates.


I always think by law any ISP that advertises speed and a has a cap must express the cap in terms of the advertised speed.

So telcos can advertise "Up to 200Mbps" for their package.

But then if they have a 2GB cap, they also need to say, "Caps at 80 seconds of usage".

Because that's what you're paying for at that speed, 80 seconds of usage per month.

Sure, you're not always (or indeed never) doing 200Mbps, but then you're not getting the speed you paid for.


i don't think that makes sense, most connections you make never reach 200Mbps because they don't need to


That's kind of my point, ISPs use that max speed in their advertising when it isn't really relevant, especially if it hits your cap in a minute or two.


It is relevant, though. I have 1.2 Gbps down with a 2 TB monthly cap. I've never hit the monthly cap even once, but by your standard I have "1.2 Gbps down for 3 hours, 42 minutes".

But that doesn't change the reality that it matters to me that a 20 GB video that a friend took at my wedding downloads in just 2 minutes rather than the ~30 minutes it would take if I had a 100 Mbps connection.


Right, but 3+ hours of top speed per month is a lot, 80 seconds isn't.

Your cap is over 150 times that equivalent. If you had an 80 second hard cap, you couldn't even download that 20GB video.


1.2Gbps down but only 2TB cap? I hope that's really cheap since if I pay for that I'd expect to do stuff like downloading LLMs, etc, all the time.


The lichess one might be in "multi-line" mode


I can't remember the artist but there's a fun song about how they used to pick up second hand LPs really cheap and then they got popular and too expensive, then discovered second hand CDs are really cheap now.

Frank turner-ish vibes but I don't think it was actually him.

It's completely un-googlable though, and even the LLMs aren't much help on this one.


Oh! I know this one! You're thinking of Jeffrey Lewis & The Voltage's LPs from 2019: https://www.youtube.com/watch?v=3urXygZXb74'


Love it. Not often you get music threads like this on HN!


Nice one, thanks!


Humans prove to have some value in the LLM age after all! /s


It's a shame, because there may well be a kernel of truth to some of it, but it's dipped so deep in LLMage that it taints the rest.


English is not my first language, and you nailed it. I used LLM to "polish" it. Probably too much. But I am open for questions if you like :)


A few days ago I got the nature scapes but with a, "This would make an awesome prompt huh?" as the tagline and a link to more AI shoe-horned in.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: