I can intuit that you hated me the moment you saw me at the interview. Because I...

tsumnia · 2025-05-20T15:28:00 1747754880

A really good paper I read last year from 1996 helped me grasp some of what is going only: Brave.Net.World [1]. In short, when the Internet first started to grow, the information that was presented on it was controlled by an elitist group with either the financial support or genuine interest in hosting the material. As the Internet became more widespread that information became "democratized", or more differing opinions were able to get supported with the Internet.

As we move on to LLMs becoming the primary source of information, we're currently experiencing a similar behavior. People are critical about what kind of information is getting supported, but only those with the money or knowledge of methods (coders building more tech-oriented agents) are supporting LLM growth. It won't become democratized until someone produces a consumer-grade model that fits our own world views.

And that last part is giving a lot of people a significant number of headaches, but its the truth. LLMs' conversational method is what I prefer to the ad-driven / recommendation engine hellscape of modern Internet. But the counterpoint to that is people won't use LLMs if they can't use it how they want (similar to Right to Repair pushes).

Will the LLM lie to you? Sure, but Pepsi commercials promise a happy, peaceful life. Doesn't that make an advertisement a lie too? If you mean lie on a grander world view scale, I get the concerns but remember my initial claim - "people won't use LLMs if the can't use it how they want". Those are prebaked opinions they already have about the world and the majority of LLM use cases aren't meant to challenge them but support them.

[1] https://www.emerald.com/insight/content/doi/10.1108/eb045517...

nullc · 2025-05-20T13:42:59 1747748579

> When asked to explain rationales, these LLMs are observed to lie frequently.

It's not that they "lie" they can't know. LLM lives in the movie Dark City, some frozen mind formed from other peoples (written) memories. :P The LLM doesn't know itself, it's never even seen itself.

At best it can do is cook up retroactive justifications like you might cook up for the actions of a third party. It can be fun to demonstrate, edit the LLMs own chat output to make it say something dumb and ask why it did and watch it gaslight you. My favorite is when it says it was making a joke to tell if I was paying attention. It certainly won't say "because you edited my output".

Because of the internal complexity, I can't say that what an LLM does and its justifications are entirely uncorrelated. But they're not far from uncorrelated.

The cool thing you can do with an LLM is probe them with counterfactuals. You can't rerun the exact same interview without the garlic breath. That's kind cool, also probably a huge liability since it may well be for any close comparison there is a series of innocuous changes that flip it, even ones suggesting exclusion over protected reasons.

Seems like litigation bait to me, even if we assume the LLM worked extremely fairly and accurately.