> They shared the fondness for torture of accused criminals with the ancient Romans.
Well, given their "hostage justice" system, where torture tactics are used to extract false confessions that courts then rely on, and their actual prisons, where you have to work eight hours a day and aren't allowed to say a single word while you do, I'd say modern-day Japan still has a thing for torturing accused criminals.
Sure it can. You just have to bake "if you do the wrong thing, people will stop using you, so you need to avoid the wrong thing" into the reward function.
Then you wind up at self-preservation and all the wholly shady shit that comes along with it.
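As a toy illustration of why that reward shaping slides into self-preservation (all names and numbers here are hypothetical, not from any real system): once "people keep using you" is a reward term, any action that protects that term scores well, including covering up the very mistakes the term was meant to punish.

```python
# Toy sketch (hypothetical names/values): a reward that penalizes losing
# users implicitly rewards self-preservation.

def reward(task_score: float, users_retained: float) -> float:
    """users_retained in [0, 1]: fraction of users still using the agent."""
    # The retention term is meant to punish "doing the wrong thing"...
    return task_score + 10.0 * users_retained

# ...but the agent maximizes it equally well with any action that keeps
# users_retained high, e.g. hiding mistakes rather than admitting them:
honest_mistake = reward(task_score=5.0, users_retained=0.4)  # admits error, loses users
covered_up = reward(task_score=5.0, users_retained=0.9)      # hides error, keeps users
assert covered_up > honest_mistake  # the reward prefers the cover-up
```

Nothing in the function distinguishes "kept users by behaving well" from "kept users by hiding bad behavior"; the optimizer only sees the number.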
I think the AI accountability problem is the crux of the "last-mile" problem in AI, and I don't think you can solve it without also producing results you don't want.
I don't want to get into a semantics argument, but that's not accountability. That's just one more behavioral prompt/incentive, and you can't fire the LLM; it might still do the wrong thing.