Why can't they be restricted to produce only the concepts of a grade below you? ...

estearum · 2025-08-05T00:54:54 1754355294

Can LLMs be reliably restricted to produce any specific subset of content? AFAICT they're still consistently jailbroken.

wat10000 · 2025-08-05T01:20:19 1754356819

It’s easy to do at the syntactic level by controlling the sampling. For example, it’s easy and common to restrict output to valid JSON by just not allowing any tokens that would make it invalid JSON.

But reliably restricting output at the semantic level is very much an open problem.
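
To make the syntactic case concrete, here is a toy sketch of sampling-time masking. Everything in it is made up for illustration: `VOCAB`, `allowed_next`, `fake_scores`, and `constrained_generate` are hypothetical names, the "model" is just random scores, and the grammar is a flat JSON array of single digits rather than full JSON. Real implementations track a proper grammar state over a real tokenizer, but the idea is the same: at each step, drop every token that would break validity and choose among the rest.

```python
import json
import random

# Toy vocabulary (an assumption; real tokenizers have tens of thousands of tokens).
VOCAB = ["[", "]", ",", "0", "1", "2", "hello", " "]
DIGITS = {"0", "1", "2"}


def allowed_next(prefix: str) -> set[str]:
    """Tokens that keep `prefix` a valid prefix of a flat JSON array of single digits."""
    if prefix == "":
        return {"["}
    if prefix.endswith("]"):
        return set()  # array is closed, nothing may follow
    last = prefix[-1]
    if last == "[":
        return DIGITS | {"]"}
    if last == ",":
        return set(DIGITS)
    return {",", "]"}  # last character is a digit


def fake_scores(prefix: str, rng: random.Random) -> dict[str, float]:
    """Stand-in for a language model: a random score for every token in VOCAB."""
    return {tok: rng.random() for tok in VOCAB}


def constrained_generate(seed: int = 0, max_tokens: int = 12) -> str:
    rng = random.Random(seed)
    out = ""
    steps = 0
    while True:
        allowed = allowed_next(out)
        if not allowed:
            return out  # grammar says we are done
        # Once the budget is spent, close the array as soon as that is legal.
        if steps >= max_tokens and "]" in allowed:
            allowed = {"]"}
        scores = fake_scores(out, rng)
        # The mask: tokens outside `allowed` are never considered,
        # no matter how highly the "model" scored them.
        out += max(sorted(allowed), key=scores.__getitem__)
        steps += 1


print(json.loads(constrained_generate()))
```

Note that "hello" and " " can never appear in the output even when they get the highest score, which is the whole guarantee. Nothing comparable exists for "only concepts below grade N", because there is no checker like `allowed_next` for meaning.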

tomsmeding · 2025-08-04T22:46:12 1754347572

How are you going to ensure that it is impossible for the student to work around whatever measures you take?