earthboundkid on Sept 25, 2023 | on: Knuth's 20 Questions for ChatGPT

Yes. I strongly feel that reinforcement learning should be applied to punish the LLMs for speculating about their past behavior. They should respond along the lines of “I’m sorry, I don’t know why I said 3 + 5 is 9, but I will try to answer again.”