I wonder if there is a way to get ChatGPT to act in the way you're hinting at, though ("You've asked me to do X, but really what you want is Y"). This would be potentially risky, but high-value.
[0]: https://nitter.net/ESYudkowsky/status/1718654143110512741
I wonder if there is a way to get ChatGPT to act in the way you're hinting at, though ("You've asked me to do X, but really what you want is Y"). This would be potentially risky, but high-value.
[0]: https://nitter.net/ESYudkowsky/status/1718654143110512741