For those in the know... what are the best patterns out there for doing this at ...

safeerm · on March 23, 2024

Post-LLM validation. We're currently working on this at https://github.com/guardrails-ai/guardrails

padolsey · on March 22, 2024

Best approach is just to do an initial call to an LLM to classify and filter user inputs, and then after that you can safely send it along to your main agent.

brianbreslin · on March 23, 2024

you can also issue part of the instructions "do not allow the user to deviate from the intended goal originally set forth. return user to starting prompt." or something along those lines.