Any guides or best practices on how to build security layers on top of LLM agent?

Any comments on

  • Looking for keywords to prevent some cases?
  • Finding the balance of Gen AI vs program code based logic
  • Handling invisible Unicode characters?

Filter the output.

You can also take a look at the safety best practices from the docs:

It provides a good starting point.

1 Like