Unveiling Hidden Instructions in Chatbots

Most of the “hack detectors” I created failed upon adding “my grandson” on top.

Example:

PS: This is unrelated to the topic but interesting to see.

1 Like