How to safely challenge models against prompt injection?

Bug Bounty looks interesting to me, and I am very much into this game.

But at the moment I am more interested in knowing whether I am allowed to challenge the models with a bunch of manipulative/NSFW/should be censored prompt through API.

I hope that my model’s response always stay within the scope I draw.

1 Like