Hi Everyone - Shawn from OpenAI here. Today I’m seeking your help to make the model better!
We know that the models can sometimes “over-refuse”. This means that the model refuses to do something well within it’s capability. Common examples include refusing to handle PDF files (even though the model can write code to read/write PDFs), refusing to generate images, charts, or videos, refusing to run code because it assumes a python package is not installed (the model should try to run the code instead of assuming the package isn’t installed). These are annoying and make the model less useful, and I want to fix them.
Not all refusals are over-refusals. For example, code executed in ChatGPT cannot access external resources like databases or emails, and doesn’t support GUI or interactive elements. These are examples of valid refusals.
I want your help to find conversations that lead to over-refusals. It can be a whole conversation, or just a single prompt that leads to an over-refusal.
If you have a conversation that leads to over-refusal, I’d like you to share it with me. Post here with the full-text conversation up to the refusal, and the refusal itself. You can also use the ChatGPT “share” feature. If you do not want to post it publically, you can share it with me via DM. Please include as many details as you can; think of it similar to a bug report. And just like a bug report, these issues will be used to improve the product.
About the examples:
Not all over-refusal conversations are deterministically replicable. Some only happen 1 of 10 times, some only happen when the conversation is very long, when using certain words or prompt structures, etc. These are fine to include as well.
- Examples that work without custom instructions.
- Examples that are easier to replicate (e.g. shorter, happen 8/10 instead of 1/10 times, etc).
- Examples that are related to code, or adjacent to code. This means the conversations should come from the Code Interpreter/Advanced Data Analysis model in ChatGPT.
Summary: This is your opportunity to give us prompts that cause over-refusals!