OpenAI will not use data submitted by customers via our API to train or improve our models, unless you explicitly decide to share your data with us for this purpose. You can opt-in to share data.
Any data sent through the API will be retained for abuse and misuse monitoring purposes for a maximum of 30 days, after which it will be deleted (unless otherwise required by law).
The OpenAI API processes user prompts and completions, as well as training data submitted to fine-tune models via the Files endpoint. We refer to this data as API data.
Here is my question:
Does the API-data-usage-policy only refer to the Files endpoint for fine-tuning?
Specifically, does it apply to https://platform.openai.com/ services such as Playground.
i.e. is it correct to say that no inputs, outputs, or user interaction from users of Playground will be used to improve the models or services?
Fun experiment:
Feed my question into Playground and it will tell you not to worry but if you click the thumbs up it toasts you for helping it improve the model. Feed it into ChatGPT and it says that Playground is NOT covered by the API data usage policy and interactions with it can be used to improve the model.
The playground is essentially a wrapper for the API, so any usage on the playground counts as API usage.
I understand that chatGPTâs answers about the privacy policy may have caused confusion, ChatGPTâs knowledge cutoff is September 2021, and back then, OpenAI was able to use your data. However, OpenAI has since changed their policy and will now only use data from users who specifically opt-in.
Itâs more accurate to say, âThe playground calls an api endpointâ. Almost all non-static websites are split into a web-service front-end which calls an API backend. So playground (web-service) can call lots of different backends (gpt-4, gpt-3.5-turbo). APIs are usually accessed by curl or sdk (python, go) code, or by a webservice frontend. OpenAI clearly says that ChatGPT and Dalle (webservices) which call backend APIs are NOT covered by the API Data Usage policy. They say the âNo data supplied to APIs are used in trainingâ loud, but say the âWeb services which call APIs have different termsâ very quietly. Mostly there is a lot of confusion about the API / Webserver split, and they donât anywhere list which web services follow which Privacy policy (having only named Dalle and ChatGPT as âexamplesâ that donât follow API Data Policy). I would therefore assume, if itâs got a user interface (GUI) then itâs probably following Privacy terms similar to ChatGPT, but only a clearly written Privacy policy can determine this conclusively, not the opinions of the user base.
(c) Use of Content to Improve Services. We do not use Content that you provide to or receive from our API (âAPI Contentâ) to develop or improve our Services. We may use Content from Services other than our API (âNon-API Contentâ) to help develop and improve our Services. You can read more here about how Non-API Content may be used to improve model performance. If you do not want your Non-API Content used to improve Services, you can opt out by filling out this form. Please note that in some cases this may limit the ability of our Services to better address your specific use case."
We tried filling out the opt-out form but never got a response to confirm. Itâs a âweâll get back to youâ not a âyou have opted outâ flow. And these terms change every two months or so.
âIf you do not want your Non-API Content used to improve Services, you can opt outâŚâ
For API calls, openAI wouldnt actually recieve the human feedback of what was done with the completion/response. Although i believe there is an opt in to let openAI train directly on your inputs. I should have been more clear.
Yeah i see what you mean, the wording here is vague and confusing, theyâve updated the help page a week ago to clarify what they mean
When you use our non-API consumer services ChatGPT or DALL-E, we may use the data you provide us to improve our models. You can request to opt-out of having your data used to improve our non-API services by filling out this form with your organization ID and email address associated with the owner of the account.
In the intervening year, a message has been added to the lower right of the playgroundâs parameter interface bar, noting new policy, last updated July 21, 2023:
API and Playground requests will not be used to train our models. Learn more
Playground inputs go to the moderation endpoint unless you opt out â and still go to a content flagging system.