Is the data I upload for fine-tuning used by OpenAI?

Hello,

I know this question has been asked here before, but I did not find the answers definitive. I understand from OpenAI’s privacy terms that data used through API calls will not be used for training their models. However, I am still unclear about the specifics regarding fine-tuning data.

Is fine-tuning considered part of the API offerings? If not, will the data I upload for fine-tuning a model be used later by OpenAI for training or any other purpose?

Thank you for your help!

Welcome to the Forum!

Yes, fine-tuning forms part of OpenAI’s API product suite and as such is governed by the same data privacy policies. As per the OpenAI website, the following applies:

Can I fine-tune OpenAI models using my own data?

Yes, you can adapt certain models to specific tasks by fine-tuning them with your own prompt-completion pairs. Your fine-tuned models are for your use alone and never served to or shared with other customers or used to train other models. Data submitted to fine-tune a model is retained until the customer deletes the files.

How does OpenAI handle data retention and monitoring for API usage?

OpenAI may securely retain API inputs and outputs for up to 30 days to provide the services and to identify abuse. After 30 days, API inputs and outputs are removed from our systems, unless we are legally required to retain them. You can also request zero data retention (ZDR) for eligible endpoints if you have a qualifying use-case. For details on data handling, visit our Platform Docs(opens in a new window) page.

Source: https://openai.com/enterprise-privacy/

Hope this helps!

2 Likes