Question about Files, Assistants & pre-trained data

itsvnk · May 25, 2024, 6:50pm

Hello All

I have waded through the documentations and perhaps I missed it and would like to know before i go and do some tests. So here goes…

On a particular topic, let us say that OpenAi already has data in terms of statistics, etc. Let us say that, on the same topic, I upload some of my own data that may have some kind of an overlap with the existing data and will have some new data as well

Now, when I use this file with the assistants API, will OpenAI use its own data where my file missed it?
And, how will it handle overlaps? Which data takes precedence?

Thanks a ton, in advance

vb · May 25, 2024, 7:10pm

In my experience when working with libraries that had breaking changes in newer versions the model would almost always refer to the training data unless specifically instructed to treat provided input as correct.

This can be done via few-shot prompting or specifically mentioning the relevant documents that should supersede the training data.

In the end you will have to make you own tests but in general the model will use what it has learned first unless instructed differently.

Topic		Replies	Views
New Assistant feature and Fine-tuning API	4	3761	February 5, 2024
How does OpenAI assistant handle the data given to in File Search? API assistants	2	3453	May 3, 2024
Data privacy on file uploads in relation to the new assistants (retrieval etc) API api , privacy , assistants , assistants-api	1	2405	November 8, 2023
Does the assistant ONLY use the knowledge files to answer the requests? API assistants , assistants-api	3	1567	November 18, 2023
How to use multiple documents in Assistants API to get best result? API api , assistants-api	0	642	January 8, 2024

Question about Files, Assistants & pre-trained data

Related topics