We have a large, mostly static dataset that we want to use with the search and also the answer API.
The problem is that we can upload a large dataset and create a file but we can’t update it.
Ideally it would be a corpus indexed by ID, which would let us delete, insert, or update an individual item within that file.
The problem is right now it’s static and so would quickly become stale for us.
Are there any workarounds here?
Short of continuously uploading new data and deleting old files, not at the moment. We've had this feature in the backlog for a while but haven't pushed on it.
Can you give me some details on your dataset? How large is it / how often does it change, that sort of thing?
Well, I can't in public without giving away too many details of our app.
But there are tons of use cases — say a Twitter indexer, or Gmail, or anything where the underlying content changes fairly often.
That would technically be possible, but super expensive.
Say the file was 50MB: to update 100 bytes I'd have to re-index the whole 50MB, paying for the tokens each time.
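One way to blunt that cost, short of native per-item updates, is to split the corpus into many small files and re-upload only the shard containing the changed item — a 100-byte update then re-indexes roughly 500KB instead of 50MB. A minimal sketch of the bookkeeping (the shard count and the `upload`/`delete` hooks are assumptions for illustration, not part of any real API):

```python
# Sketch: shard a corpus by item ID so an update dirties only one small shard.
# NUM_SHARDS and the in-memory dict are illustrative choices, not a real API.
import hashlib

NUM_SHARDS = 100  # ~500KB per shard for a 50MB corpus


def shard_for(item_id: str) -> int:
    """Stable shard assignment: hash the item ID into a shard index."""
    digest = hashlib.sha256(item_id.encode("utf-8")).hexdigest()
    return int(digest, 16) % NUM_SHARDS


class ShardedCorpus:
    def __init__(self) -> None:
        # shard index -> {item_id: item_text}
        self.shards = {i: {} for i in range(NUM_SHARDS)}

    def upsert(self, item_id: str, text: str) -> int:
        """Insert or update one item; returns the one shard needing re-upload."""
        s = shard_for(item_id)
        self.shards[s][item_id] = text
        return s

    def delete(self, item_id: str) -> int:
        """Remove one item; returns the one shard needing re-upload."""
        s = shard_for(item_id)
        self.shards[s].pop(item_id, None)
        return s
```

After each change you would delete and re-upload just the returned shard's file, trading one big file for many small ones; the hashing keeps each item's shard assignment stable across updates.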