Hey there Newb GPT api user, I have been tasked with quite the topic here at work.
Here is what I want to do (exact details changed due to NDA and all that):
I have a collection of PDF docs, lets say of video games. These video games are to be parsed to json using a specific schema I defined; this works rather well.
I also have a json file with all the video game categories, its an array of 100 categories with title and ID.
I want to use gpt to, after parsing the PDF to a json, add a category ID and title to the object, using the external json list of categories to find the one which is best.
At the moment I am not sure what the best way to do this is, as far as I can see there are a few ways to do it. Currently I upload the video game doc to my account using the js library, create an assistant with code interpreter tool for using OCR on the document, and then give the schema and exact instructions. Would it makes sense for me to upload the video game categories json and add an additional message to the thread explaining how he should use this file? Or should I define a function, and in this function explain that the assistant should map categories using the previously uploaded file?
On the topic of prompt engineering/assistants/threads, does it make sense to explain a lot of steps in one prompt/message, or should one stream the responses and reactively add appropriate messages?
Sorry for the newb take, any help greatly appreciated