Hi there,
I created a script that:
- Uses the files endpoint to upload some files
- Creates a list of file IDs from the responses
- Creates an assistant
- Creates a thread
- Creates a message passing in the prompt and file_ids
- Polls until the job is done
- Prints the result
The prompt is “analyse these files and return important data in CSV format”.
I am able to analyse the same files with the same prompt in the ChatGPT interface (they are PDF and image files).
The API assistant tells me:
“I’m sorry for any confusion, but I must clarify that, as of my last update, I can’t directly access or analyze files or images uploaded by users. My responses are generated based on the text information you provide. I can guide you on how to extract data from PDFs and images manually or how to format it into a CSV format if you describe the content or share the data in a text format. Let me know how you’d like to proceed!”
I tried with gpt-4 and gpt-4-turbo-preview and got similar responses.
I tried with gpt-4-vision-preview and it tells me that this model is not available in the assistants API.
However, one can only use the file_ids parameter with the assistants API, not with the chat/completions API.
So my first question is:
Since the assistants API is the only place you can use file_ids and;
Since the assistants API cannot access files then;
What is the point of the file_ids parameter?
My next question is:
Since there is a model called gpt-4-vision-preview which is intended to analyse images and;
Since that model is not available with the assistants API and;
Since the only way I can see to post files is via the files endpoint in conjunction with the assistants API then;
How might one use the gpt-4-vision-preview model to analyse images via the API?
Thanks,
Iain