What is it specifically you are looking to achieve?
There should not be problem with the upload but I am not sure I fully understand what you mean by “retrieve” in this context. Can you elaborate on the purpose of your GPT and the role of the images?
Also, if you could share instructions that you use and example prompts, that would be helpful.
Hm. I was able to upload pictures into the knowledge base and prompt the custom GPT to return a description of the images in the role of the GPT user (note: this only worked when code interpreter was enabled).
I am building a custom GPT to act as our ‘Home Knowledge Base and Advisor’ - essentially I have provided it with various documents where I kept record of maintenance or new installations that was done in our house; as well as landscaping improvements. We have a large garden and so I also provided it with names and descriptions of various plants/trees. I also want to upload pictures of the various home appliances, plants, trees, etc and anything house related that is interesting.
Amongst other things I want the GPT to be able to retrieve and show me images of whatever I uploaded e.g. I will ask it:
what model fridge do we have in the back kitchen?
this fridge is not dispensing ice, search for a user manual and help me troubleshoot it
now show me a picture of the fridge from the uploaded images
It responds to question 1 and 2 very well - but cannot retrieve a simple image from its uploaded docs
I uploaded images as part of a Word doc and labelled it within the word doc. However the GPT struggles to correctly identify an image and then just retrieves all images
Sorry I’m new to this so not sure if I understand you. I have a ChatGPT Plus account (paid) and am using it to build my own custom GPTs where I can provide custom instructions, load docs, etc. It works very well, except I haven’t figured out how to make it handle multiple uploaded images well.
I believe they’re talking about the API… see (quickstart…) Basically, you build your own “ChatGPT”… It’s a lot more work, but it gives you more control too.
Have you turned code interpreter on in your Custom GPT when you set it up?