I’m making a question and answer gpt, we’re i need my bot to analyse an image and make a question about that image, I’ve got the image analysis part down but I need the user to be able to see the image to better understand the question. I’ve tried giving it the photo file or the url link but it keeps telling me it can’t do that, but I know it can because on ChatGPT (OpenAI) it tells me to give the image url then it just displays the image. How can I get my gpt/assistant to do that?
This is something you must do completely in your user interface with your code.
The assistants endpoint can’t use gpt-4-vision, so I assume you are programming on chat completions.
You would create a file store that accompanies the message, and then just use user interface methods to show the uploaded image, whether it be a web page or client application or native code.
Thank you for your reply, though do you know any pdf or video that can show me how to do this, as I do not yet know how to use the user interface method quite well.
What you ask is more “how do I program computers to do what I want”.
You can start with the very clever AI you have at hand. Give it a system message “You are an expert programmer and tutor in xxxxx”, and ask away for the particular type of coding you are doing…