Chatgpt 4o API For Sending Both PDF and Images

lt359 · June 25, 2024, 2:29pm

I am trying to write my app that can send both images and pdf attachments to ChatGPT 4o.

I am aware that using the openAI Assistant feature with FileID makes reading PDF possible with ChatGPT4o. And I am also aware of using normal completion API with image_url makes reading image possible.

The annoying part is that the Assistant feature doesn’t support images, and on the contrary sending PDF as image_url with completion API results in “Invalid MIME type. Only image types are supported.”

Do I really have to check file extension first to determine to use Assistant or Completion?

Diet · June 25, 2024, 4:24pm

Welcome to the community!

what makes you say that?

https://platform.openai.com/docs/api-reference/messages/createMessage

lt359 · June 25, 2024, 6:32pm

Thank you Diet.

I see that Assistant thread message creation has this image_url feature, but it doesn’t work quite the same way as the CompletionAPI where it can accept local image_url rather than the thread message only accepts external image_url.

I guess I could first determine if the extension is non-pdf file, then I will upload the image file to an image hosting service to provide the image_url to thread message.

Is that the best approach for now?

_j · June 25, 2024, 8:19pm

The Assistants API supports file upload of images first to your API storage, and then you can attach an image to a user message placed into a thread by using the file ID received.

That method is clearly depicted in the image of the expanded API reference above.

This will not currently work though, because OpenAI broke the API and has not rectified the issue for a week.

lt359 · June 25, 2024, 8:32pm

I see their API is giving me an error message “Sorry, something went wrong.” and because of that I thought the Thread Assistance does not have image functionality yet.
Thank you for clarifying this!

oitan · August 6, 2024, 10:26am

Images with Assistants still don’t work, if someone was wondering

latticus · September 2, 2024, 1:18am

Has anyone gotten this to work yet? And uploading the image to a hosting service did not work since it needs to be a trusted public image source like Wikicommons, BBC, etc.

oitan · September 16, 2024, 12:49pm

sorry for the delay, I just noticed your question. I am using chat completions for images and assistant for pdfs. Then I am getting their answer on my main assistant. Connecting those using function calls, and sometimes calling the needed pdf-assistant/image-chat-completion directly when I am receiving a file, not URL. For the context, I am doing these for WhatsApp and Telegram chatbots

henriquelemos0 · November 14, 2024, 6:15pm

It works!
using type = image_url or image_file

https://platform.openai.com/docs/api-reference/messages/createMessage

nguyentrungchanh · February 12, 2025, 7:33am

Hi there,

Can you give a code example how to make it work. I am still learning and having difficult to upload image.

Thank you

Topic		Replies	Views
Sending Images to New gpt-4-turbo via the assistants API? API gpt-4 , chatgpt , api	2	2627	May 3, 2024
Can Assistants API understand image files uploaded? API	11	11444	September 28, 2024
Is attaching a file to a prompt possible through API as it is in the UI? API	12	13276	March 18, 2025
Send file as attachment in the prompt and ask questions about it instantly API chat-completion , file-uploads	7	45840	December 17, 2024
Using images to discuss with an assistant API	14	11537	September 14, 2024

Chatgpt 4o API For Sending Both PDF and Images

Related topics