Asisstant API for querying image

kamal.abdul · June 7, 2024, 7:38am

With Assistant API, document can be uploaded for RAG. How can I use image as in vision with Assistant API?

kamal.abdul · June 8, 2024, 1:37am

In ChatGPT Plus, I created a custom chat where documents and images can be uploaded, and can be queried later. How can that achieved using API?

_j · June 8, 2024, 1:59am

You can do that by:

uploading the image to the API file storage, with purpose “vision”
including the file ID as a part of a user message you add to a thread.

The API reference for assistants → messages has the format for passing user messages when you expand the content section.

kamal.abdul · June 9, 2024, 1:55pm

Thanks Jay. It works well for me.
This is the code in Nodejs - may be beneficial for others too.

const imageFileId = “file-xxxxxxxxxxxxxx”

const actualContent = [{ type: ‘text’, text: prompt }, { type: ‘image_file’, image_file : {file_id: imageFileId} } ]
const threadMessages = await openai.beta.threads.messages.create(
myThreadId,
{ role: “user”, content: actualContent },

);

kamal.abdul · June 25, 2024, 2:17am

uploading image a part of message worked back then, now it always failed even in the playground.

_j · June 25, 2024, 2:41am

That is because OpenAI’s assistant product, and the developer outreach and support (if you aren’t a blog-worthy partner), is franky, a turd, that has taken half a year to have a mere sugar-coating applied.

Vision has been broken by OpenAI for going on a week.

OpenAI’s blog is now not product announcements and developments but “success stories”. Quotes like: “Driven by its mission to remove tedious tasks from developers’ workflows, JetBrains incorporated OpenAI’s API into its AI Assistant product.”

To be such a partner is apparently allowing OpenAI to make up stories about your satisfaction and the quality of their product under your name. Or simply that no sane developer would rely on OpenAI Assistants, a low-code generic “solution” that maximizes code use while minimizing your control, and only attempts to solve a dumb middle-manager problem, “how do I chat about my PDFs”.

Topic		Replies	Views
Can Assistants API understand image files uploaded? API	11	11366	September 28, 2024
Using vision in Assistants and vector databases API assistants-api	3	245	August 25, 2024
Inputting an image in the Assistant API using the new vision model API gpt-4-vision , assistants-api	9	5027	July 16, 2024
There is no available documentation for the Assistants V2 API Documentation api	10	2619	June 5, 2024
Uploading images to Assistants API	4	3039	December 17, 2023

Asisstant API for querying image

Related topics