Vision capabilities dont work well with assistants?

TalhaKhan · December 27, 2024, 2:12am

Hey all. So I just got images to work with the Assistants API but its odd. GPT 4o and mini just dont seem to know what is in an image. Its odd. If I explicitly prompt it by saying “what is in this png image”, it will extract the text from it sometimes but like. If i give it an image of a basketball player and ask it to tell me who this is, it doesnt know.

Is the vision capability basically only capable of segmenting out text from a png?

Topic		Replies	Views
Using vision in Assistants and vector databases API assistants-api	3	312	August 25, 2024
Api not able to read images from any url API gpt-4 , gpt-4-vision , assistants-api	7	3187	October 23, 2024
Retriever Assistant can't read scanned pdfs? API gpt-4 , api	8	3090	January 10, 2026
Is GPT4-o dumber in Assistans API than in normal chat? API gpt-4o	4	911	August 21, 2025
Why does my assistant not support images? API assistants , assistants-api	8	4263	December 13, 2023

Vision capabilities dont work well with assistants?

Related topics