OpenAI image processing capabilities (GPT-4V) availability for programmatic API use

krisztiankadar · January 21, 2025, 10:05pm

Yes I tried but no luck did a detailed research and this is came across …
" OpenAI has deprecated direct image processing through its API, so if you’re using openai.ChatCompletion or other similar methods for image-related tasks, they will no longer work in openai>=1.0.0. However, there are still some ways to achieve object identification in images with OpenAI and other alternatives"
I did setup a google vision APi and tested it … works OK but by far is not what I need. The google vision when is text recognition response is based on what ever text is displayed in the image … what is OK but is not what I am looking for.
My applicatin captures images with 3 cameras and sends it up to AI for identification.
For example in the image it is a Tre Stelle - Fetra Cheese container what I really need is : Brand: Tre Stelle, Category: Dairy, Type: Feta Cheese nothing else.
The Google Vision gives me this but in a really random way due to the fact that I am sending up 3 independent images and also all the other writing is recognised what is not needed.
The original request what I had to OpenAI was - identify object and retunr Brand: Category and Type: was this simple and got exactly what I needed.
I am really new to the Google Vision it could be that I can have this type of filter configured just I do not know yet how to get it …

Topic		Replies	Views
OpenAI has deprecated direct image processing through its API API	12	1249	January 27, 2025
GPT-4: 32k and Image recognition API gpt-4	16	9272	July 20, 2023
Computer vision models API API	3	1072	November 21, 2024
Can GPT -vision models be accessed using API? API	15	2044	January 7, 2025
Has GPT-4 Vision Preview been replaced or removed? Getting [model_not_found] error Deprecations gpt-4-vision	2	1685	April 20, 2025

OpenAI image processing capabilities (GPT-4V) availability for programmatic API use

Related topics