Gradio app to classify image using gpt 4 vision

defiman1729 · January 14, 2024, 8:01pm

I am trying to create a simple Gradio app that lets me upload an image from a local folder. The image is then encoded to base64 and passed in the payload of the GPT-4 Vision API.
I am creating the interface as:

iface = gr.Interface(process_image,"image","label")
iface.launch()

But I am unable to encode this image, or use it directly, to call the chat completions API without errors.

Has anyone tried?

I have, however, been able to create a FastAPI endpoint that lets me upload a file and returns the expected output:


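import base64
import os

import requests
from fastapi import FastAPI, UploadFile

# Assumption: the API key is read from the OPENAI_API_KEY environment variable
api_key = os.environ["OPENAI_API_KEY"]

app = FastAPI()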

@app.post("/uploadfile")
def create_upload_file(file: UploadFile):
    # Read the uploaded bytes directly from the request. Opening file.filename
    # only works if a file with that name exists in the server's working directory.
    image_bytes = file.file.read()

    # Base64-encode the image for the data URL in the payload
    base64_image = base64.b64encode(image_bytes).decode("utf-8")

    headers = {
      "Content-Type": "application/json",
      "Authorization": f"Bearer {api_key}"
    }

    payload = {
      "model": "gpt-4-vision-preview",
      "messages": [
        {
          "role": "user",
          "content": [
            {
              "type": "text",
              "text": """ identify the following from the image: My prompt here
                            """
            },
            {
              "type": "image_url",
              "image_url": {
                "url": f"data:image/jpeg;base64,{base64_image}"
              }
            }
          ]
        }
      ],
      "max_tokens": 300
    }

    response = requests.post("https://api.openai.com/v1/chat/completions", headers=headers, json=payload)
    response.raise_for_status()  # surface HTTP errors instead of failing on a missing "choices" key

    # Parse the response once and reuse the model's reply
    content = response.json()["choices"][0]["message"]["content"]
    print(content)
    return {"status": "loaded successfully, check console", "output": content}
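
For the Gradio side, a likely cause of the encoding errors is that the "image" input shortcut passes the callback a numpy array by default, not a file path, so open() on it fails. Below is a minimal sketch, not a tested solution, that reuses the encoding and payload from the endpoint above. It assumes the key is in an OPENAI_API_KEY environment variable; build_payload and main are hypothetical helper names, and the output component is swapped from "label" to "text" because the model returns free-form text.

```python
import base64
import os

API_URL = "https://api.openai.com/v1/chat/completions"


def encode_image(image_path):
    """Return the file at image_path as a base64 string."""
    with open(image_path, "rb") as image_file:
        return base64.b64encode(image_file.read()).decode("utf-8")


def build_payload(base64_image, prompt):
    """Build the same chat completions payload as the FastAPI endpoint."""
    return {
        "model": "gpt-4-vision-preview",
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": prompt},
                    {
                        "type": "image_url",
                        "image_url": {"url": f"data:image/jpeg;base64,{base64_image}"},
                    },
                ],
            }
        ],
        "max_tokens": 300,
    }


def process_image(image_path):
    """Gradio callback; receives a file path because of type="filepath"."""
    import requests  # imported lazily so the helpers above work without it

    # Assumption: the key is stored in the OPENAI_API_KEY environment variable
    headers = {
        "Content-Type": "application/json",
        "Authorization": f"Bearer {os.environ['OPENAI_API_KEY']}",
    }
    payload = build_payload(encode_image(image_path), "My prompt here")
    response = requests.post(API_URL, headers=headers, json=payload)
    response.raise_for_status()
    return response.json()["choices"][0]["message"]["content"]


def main():
    import gradio as gr

    # type="filepath" makes Gradio pass a path instead of a numpy array
    iface = gr.Interface(process_image, gr.Image(type="filepath"), "text")
    iface.launch()
```

Calling main() starts the app. Keeping encode_image and build_payload separate from the request makes it possible to check the payload without calling the API.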
