GPT says it cannot read images

Hi there!

I’m trying to create a script that makes database for my bssns. Bascially it has to read the screenshot from a link, describe it, and write it in a google sheet.

I’m not a developer, and know very little about programming, so I’m using chatGPT to do everything instead of me.

In theory everything is ok, API sends the picture, I receive the answer, but the answer is: I’cannot tell you what’s on the picture. I’ve tried different pictures, and different prompts, incuding simple “what’s on the picture”. I’ve tried files from google drive, and some random pictures on the web.

I’m using google appscript to do all this.

Can you help me to resolve this problem?

I’m attaching what is the answer form GPT:

10:03:00	Informacje	Response Text: {
  "id": "chatcmpl-AFczjaWWCBeIc23QOv5ajZfY4nrPH",
  "object": "chat.completion",
  "created": 1728288179,
  "model": "gpt-4o-2024-08-06",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "I'm sorry, but I can't describe the picture because I cannot see it. If you have any questions about the image description, I can try to help you based on the available information.",
        "refusal": null
      },
      "logprobs": null,
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 12,
    "completion_tokens": 51,
    "total_tokens": 63,
    "prompt_tokens_details": {
      "cached_tokens": 0
    },
    "completion_tokens_details": {
      "reasoning_tokens": 0
    }
  },
  "system_fingerprint": "fp_2f406b9113"
}

https://platform.openai.com/docs/guides/vision

I think you might need to send the image in base64, but I could be wrong, if you check the documentation it might give you the insight on how to solve it, best of luck and welcome to the community