How to make custom GPT process an action whose response is an image

dragonzurfer · November 11, 2023, 5:49pm

I am trying to make a custom GPT that writes poems about images returned by the actions available to it. I am using the cataas.com API.

I tried to create an action with this schema:

{
  "openapi": "3.0.0",
  "info": {
    "title": "Random Cat Image API",
    "version": "1.0.0",
    "description": "API that returns a random cat image"
  },
  "servers": [
    {
      "url": "https://cataas.com"
    }
  ],
  "paths": {
    "/cat": {
      "get": {
        "summary": "Get a random cat image",
        "description": "Returns a random cat image",
        "operationId": "getRandomCat",
        "responses": {
          "200": {
            "description": "A random cat image",
            "content": {
              "image/jpeg": {
                "schema": {
                  "type": "string",
                  "format": "binary"
                }
              }
            }
          }
        }
      }
    }
  }
}

which generates a random cat image.

But every time I try something, it fails. Am I making a mistake in the schema, or do actions simply not support image/jpeg as a response at this time?

divinci · November 17, 2023, 7:47am

I hope it’s on the roadmap, as this would open up GPTs to be what they should be: not custom one-shot prompt templates with RAG, but a natural language interface to all types of external API calls (including images).

manueldario.bruna · November 17, 2023, 12:32pm

I’m in the same place. I’m trying to get a dummy image to test, but the debugging response shows an empty JSON “{}”.

Maybe the actions can receive other formats, like JSON or text.

Has anyone tried base64 or a link inside JSON?
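
For anyone who wants to try the base64-or-link idea, here is a minimal sketch of what a small wrapper in front of the image API could return instead of raw image/jpeg. The helper name `image_to_json_payload` and the field names (`image_base64`, `image_url`, and so on) are made up for illustration and are not part of cataas.com or of GPT actions. Whether an action then handles such a response is untested here.

```python
import base64
import json


def image_to_json_payload(image_bytes, content_type="image/jpeg", url=None):
    """Wrap raw image bytes (and optionally a link) in a JSON-serializable dict.

    Hypothetical helper: the field names are assumptions, not a documented format.
    """
    payload = {
        "content_type": content_type,
        "size_bytes": len(image_bytes),
        # base64 turns the binary body into plain ASCII text that fits in JSON
        "image_base64": base64.b64encode(image_bytes).decode("ascii"),
    }
    if url is not None:
        # A link is much smaller than the encoded image itself
        payload["image_url"] = url
    return payload


if __name__ == "__main__":
    # Stand-in bytes, not a real image; a real wrapper would fetch the picture first
    fake_jpeg = b"\xff\xd8\xff\xe0" + b"\x00" * 8
    body = json.dumps(image_to_json_payload(fake_jpeg, url="https://example.com/cat.jpg"))
    print(body)
```

The action schema would then declare application/json for the 200 response instead of image/jpeg. Keep in mind that base64 makes the payload roughly a third larger than the original image, so the link-only variant may be the more practical one to test first.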

divinci · November 17, 2023, 12:39pm

I’ll do some testing myself now and see if I can crack it

jp2023 · December 21, 2023, 10:39am

This got me further, but I hit another error loading a plugin, presumably PIL. Maybe there’s another way.
