Actions that return images for inline viewing

peter.tiedemann · January 3, 2025, 2:28pm

I am trying to put together a Custom GPT with several custom actions. For one of those I would basically like to setup Kroki to enable a wide selection of diagram rendering (I realize ChatGPT has some support build in, but I will have same problem for some other tools).

I have generated an OpenAPI schema for kroki(.io) that seems to work fine through Postman etc, and set it up as an action in my custom GPT, but it fails to work.

I am using this schema:

openapi: 3.1.0
info:
  title: Kroki API
  description: API for generating diagrams from text.
  version: 1.0.0
servers:
  - url: https://kroki.io
paths:
  /:
    post:
      operationId: generateDiagram
      summary: Generate a diagram from plain text
      description: Converts a plain text diagram definition into an image.
      requestBody:
        required: true
        content:
          application/json:
            schema:
              type: object
              properties:
                diagram_source:
                  type: string
                  description: The diagram definition in plain text.
                diagram_type:
                  type: string
                  description: The type of diagram (e.g., graphviz, mermaid).
                output_format:
                  type: string
                  description: The desired output format.
                  enum:
                    - png
                    - svg
                    - jpeg
              required:
                - diagram_source
                - diagram_type
                - output_format
      responses:
        "200":
          description: Successful response containing the generated diagram.
          content:
            application/octet-stream:
              schema:
                type: string
                format: binary
        "400":
          description: Invalid request body.
        "500":
          description: Internal server error.

What does the ChatGPT UI expect from an action that returns an image?

Topic		Replies	Views
Problems with a response image from a GPT Action Plugins / Actions builders gpt-4 , chatgpt , api , chatgpt-plugin	1	1068	January 23, 2024
GPT building: Display images delivered via actions Plugins / Actions builders image-reading , gpt	7	3585	January 30, 2024
How to send image to GPT Action and send image from Action to user Plugins / Actions builders chatgpt , plugin-development , actions	11	10611	December 28, 2024
Custom GPT to load a hosted picture and request a new picture GPT builders gpt-4 , gpt-builder , website	3	752	January 21, 2024
Image rendering in GPT output Plugins / Actions builders chatgpt , plugin-development	2	1569	January 6, 2024

Actions that return images for inline viewing

Related topics