Image response from an MCP server with agents sdk

Sravan_Voonna · May 26, 2025, 4:14pm

Hi, I have an agent based on the openai-agents framework and I’m using an MCP server that returns an image when called.

Below is a snippet from the MCP server, where types.ImageContent is from mcp library which I assume to be the standard while outputting an image.

png = base64.b64encode(png).decode("utf-8")
return [types.ImageContent(type="image", data=png, mimeType="image/png")]

The problem is that the agent/model is not interpreting this as an image and instead it is interpreting this as a string(text), and because the png is the string-formatted image, the token usage bumps up(around 100k) because the agent is treating this as text.

I believe there should be an adapter that converts image from MCP defined format i.e., types.ImageContent to openai models accepted format. Is there anything like that? I searched a lot for this but couldn’t find any lead.

Below is the Screenshot from the traces

Does OpenAI support images as response from tools/functions?

brg · July 3, 2025, 11:09am

I am looking for the same functionality. Or at least a way for passing the image data directly to the run context, without going through the LLM.

dcoldburn · July 27, 2025, 3:53pm

Hi, did anybody end up finding a solution to this issue? Do any workarounds exist?

Topic		Replies	Views
How to upload generated image back into context using Agents SDK? API image-reading	1	311	June 27, 2025
Rendering Base64 Images Returned by SD API through OpenAI Assistant API assistants-api	1	795	February 19, 2024
Agents that can generate images API api , image-generation , agents , ai-agents , agents-sdk	4	588	June 22, 2025
OpenAI's Python SDK (v1.37.1) doesn’t support direct image input to GPT models API	6	306	October 30, 2024
Need Assistance with ChatGPT-Image Integration API	2	1197	December 19, 2023

Image response from an MCP server with agents sdk

Related topics