MCP: can a tool return an image (media) to the LLM?

Roman_Skochko · October 6, 2025, 6:50am

Does the standard Responses API or Chat Completions API support returning the actual media content (like PNG/JPG image bytes/data) as the result of a tool/function call, instead of just a URL, so the LLM can directly see and analyze the file?

_j · October 6, 2025, 7:01am

Not internal MCP, but OpenAI finally, and only on Responses, recently allowed placement of images in the function return message that comes from your code.

You can have a function or automatic function definitions by MCP subscription that are actually serviced by an MCP server that you make the API call to. Then you’d be able to send an image in the tool return (along with some AI-targeted messaging of why its there) when an MCP service is configured to also transmit images.

An example use of images as tool input is seen in ChatGPT, impossible on the API because of the feature lockdown and container content lockdown: Code interpreter that can have images returned within reasoning, and then the AI itself loads, crops, zooms to try to get a better view. (which would be an absolute money-burner on the API; you as API developer can deliver optimized sliced images without any AI desparation and futility).

Topic		Replies	Views
Tool Calling - function that returns an Image API	1	619	October 31, 2025
Images and files as function call outputs API	3	906	October 4, 2025
Gpt4-o Support for Image URLS as tool responses API gpt-4 , image-reading , tools , gpt4o	16	1727	July 19, 2025
Returning image as tool output in Assistants API? API function-calling , gpt-4-vision , assistants-api , tools , gpt-4o	4	3910	June 4, 2025
Image response from an MCP server with agents sdk API agents , assistants-api , agents-sdk , responses-api	2	1439	July 27, 2025

MCP: can a tool return an image (media) to the LLM?

Related topics