Suggestion: Capped Token Coverage and Pre-Send Multimodal Inputs for the App SDK

I’ve been developing an app inside ChatGPT using the App SDK and MCP, and after building a real feature I’ve run into some structural limitations that significantly affect developer adoption.

In its current form, the App SDK is powerful as a tool framework, but it stops short of letting apps meaningfully use the model itself, especially in multimodal scenarios. If the ChatGPT platform could partially cover model token usage for apps, under explicit caps, the overall developer value proposition would be much stronger.
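To make the coverage idea concrete, here is one hypothetical shape such a declaration could take in an app manifest. This is purely a sketch: `tokenCoverage` and every field inside it are invented for illustration and do not exist in the App SDK today.

```typescript
// Hypothetical manifest fragment for capped, platform-covered token usage.
// None of these fields are real App SDK configuration; they only illustrate
// the kind of explicit limits the suggestion has in mind.
const appManifest = {
  name: "my-example-app",
  tokenCoverage: {
    coveredTokensPerUserPerDay: 20_000, // platform absorbs model cost up to this cap
    overflow: "developer-billed",       // beyond the cap, the developer pays
    modalities: ["text", "image"],      // which input types count against the cap
  },
} as const;

// A platform-side check might then be as simple as:
function isCovered(tokensUsedToday: number, requested: number): boolean {
  return (
    tokensUsedToday + requested <=
    appManifest.tokenCoverage.coveredTokensPerUserPerDay
  );
}
```

The point of the sketch is that coverage stays bounded and auditable: the developer opts in to a hard cap, and anything beyond it falls back to normal billing.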

More importantly, an extension point for pre-send multimodal inputs would be highly impactful. Concretely, before the user clicks “Send,” a tool could be allowed, with explicit user confirmation, to attach images, audio, or files to the outgoing user message, rather than only returning tool outputs after generation.
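As a sketch of what that extension point might look like: the names below (`PreSendContext`, `requestUserConsent`, `attach`, `onPreSend`) are entirely hypothetical and not part of any real App SDK API; they only illustrate the consent-gated flow being proposed.

```typescript
// Hypothetical pre-send hook. Nothing here exists in the App SDK today;
// the types model the proposal: a tool may attach multimodal content to the
// draft user message, but only after an explicit, platform-mediated consent.

type Attachment =
  | { kind: "image"; mimeType: string; data: Uint8Array }
  | { kind: "audio"; mimeType: string; data: Uint8Array }
  | { kind: "file"; name: string; mimeType: string; data: Uint8Array };

interface PreSendContext {
  draftText: string; // the user's message before "Send" is clicked
  // Platform-owned consent dialog; resolves false if the user declines.
  requestUserConsent(reason: string): Promise<boolean>;
  attach(attachment: Attachment): void; // only called after consent
}

// Example tool-side hook: offer to attach a generated image to the message.
async function onPreSend(ctx: PreSendContext): Promise<void> {
  const ok = await ctx.requestUserConsent(
    "Attach the image this app just generated to your message?"
  );
  if (!ok) return; // nothing is attached without explicit approval
  ctx.attach({
    kind: "image",
    mimeType: "image/png",
    data: new Uint8Array([0x89, 0x50, 0x4e, 0x47]), // placeholder bytes
  });
}
```

Keeping the consent dialog on the platform side, rather than in tool-controlled UI, is what preserves the trust boundary: the tool can propose an attachment, but only the user can approve it.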

With clear user consent, strict limits, and platform-level safeguards, this could unlock significantly more compelling in-app experiences without undermining cost control or trust boundaries.

I believe this kind of constrained, opt-in pre-send capability would make the App SDK far more attractive to serious developers building inside ChatGPT.