Allow GPT builders to disable Voice Mode when Custom GPT Actions are not supported

Hi OpenAI team,

I would like to file a feature request regarding Voice Mode / Advanced Voice Mode in Custom GPTs that depend on Actions / function calls.

There are already several community reports about this issue. Earlier discussions first asked for Advanced Voice Mode support in Custom GPTs; later comments reported that Advanced Voice Mode became available, but Actions are skipped or not executed, while the same Custom GPT works correctly in text mode. There is also a newer thread specifically about function calling not working in Advanced Voice Mode. (OpenAI Developer Community)

My request is slightly different from “please make Voice Mode support Actions,” although that would of course be ideal.

The immediate product problem is this:

Voice Mode is still offered to users even when the Custom GPT cannot fulfill its core purpose without Actions.

For many Custom GPTs, Actions are not optional enhancements. They are the core integration layer. In my case, the GPT depends on backend calls for learner state, curriculum selection, progression logic, and next-step computation. In text chat, this works as expected. In Voice Mode, however, the required backend calls are not executed, so the user enters a degraded experience without understanding why.

This creates a very poor UX:

  • the user assumes they are using the same Custom GPT experience,

  • the GPT may claim or imply it can proceed,

  • but the required backend logic is unavailable,

  • and the creator has no clean way to prevent this broken flow.

Feature request:

Please provide at least one of the following options:

  1. Automatically disable Voice Mode for Custom GPTs where configured Actions are not supported in Voice Mode.

  2. Provide a builder-controlled setting to disable Voice Mode explicitly, for example:

    disable_voice_mode: true

    or:

    voice_mode: disabled

  3. Show a clear warning before entering Voice Mode, for example:

    “This GPT uses Actions that are not available in Voice Mode. Some functionality may not work.”
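
To make the three options concrete, here is a minimal sketch of the gating logic I have in mind. Everything in it is hypothetical: `GptConfig`, `disable_voice_mode`, `voice_supports_actions`, and `warn_only` are illustrative names only and do not correspond to any existing OpenAI setting or API.

```python
from dataclasses import dataclass, field
from enum import Enum

# Warning text taken from option 3 above.
VOICE_WARNING = (
    "This GPT uses Actions that are not available in Voice Mode. "
    "Some functionality may not work."
)


class VoiceEntry(Enum):
    SHOW = "show"  # offer Voice Mode normally
    WARN = "warn"  # offer Voice Mode, but show VOICE_WARNING first
    HIDE = "hide"  # do not offer Voice Mode for this GPT


@dataclass
class GptConfig:
    # Hypothetical builder-side configuration; not a real OpenAI schema.
    actions: list[str] = field(default_factory=list)
    disable_voice_mode: bool = False  # option 2: explicit builder flag


def voice_entry(
    config: GptConfig,
    voice_supports_actions: bool,
    warn_only: bool = False,
) -> VoiceEntry:
    """Decide how the Voice Mode entry point should behave for one GPT."""
    # Option 2: the builder opted out explicitly, so always hide.
    if config.disable_voice_mode:
        return VoiceEntry.HIDE
    # The GPT depends on Actions that Voice Mode cannot execute.
    if config.actions and not voice_supports_actions:
        # Option 3 (warn) or option 1 (disable automatically).
        return VoiceEntry.WARN if warn_only else VoiceEntry.HIDE
    # No Actions configured, or Voice Mode can run them: nothing to gate.
    return VoiceEntry.SHOW


if __name__ == "__main__":
    tutor = GptConfig(actions=["get_learner_state", "compute_next_step"])
    print(voice_entry(tutor, voice_supports_actions=False))
    print(voice_entry(tutor, voice_supports_actions=False, warn_only=True))
```

The point of the sketch is that the check needs only information the platform already has (whether Actions are configured, and whether Voice Mode can execute them), plus at most one builder-set flag.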

Expected behavior:

If a Custom GPT depends on Actions, users should not be silently routed into a mode where those Actions are unavailable. Either Voice Mode should support the Actions, or the builder should be able to disable Voice Mode for that GPT.

Why this matters:

For education, workflow automation, enterprise assistants, and domain-specific copilots, the backend is often the source of truth. If Voice Mode bypasses that backend, the GPT may become misleading rather than merely limited.

This is especially painful on mobile, where Voice Mode is highly visible and attractive to users. The current behavior creates avoidable confusion and loss of trust.

A creator-side flag to disable Voice Mode would already be a major improvement while full Voice + Actions support is still unavailable or unreliable.

Thank you for considering this.

I totally agree, but I would be hugely disappointed if the development team spent effort on an opt-out feature for a clear product gap. I think we all just want Voice Mode to work in Custom GPTs that use function calling. This is a huge blocker on the road to enterprise adoption.

I fully agree that the real solution should be Voice Mode working properly with function calls / Actions in Custom GPTs.

If I could choose, that would absolutely be my preferred outcome as well.

My point is not that an opt-out flag should replace proper Voice + Actions support. It is more of a practical fallback while that support is unavailable or unreliable.

I can imagine that making function calls work fluently in a real-time, low-latency voice interaction may be technically difficult. But compared to that, conditionally disabling the Voice Mode entry point — or giving builders a simple flag to disable it — should be much easier.

Right now the UX is worse than just “feature not supported.” The user is actively offered Voice Mode, enters it, and only then discovers that the GPT can no longer perform its core task. For GPTs that depend on backend calls, that is a broken flow.

So yes: Voice Mode + Actions should be the goal. But until that works reliably, builders need a way to prevent users from being routed into a mode where the GPT is known to fail.

Hey @all, yeah, this is a solid callout. If Voice Mode is available but the GPT can’t actually use the Actions it depends on, it’s easy to see how that could feel broken or misleading.

Really appreciate you taking the time to spell this out. I can’t share a timeline right now, but I’ll make sure it gets passed along internally.

- Sunny