Unknown parameter: 'modalities'. when creating transcriptionSessions

payments.co.mz · March 23, 2025, 12:48pm

I am trying to create a transcriptionSessions with these parameters:

And I am getting this response from the API
Unknown parameter: ‘modalities’.

But according to the documentation this parameters is supported by the API.

I am using the last version of the nodeJs SDK: 4.89.0

_j · March 23, 2025, 12:55pm

You can be rejected by the SDK validation before anything is sent if you do not have the latest version of the library incorporating changes needed.

openai/resources/beta/realtime/transcription_sessions.py

    def create(
        self,
        *,
        include: List[str] | NotGiven = NOT_GIVEN,
        input_audio_format: Literal["pcm16", "g711_ulaw", "g711_alaw"] | NotGiven = NOT_GIVEN,
        input_audio_noise_reduction: transcription_session_create_params.InputAudioNoiseReduction
        | NotGiven = NOT_GIVEN,
        input_audio_transcription: transcription_session_create_params.InputAudioTranscription | NotGiven = NOT_GIVEN,
        modalities: List[Literal["text", "audio"]] | NotGiven = NOT_GIVEN,
        turn_detection: transcription_session_create_params.TurnDetection | NotGiven = NOT_GIVEN,
        # Use the following arguments if you need to pass additional parameters to the API that aren't available via kwargs.
        # The extra values given here take precedence over values defined on the client or passed to this method.
        extra_headers: Headers | None = None,
        extra_query: Query | None = None,
        extra_body: Body | None = None,
        timeout: float | httpx.Timeout | None | NotGiven = NOT_GIVEN,
    ) -> TranscriptionSession:

payments.co.mz · March 23, 2025, 1:19pm

I am using the last version of the nodeJs SDK: 4.89.0

_j · March 23, 2025, 2:17pm

Support for modalities seems to be there:

github.com/openai/openai-node

src/resources/beta/realtime/transcription-sessions.ts

master

// File generated from our OpenAPI spec by Stainless. See CONTRIBUTING.md for details.

import { APIResource } from '../../../resource';
import * as Core from '../../../core';

export class TranscriptionSessions extends APIResource {
  /**
   * Create an ephemeral API token for use in client-side applications with the
   * Realtime API specifically for realtime transcriptions. Can be configured with
   * the same session parameters as the `transcription_session.update` client event.
   *
   * It responds with a session object, plus a `client_secret` key which contains a
   * usable ephemeral API token that can be used to authenticate browser clients for
   * the Realtime API.
   */
  create(
    body: TranscriptionSessionCreateParams,
    options?: Core.RequestOptions,
  ): Core.APIPromise<TranscriptionSession> {
    return this._client.post('/realtime/transcription_sessions', {

This file has been truncated. show original

In the API reference, modalities is optional (but doesn’t indicate the default). I would suspect that you don’t want AI voice-out for your voice-in.

payments.co.mz · March 23, 2025, 7:00pm

Yes, I want the text only. But seams like the API does not support this property.
The SDK implementation is OK

Topic		Replies	Views
Error: "Unknown parameter: 'session'" when using OpenAI Realtime API API gpt-4 , api , realtime , api-realtime	2	381	November 23, 2024
Error during updating session API realtime	1	97	October 10, 2024
Transcription config for `gpt-4o-mini-transcribe` doesn't work? Bugs	4	259	March 21, 2025
[Realtime API] Input audio transcription is not showing Bugs realtime	9	2055	February 28, 2025
Input_audio_format not correctly setting (Advanced Voice API) API api	0	80	December 19, 2024

Unknown parameter: 'modalities'. when creating transcriptionSessions

Related topics