Can WebRTC Be Used for a Real-Time Text-to-Text Chatbot Instead of WebSockets?

goldast.xps · March 6, 2025, 2:46am

Hi everyone,

I have a question regarding WebRTC and its potential use for a text-based chatbot. I understand that WebRTC is typically used for real-time voice and video communication, but I’m wondering if it’s possible to leverage WebRTC for a real-time text-to-text chatbot—without using voice—so that I can avoid the costs associated with real-time voice processing and only pay for real-time text.

Due to certain constraints, I’m unable to use WebSockets for this project, which is why I’m exploring alternative options.

Would using WebRTC for this purpose be viable? If so, are there any significant drawbacks compared to WebSockets when handling a real-time chatbot?

I’d really appreciate any insights or experiences you can share. Thanks in advance!

goldast.xps · March 6, 2025, 3:07am

This is what I tried to implement but wasn’t able to use it:

const peerConnection = new RTCPeerConnection({});
const dataChannel = peerConnection.createDataChannel('oai-events', {
   ordered: true,
});

const offer = await peerConnection.createOffer({});
await peerConnection.setLocalDescription(offer);

const sdpResponse = await fetch(`${OPENAI_BASE_URL}?model=${OPEN_AI_REALTIME_MODEL}`, {
        method: 'POST',
        body: offer.sdp,
        headers: {
          Authorization: `Bearer ${EPHEMERAL_KEY}`,
          'Content-Type': 'application/sdp',
        },
});

 const answer = {
   type: 'answer',
   sdp: await sdpResponse.text(),
};

await peerConnection.setRemoteDescription(answer);

peerConnectionRef.current = peerConnection;

But what I get from the OpenAI Server is this error/response:


{
    "type": "answer",
    "sdp": "{\"error\":{\"message\":\"Invalid SDP offer. Offer did not have an audio media section.\",\"type\":\"invalid_request_error\",\"param\":null,\"code\":\"invalid_offer\"}}"
}

Only thing that I need is a text communication through the WebRTC protocol.

Sending a message like the below code:

    const event = {
      type: 'conversation.item.create',
      item: {
        type: 'message',
        role: 'user',
        content: [
          {
            type: 'input_text',
            text: testCounter.current.toString(),
          },
        ],
      },
    };

    dataChannel?.send(JSON.stringify(event));

and parsing the response like this:

      dataChannel.addEventListener('message', (event) => {
        try {
          const message = JSON.parse(event.data);

          if (message.type === 'response.done' && message?.response?.output[0]?.content[0]?.text) {
            logger(message?.response?.output[0]?.content[0]?.text);
          }
        } catch (error) {
          logger(error, 'Error in data channel message');
        }
      });

But I can not when I don’t put the audio in the connection flow.

wadesolowoniuk · March 6, 2025, 4:56am

Yes, I couldn’t get the webrtc to work either, I do use websockets to play around the real time API. I found the 4o mini realtime works better than the 4o realtime. I got more static using the full model. I do use the voice function

goldast.xps · March 8, 2025, 3:14am

#Up

Anyone idea about how we can use WebRTC only for sending / receiving events through stringified JSONs?

Topic		Replies	Views
RealtimeAPI: WebRTC (Client) + WebSocket (Server) possible? API realtime	12	389	February 23, 2025
Is realtime api directly speech to speech? API realtime , api-realtime-speech	13	1139	January 14, 2025
Send function call output from server in WebRTC connection API	2	81	April 10, 2025
For those who've built a GPT4 chatbot with streaming ... how? Webhooks vs. Server-Sent Event? API gpt-4 , api , streaming	3	1969	January 30, 2024
WebRTC transcription guide seems to be broken Bugs	12	405	April 1, 2025

Can WebRTC Be Used for a Real-Time Text-to-Text Chatbot Instead of WebSockets?

Related topics