Streaming is now available in the Assistants API!

stephen.walther · April 5, 2024, 9:00pm

Hi hq1 - take a look at the OpenAI Assistant Starter Kit which illustrates how to use Assistant streaming with a NextJS app.

Live App Here: https://openai-assistant-starter-kit.vercel.app/
Blog Walkthrough Here: Use the OpenAI Assistant Starter Kit to Quickly Build New OpenAI Apps - OpenAI Blog - Stephen Walther on OpenAI
Full Source Code Here: GitHub - Superexpert/openai-assistant-starter-kit: Starter Kit for creating an OpenAI Assistant web application using NextJS + ReactJS + TypeScript

The app streams from the server and then parses out the Server Side Events in a ReactJS component. I’m not using any special libraries. Feedback welcomed and appreciated!

sashirestela · April 5, 2024, 9:42pm

Take a look to this:

moonrubble · April 6, 2024, 10:20am

to update - i pivoted to using the vercel ai sdk and could get the streaming working brilliantly. I am still curious why I could not get the websocket to return the stream, but have moved to the vercel way of doing it (for now).

I would love to hear an update, if the websocket streaming is working with a frontned… thx

hq1 · April 7, 2024, 9:41pm

Hi - I will go and check my python code again, as would really like to get this working, and really appreciate the pointers. I did pivot to using the vercel ai sdk and got a working streaming in < 10 minutes.

I will give your suggestions a try and come back with an update.

thanks again!
hq

gubanov.pa · April 12, 2024, 9:49am

Hi everyone, does anyone have an idea if streaming the message object during an assistant call consumes the RPM limit?

60 req/min limit at the user account level.

nathanvirushabadoss · April 13, 2024, 3:18am

i’m also struggling with submit_tools_output_stream.

Burmachach.Nuradilov · April 15, 2024, 5:45pm

How to save this messages history. I was using run.id to retrieve, but now I can’t find out id for each run.

if prompt := st.chat_input("How can I help you?"):
    # Add user message to the state and display on the screen
    st.session_state.messages.append({"role": "user", "content": prompt})
    if prompt:
        run=client.beta.threads.runs.create_and_stream(
                thread_id=thread_id,
                assistant_id=assistant_id,
                model="gpt-4-turbo-preview",
            ) 
        with run as stream:
                with st.chat_message("assistant"):
                    response = st.write_stream(stream.text_deltas)
                    stream.until_done()
    with st.chat_message("user"):
        st.markdown(prompt)

shahir1 · April 21, 2024, 2:16am

You have to uninstall and reinstall the latest version of openai SDK

will14 · May 9, 2024, 11:46am

@shahir1 I updated my library to 1.27.0 and it still has this import issue

smiiith · May 16, 2024, 6:53pm

I’m trying to build a streaming app with openAI assistants where there are two assistants that communicate with each other. I want to stream the interaction between the two back to the frontend, without the end user having to do anything. Do you have any examples of that?

Thanks!

stephen.walther · May 16, 2024, 8:56pm

Hi smiith – you should be able to adapt the OpenAI Assistant Starter Kit code to run threads associated with two different assistants by modifying the server-side code in the route.ts file.

You’ll need to setup two assistants at the OpenAI playground: https://platform.openai.com/playground?mode=assistant
Next, you can build two different threads:

   // add new message to thread
    await openai.beta.threads.messages.create(
        newMessage.threadId,
        {
            role: "user",
            content: newMessage.content
        }
    );

Next, run both of the threads using a particular assistant by using the Assistant Id (you can get the Assistant Ids from the OpenAI playground).

    // create a run
    const stream = await openai.beta.threads.runs.create(
        newMessage.threadId, 
        {assistant_id: newMessage.assistantId, stream:true}
    );

You can pass the responses from the two different assistants back and forth by adding the response messages to each thread.

Hope this helps! And sounds like an intriguing project!

Stephen

will14 · May 29, 2024, 9:59pm

I was able to get it to work when I changed my python runtime from 3.11 to 3.8

GoldenJoe · July 24, 2024, 11:14pm

Still waiting on documentation of the AssistantEventHandler. I see various examples floating around, some of which override certain events, while others do not. It’s unclear when that is necessary, or even what a complete list of the events are. GPT itself doesn’t know either.

Vad.min · September 17, 2024, 10:37am

with client.beta.threads.runs.stream(
        thread_id=thread_id,
        assistant_id=ASS_ID,
    ) as stream:

With the above, I get this error: AttributeError: 'Runs' object has no attribute 'stream', meanwhile it is there.

When I decide to use the AssistantEventHandler, I get this AttributeError: 'Runs' object has no attribute 'stream'.
Meanwhile, when I import AssistantEventHandler to a Jupyter notebook, it imports, but in my script, I get that error.

I’m on version 1.45.1, I downgraded to 1.42.0 but got the same errors.

Vad.min · September 20, 2024, 1:59pm

It worked now. I think my terminal was on a different environment, which might have had a different openai version.

Topic		Replies	Views
Using Streaming Assistants API With Websockets API assistants-api	9	882	January 21, 2025
Has anyone managed to get a tool_call working when stream=True? API api , function-calling	22	19704	May 24, 2024
[Critical] Over 25% Assistant API Request Timeout Randomly API	81	5767	March 18, 2024
Streaming from Text-to-Speech api API api , python , tts	53	51629	January 21, 2025
Multiple function calls with streaming API gpt-4 , function-calling , streaming	6	4605	April 5, 2024

Streaming is now available in the Assistants API!

Related topics