How to Resume Streaming in Python After Submitting Function Call Outputs in OpenAI Assistants API?

Subject: Streaming Stops After Function Call Response Submission

Hi,

I am trying to use function calls with streaming. I can successfully stream responses up until the requires_action event, after which I submit the function tool call response.

This submission happens successfully, and if I print all messages, I can see that the assistant does reply after the function call result is submitted.

However, the stream does not continue with the assistant's response; it stops right after the function call result is submitted.

Does anyone know how to make the stream continue and capture the assistant’s full response after the function call execution?


Steps to Reproduce

Below is my implementation:

from openai import OpenAI
import os
from dotenv import load_dotenv
import requests
import json  # Added for handling JSON

load_dotenv()
client = OpenAI()

assistant = client.beta.assistants.retrieve("asst_oa8dz4R0M9XBWdOqmxq22Lnp")
thread = "thread_bG77CUSW5xqWE3PfHXcJIZW0"

message = client.beta.threads.messages.create(
    thread_id=thread,
    role="user",
    content="Can you please use both Code Interpreter and the multiplication tool (not the multiplication tool twice, one of the tries has to be with Code Interpreter) to find 1111 times 1111?"
)

print("Starting run stream...")

stream = client.beta.threads.runs.create(
    thread_id=thread,
    assistant_id=assistant.id,
    instructions="Please address the user as Jane Doe. The user has a premium account.",
    stream=True,
)

def multiply_two_numbers(a, b):
    return a * b

stream_content = [] 
for event in stream:
    stream_content.append(event)

    if event.event == "thread.message.created":
        print("\nAssistant:")
    elif event.event == "thread.message.delta":
        print(event.data.delta.content[0].text.value, end='')
    elif event.event == "thread.run.step.created":
        if event.data.step_details.type == "tool_calls":
            print("\nTool Call:")
    elif event.event == "thread.run.step.delta":
        if hasattr(event.data.delta.step_details.tool_calls[0], "code_interpreter"):
            print(event.data.delta.step_details.tool_calls[0].code_interpreter.input, end='')
        else:
            if event.data.delta.step_details.tool_calls[0].function.name:
                print(f"Calling function {event.data.delta.step_details.tool_calls[0].function.name}")
            print(event.data.delta.step_details.tool_calls[0].function.arguments, end='')

    elif event.event == "thread.run.requires_action":
        tool_outputs = []
        current_run = client.beta.threads.runs.list(thread_id=thread).data[0]
        
        for tool in event.data.required_action.submit_tool_outputs.tool_calls:
            function_name = tool.function.name
            arguments = json.loads(tool.function.arguments)
            
            if function_name == "multiply_two_numbers":
                multiply_result = multiply_two_numbers(arguments.get("a"), arguments.get("b"))
                tool_outputs.append({
                    "tool_call_id": tool.id,
                    "output": str(multiply_result)
                })

        # Submit all tool outputs at once after collecting them in a list
        if tool_outputs:
            print("\nThe tool output is:")
            print(tool_outputs)
            run = client.beta.threads.runs.submit_tool_outputs(
                thread_id=thread,
                run_id=current_run.id,
                tool_outputs=tool_outputs
            )
            print("Tool outputs submitted successfully.")

# The loop then ends; the last event received is 'thread.run.step.completed', with no further message deltas.

Observed Behavior

  • Before the function call (requires_action event):
    • Streaming works as expected.
  • After submitting the function call response:
    • The assistant correctly processes the function output and replies.
    • However, the stream stops, and I do not see the assistant’s response in real-time.
  • If I fetch all messages manually, I can see the assistant’s reply.

Expected Behavior

  • The stream should continue past the function call and display the assistant’s response after processing the function result.
  • ChatGPT’s UI does not stop streaming after function calls, so the API should also support this.

Question

How can I ensure that the stream continues after submitting the function call results?

Any help would be much appreciated!

Thanks!

Creating Streams

There are three helper methods for creating streams, a newer pattern than the manual runs.create(..., stream=True) loop you demonstrate in your code:

client.beta.threads.runs.stream()

This method starts a new run on an existing thread that is already populated with messages and streams the response.
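For example, a minimal sketch reusing the client, thread, and assistant objects from your snippet (text_deltas is the helper's convenience iterator over just the assistant's text):

with client.beta.threads.runs.stream(
    thread_id=thread,
    assistant_id=assistant.id,
    instructions="Please address the user as Jane Doe. The user has a premium account.",
) as stream:
    # text_deltas yields only the assistant's text chunks, already parsed.
    for text in stream.text_deltas:
        print(text, end="", flush=True)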


client.beta.threads.create_and_run_stream()

This method creates a thread (optionally seeded with messages), starts a run on it, and then streams the response.
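A sketch, assuming a fresh thread is acceptable for your use case; the user message text here is just an illustrative stand-in:

with client.beta.threads.create_and_run_stream(
    assistant_id=assistant.id,
    # The thread is created on the fly, seeded with one user message.
    thread={"messages": [{"role": "user", "content": "What is 1111 times 1111?"}]},
) as stream:
    for text in stream.text_deltas:
        print(text, end="", flush=True)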


client.beta.threads.runs.submit_tool_outputs_stream()

This method submits tool outputs to a run that is waiting on them (status requires_action) and returns a new stream for the remainder of that run. This is the one your requires_action branch needs in place of the plain submit_tool_outputs call.
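Here is a sketch of how your event loop could look with that helper, reusing client, thread, assistant, multiply_two_numbers, and the json import from your snippet. Note the run id is taken from event.data.id, so the extra runs.list call is not needed:

with client.beta.threads.runs.stream(
    thread_id=thread,
    assistant_id=assistant.id,
) as stream:
    for event in stream:
        if event.event == "thread.message.delta":
            print(event.data.delta.content[0].text.value, end='', flush=True)
        elif event.event == "thread.run.requires_action":
            tool_outputs = []
            for tool in event.data.required_action.submit_tool_outputs.tool_calls:
                arguments = json.loads(tool.function.arguments)
                if tool.function.name == "multiply_two_numbers":
                    tool_outputs.append({
                        "tool_call_id": tool.id,
                        "output": str(multiply_two_numbers(arguments.get("a"), arguments.get("b"))),
                    })
            if tool_outputs:
                # Submitting through the *_stream helper returns a new stream for
                # the rest of the run, so the reply after the tool call is live too.
                with client.beta.threads.runs.submit_tool_outputs_stream(
                    thread_id=thread,
                    run_id=event.data.id,  # the requires_action event carries the run
                    tool_outputs=tool_outputs,
                ) as tool_stream:
                    for tool_event in tool_stream:
                        if tool_event.event == "thread.message.delta":
                            print(tool_event.data.delta.content[0].text.value, end='', flush=True)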

Using any of these also requires understanding which events need to be collected or parsed from the stream; alternatively, the SDK can do that parsing for you through an event handler, as sketched below.
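For reference, a sketch of the event-handler variant; PrintingHandler is just an illustrative name, and on_text_created / on_text_delta are callbacks defined by the SDK's AssistantEventHandler helper:

from typing_extensions import override
from openai import AssistantEventHandler

class PrintingHandler(AssistantEventHandler):
    @override
    def on_text_created(self, text) -> None:
        print("\nAssistant:")

    @override
    def on_text_delta(self, delta, snapshot) -> None:
        print(delta.value, end="", flush=True)

with client.beta.threads.runs.stream(
    thread_id=thread,
    assistant_id=assistant.id,
    event_handler=PrintingHandler(),
) as stream:
    # Block until the run finishes; the handler prints text as it streams.
    stream.until_done()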
