Functions calling with streaming

Hi, just wanted to share that function calling with streaming can actually work very smoothly.
To manage both “normal” answers and function-calling you can do something like this on the generator object being received from API request with stream = True:

response_text = ""
    function_call_detected = False
    for response_chunk in response_generator:
        # print(f"Response chunk: {response_chunk}")
        if "choices" in response_chunk:
            deltas = response_chunk["choices"][0]["delta"]
            if "function_call" in deltas:
                function_call_detected = True
                if "name" in deltas["function_call"]:
                    func_call["name"] = deltas["function_call"]["name"]
                if "arguments" in deltas["function_call"]:
                    func_call["arguments"] += deltas["function_call"]["arguments"]
            if (
                function_call_detected
                and response_chunk["choices"][0].get("finish_reason") == "function_call"
            ):
                print(f"Function generation requested, calling function")
                function_response_generator = call_3A_function(
                    messages, func_call, user_id
                )
                for function_response_chunk in function_response_generator:
                    if "choices" in function_response_chunk:
                        deltas = function_response_chunk["choices"][0]["delta"]
                        if "content" in deltas:
                            response_text += deltas["content"]
                            yield response_text
            elif "content" in deltas and not function_call_detected:
                response_text += deltas["content"]
                yield response_text

(here yield was used in order to make stream immediately available to a flask app to start streaming it to front-end.

15 Likes

Thanks for the inspiration! :slight_smile: Solved an annoying problem that i had.

Your example helped a lot. Thanks

One thing that I noticed is that the streaming takes time for in case of function call when compared to normal completion without function call. Did you also observe this?