Assistant API with gpt-4 turbo delivers back prompt as answer

kachari.bikram42 · December 5, 2023, 2:24pm

quite a number of times I have faced the issue, where the assistant api with the new gpt 4 turbo model delivers back the user prompt as answer. This is strange. I have never come across this issue with the previous models.

kristianv · December 6, 2023, 3:23am

How are you checking the thread for new messages? You are sure you are not just checking for the latest message before the prompt/run has completed? If so, the latest message will be your prompt (with role=user) until the thread is done with your run and provided a new message (with role = assistant).

kachari.bikram42 · December 6, 2023, 9:28am

I am waiting for the run to complete and then retrieving the message. The same code works most of the time for the same prompt, but sometimes I face this issue, where I get the prompt back as an answer. Below I am sharing a code snippet that I am using to poll the assistant, check the status and retrieve the answer -


        while True:
            run_status = await self.check_run_status(run.id, thread.id)
            if run_status.status == "requires_action":
                if tool_instances:
                    self.__logger.info("need to make a function call")
                    function_ids_to_result_map = await self.handle_function_calls(run_status, tool_instances)
                    self.__logger.info("function_ids_to_result_map: ", function_ids_to_result_map)
                    if function_ids_to_result_map:
                        await self.submit_tool_outputs(thread.id, run_status.id, function_ids_to_result_map)
                        self.__logger.info("checking run status after submitting tool output")

                else:
                    raise ThreadRunException(f"status is {run_status.status}, but no tool instances are defined")

            elif run_status.status == "completed":
                break

            elif run_status.status in ["cancelling", "cancelled", "failed", "expired"]:
                raise ThreadRunException(f"Thread {thread.id} ran into an issue")

            await asyncio.sleep(polling_interval)

        # Retrieve the latest message
        thread_messages = client.beta.threads.messages.list(thread_id=thread.id, order="desc", limit=1)
        assistant_message = None
        async for message in thread_messages:
            self.__logger.info(f"message id: {message.id}")
            self.__logger.info(f"content: {message.content[0].text.value}")
            assistant_message = message
            break

        return assistant_message

kristianv · December 6, 2023, 9:57am

And this 1 message you get back has role assistant (and not user)?

Since you only retrieve 1 message and order by the created_at value, you might be facing an issue where your message and the reply from the assistant are created at the same time. This timestamp is according to created_at a Unit timestamp in seconds, so if it takes less than 1 second to produce the response, you could be limiting to 1 message ordering on a value where multiple message have the same value.

It is better to retrieve a larger set of messages, and use the role (assistant) to determine which message to process as the assistant’s response.

pedrods · December 6, 2023, 10:03am

Getting the same problem, but with gpt-3.5-turbo-1106.

kachari.bikram42 · December 6, 2023, 11:56am

" And this 1 message you get back has [role] assistant (and not user )?" ---- I am not checking on this part. I missed this. Let me try out your solution. Thanks for the help

kachari.bikram42 · December 6, 2023, 2:35pm

I tried your solution. What I saw is that even after retrieving a larger set of messages, only one message with role as “user” gets added to the list of messages and no assistant message gets added. The same prompt works sometimes perfectly , at other times only message with the role as user gets added.

eric.mcgivney · January 10, 2024, 11:42am

@kachari.bikram42 I had this same problem and seem to have solved it by adding a delay before reading the thread:

def ask_assitant(user_message):
    assistant = client.beta.assistants.retrieve(ASSISTANT_ID)
    thread = client.beta.threads.create()

    client.beta.threads.messages.create(
        thread_id=thread.id, role="user", content=user_message
    )

    run = client.beta.threads.runs.create(
        thread_id=thread.id, assistant_id=assistant.id
    )

    completed_run = wait_on_run(run, thread)

     time.sleep(30) # INSERTING DELAY HERE HELPED

    messages = client.beta.threads.messages.list(thread_id=thread.id, order="desc")
    new_message = messages.data[0].content[0].text.value
    
    return new_message


def wait_on_run(run, thread):
    while run.status == "queued" or run.status == "in_progress":
        run = client.beta.threads.runs.retrieve(
            thread_id=thread.id,
            run_id=run.id,
        )
        print(f"Run status: {run.status}")
        time.sleep(5)
    return run

kachari.bikram42 · January 11, 2024, 11:51am

Thanks for the help. I will surely try it out. did you find out the reason for the problem and why adding a delay solves the problem for you?

Topic		Replies	Views
OpenAi Api Assistant Answers with the Question value API gpt-35-turbo , chatgpt , assistants , assistants-api	1	720	January 19, 2024
No answer message with openai assistants api API chatgpt , api	9	2553	March 2, 2024
[Assistant API] Discrepancies in Function Triggering and Output Submission in Bulk Message Threads with OpenAI API API	10	843	January 21, 2024
Assistant API response missing code API gpt-4	2	693	January 12, 2024
Assistant is repeating itself in a single run API gpt-35-turbo , assistants , assistants-api	4	840	January 21, 2024

Assistant API with gpt-4 turbo delivers back prompt as answer

Related Topics