Hi there, I’m trying to assess some assistants behaviour and I can’t tell if it’s behaving strangely due to my input or due to a bug.
I’m seeing that a single run is causing dozens of messages (in one case around 35) to be appended to a thread. In my case: I’m using gpt-3.5-turbo, I’m beginning with a thread consisting of some user and assistant messages, and I’m creating a run with some override instructions directing the assistant to classify the conversation so far and respond with a small bit of JSON. All of the resulting messages seem to be well formed JSON and appropriate responses to the instructions, I just don’t understand why there are dozens of messages.
An edit for clarity about what I’m doing: I start this classification run right after completing another run, where the assistant has appended a message responding to the user. I don’t append any additional messages before starting the classifier run. It strikes me that this may be an unusual usage pattern for the assistants API.
Has anyone else encountered something like this?