Assistants API - messages and runs

Hi,

I am trying to maintain a conversation with an assistant in my project.
But I am not sure if I understand the “runs” context correctly.
If I create a message and then create a blocking run, the response message is added to the thread and I can retrieve it by listing the messages.
If I create a streaming run instead, my expectation was to leave that stream open until I close it, and receive subsequent messages from that run.
However, after the first message is received, the stream is closed by the API.
Does that mean I have to create a new run each time I create a new message?
If that's correct, what's the advantage of using a streaming run instead of a blocking run?

You can think of a run like executing the actual command that makes the language model whir.

A “blocking” run basically means “Give me the output when you’re all finished.”

A “streaming” run means “give it to me slowly, in little bits, as you’re producing the output.”
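The difference can be sketched in a few lines of plain Python. Here `fake_model` is a made-up stand-in for the language model (it is not part of any SDK); it yields output piece by piece, and the two functions show how a blocking run and a streaming run consume the exact same output:

```python
def fake_model():
    """Stand-in for the model: yields the output in little bits."""
    for chunk in ["The ", "answer ", "is ", "42."]:
        yield chunk

def blocking_run():
    # "Give me the output when you're all finished."
    return "".join(fake_model())

def streaming_run():
    # "Give it to me slowly, in little bits, as you're producing it."
    for chunk in fake_model():
        print(chunk, end="", flush=True)  # show each piece as it arrives
    print()  # the generator is exhausted here: the "stream" closes

print(blocking_run())  # the complete text, delivered only at the end
streaming_run()        # the same text, delivered piece by piece
```

Both calls produce the same text; the only difference is when you get to see it.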

When a stream is closed, it means it finished producing the output, and the command is finished. You can’t stream information when there’s nothing left to stream.

Technically, if you want the door open all the time, so you could let information stream through at any moment, this would be called a websocket. You cannot establish a websocket to a language model API, nor should you.

The trade-off between the two is latency: with streaming you can start showing output to the user as it's generated, instead of waiting for the whole response to finish.

So, to put all this together: a thread (and the messages within it) captures and packages the data so it can be sent to the language model for processing. Executing a run is what actually produces an output. You can retrieve that output all at once when it's finished, which is a blocking run, or as it's being generated, which is a streaming run. Either way, each new user message needs its own run to get a new response.
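The whole loop can be sketched in plain Python. Everything here (`create_message`, `create_run_stream`, the echo reply) is a hypothetical stand-in, not the real SDK; the point is only the shape of the flow: one new run per user message, with the stream closing after each response.

```python
thread = []  # a thread is just an ordered list of messages here

def create_message(thread, role, content):
    """Append a message to the thread (stand-in for the messages endpoint)."""
    thread.append({"role": role, "content": content})

def create_run_stream(thread):
    """One run processes the thread once; the stream ends when output is done."""
    reply = f"echo: {thread[-1]['content']}"  # stand-in for the model's output
    for word in reply.split():
        yield word  # each chunk of the single response
    # nothing after this point: the stream closes

for user_text in ["hello", "how do runs work?"]:
    create_message(thread, "user", user_text)
    chunks = list(create_run_stream(thread))        # a NEW run per message
    create_message(thread, "assistant", " ".join(chunks))

# thread now holds the full back-and-forth, like listing the messages
```

In the real API the analogous calls live under the SDK's threads/messages/runs namespaces (in the official Python SDK, something like `client.beta.threads.runs.stream(...)` for streaming and a create-and-poll helper for blocking, if memory serves); the structure of the loop is the same.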


Thank you, it's clear to me now.
