Any updates on Assistant API Streaming?

logankilpatrick · February 16, 2024, 2:34pm

This is our first shot on goal of making a higher level API (which is still in Beta), future iterations will be much quicker. As it turns out, building something people will really love building with takes time a while. I am hopeful the wait will be worth it : )

GurEt · February 16, 2024, 3:11pm

Excited by your update on OpenAI’s beta API, Logan!
Any chance to join the waitlist
and contribute feedback?

dpasca · February 16, 2024, 3:41pm

Taking time to develop something like this, at this scale, is perfectly legitimate. I’d be happy to wait a few more months, if there was at least a hint of a timeline to work with.

The issue is with communication… streaming was announced as “coming soon” months ago, and the state hasn’t changed.
Timing right now is everything. I’ve been lurking for two months, and every day I wonder whether I should finally allocate a week+ of work to roll my own Assistant + streaming, or wait just another day… without knowing if “soon” is 1 more day, week, month.

jorgeintegrait · February 16, 2024, 9:56pm

Thank you for the additional information @logankilpatrick. As feedback from an implementation developer perspective, it would be really impactful for our capacity to use the Assistants API in production if it was more transparent in two key aspects:

Internal Tool usage: For the documentation to explain how the assistant interacts with the tools retrieval, code interpreter, and dall-e, including the instructions, functions available, etc. This would ideally include a breakdown of how the Assistant does RAG. How are the documents indexed? which embedding model is used? What is the workflow for retrieval?
Token usage: At the moment, this may be the biggest limiter we have seen from clients and other developers. There is little predictability in cost, which is particularly complex whilst using retrieval or code generation, which can sometimes un unpredictably use huge amounts of tokens are used by the browsing tool. Not only access to after-the fact counts of tokens, but also control oon max-tokens per thread/run + rate limits for assistants would be a great start.

Thank you and the team for all the work you do! Looking forward to seeing more

thibauld · February 17, 2024, 12:00am

That made my day! Thanks for communicating, highly appreciated. Output streaming cannot come soon enough

bscribble1350 · February 17, 2024, 11:41pm

I’d like to second this comment. We could really use streaming as soon as you all can get it done!!

yvoderooij · February 18, 2024, 7:56am

Thanks for the response, really glad to hear there is still progress being made and the next milestone is near.

audioone · February 26, 2024, 1:37am

Attention OpenAI, still waiting for Response Streaming. this is really stumping developers not being able to use Assistants API as a production ready option for building chatbots without having to worry about all the maintenance previously required.

Sora is great. But don;t forget us, your other revenue stream.

hardwickcarl · February 27, 2024, 3:54pm

Any update here Logan? We have built a product around the API and are in a holding pattern waiting for streaming as it is completely unusable at the moment. Just an expected timeline would help a ton. Thanks, sir

yvoderooij · February 29, 2024, 9:14pm

Hey Logan, any updates yet? Thanks in advance

lucasishuman · March 5, 2024, 9:51pm

rluis · March 5, 2024, 10:07pm

I get a persistent error when turning the streaming preview on. Error goes away with it off.

Robs · March 6, 2024, 9:23am

Unfortunately, Logan no longer works at OpenAI.

atty-openai · March 6, 2024, 9:25am

Hi all — we’re close to launching streaming and are beginning testing in the playground this week. We’ll let you know once it is live!

audioone · March 6, 2024, 9:41am

Thanks. This is great news. You have just changed our trajectory.

yvoderooij · March 6, 2024, 8:53pm

Amazing, thanks!

Getting greedy here… but by any chance also vision implemented at the api?

CinematicDev · March 6, 2024, 10:45pm

You can just use a function tool and use vision through the completions api. It’s what i’m doing for my assistant

tom35 · March 6, 2024, 10:53pm

Thank you for the update

gregjanik · March 7, 2024, 10:42am

It looks like he Assistants playground keeps streaming the same response over and over again - unsure where to report this. Still a way to go, but I’m excited to see progress!

atty-openai · March 7, 2024, 8:44pm

Sorry about this! Could you share more details or a reproduction? Run ID / Thread ID would also be helpful.

Topic		Replies	Views
Stream Assistant Response API	13	14680	March 13, 2024
Steaming in Assistant API API assistants-api , streaming	3	4499	March 15, 2024
Assistants API and Streaming API	3	2235	March 14, 2024
Streaming Responses in OpenAI Assistant APIs Community assistants , assistants-api , assistant	4	1506	March 13, 2024
How do you stream assistants API responses? API assistants-api	4	2848	January 9, 2024

Any updates on Assistant API Streaming?

Related topics