Is it possible to stream the chain of thought happening during o1 API calls? Calls to this model can sometimes take a while and it would be a great user experience if I could show what was happening while the end user waits the extra time for a final response.
I would like to create an experience similar to the “thinking” loader in the ChatGPT UI.
My call to the o1 model is using a structured output, so I would still like that to be the final response, but being able to show the chain of thought during the loading state would be amazing!
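Since the API doesn’t expose the chain of thought, the closest you can get today is a purely client-side “thinking” animation while the blocking call runs. Here’s a minimal sketch: the slow call runs in a background thread while the main thread animates a spinner. `slow_structured_call` is a stand-in placeholder, not a real SDK function — swap in your actual `client.chat.completions.create(...)` call with your structured-output settings.

```python
import itertools
import sys
import threading
import time


def slow_structured_call():
    # Placeholder for the blocking o1 API call; replace with your
    # real client.chat.completions.create(...) invocation.
    time.sleep(0.5)
    return {"answer": "final structured output"}


def call_with_spinner(fn):
    """Run fn in a background thread and animate a 'thinking'
    indicator on the main thread until it returns."""
    result = {}
    done = threading.Event()

    def worker():
        result["value"] = fn()
        done.set()

    threading.Thread(target=worker, daemon=True).start()
    for frame in itertools.cycle("|/-\\"):
        if done.is_set():
            break
        sys.stdout.write(f"\rThinking… {frame}")
        sys.stdout.flush()
        time.sleep(0.1)
    sys.stdout.write("\r" + " " * 20 + "\r")  # clear the spinner line
    return result["value"]
```

This only tells the user the app is working; it shows nothing of the model’s actual reasoning, which the API does not return.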
No such feature has been announced, and streaming is not currently available for either o1-preview or the newly announced o1 (plain).
Streaming is “coming sometime” to o1; part of the delay may be implementing such progress events, on top of ensuring that nothing ever leaks from the internal reasoning generation.
Streaming and the “thinking” display remain fully realized only in ChatGPT, broadly available to anyone willing to pay the $200 subscription for a non-nerfed o1, and unavailable to API developers. o1 itself is unavailable to all but a slim selection of tier-5 API users.
This forum does have a Feedback category (Feedback - OpenAI Developer Forum), but it is basically idea banter with other users, not the kind of “paste bot-written ideas and get ignored” dumping ground that ChatGPT “feature requests” amount to.
Streaming, “thinking” events, and keep-alive progress (to guard against network timeouts) are fairly obvious requests, but there is no straightforward way to deliver them through Chat Completions, because that endpoint was not built for out-of-band metadata. You already have to opt in to usage data as an additional chunk by sending an API parameter; a new delta chunk field for reasoning events could break everybody’s parsing code.
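To illustrate the opt-in pattern mentioned above: in the Chat Completions API, usage data only arrives as a final extra chunk when you set `stream_options.include_usage`. A sketch of such a request body, assuming a streaming-capable model (o1 does not accept streaming today; the model name here is just an example):

```python
# Example request body showing the existing opt-in for extra stream
# metadata. Any hypothetical "reasoning events" field would likely need
# a similar opt-in so existing chunk parsers don't break.
request = {
    "model": "gpt-4o",  # example streaming-capable model, not o1
    "messages": [{"role": "user", "content": "Hello"}],
    "stream": True,
    # Without this, streaming responses omit token usage entirely;
    # with it, usage arrives in one additional final chunk.
    "stream_options": {"include_usage": True},
}
```

Clients that iterate over chunks assuming every chunk carries a `choices[0].delta` already have to special-case the usage chunk (which has empty `choices`), which is exactly the kind of breakage a new event field would risk.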