OpenAI o1 streaming now available + API access for tiers 1–5

o1-preview and o1-mini now support streaming! You can get responses incrementally as they’re being produced, rather than waiting for the entire response — useful for use-cases that need lower latency, like chat. https://platform.openai.com/docs/api-reference/streaming

We’re also expanding the beta so all developers on tiers 1-5 have access to these models.
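Since streaming delivers the reply as incremental content deltas, a minimal sketch of the consuming loop may help. With the real SDK you'd iterate over `client.chat.completions.create(model="o1-mini", messages=..., stream=True)` and read `chunk.choices[0].delta.content`; here the deltas are stand-in strings so the sketch runs without an API key.

```python
# Sketch: consuming a streamed chat completion incrementally.
# In real use, `deltas` would be the content fields pulled from
# each streamed chunk; None stands in for role/finish chunks
# that carry no content.

def collect_stream(deltas):
    """Print each content delta as it arrives and return the full text."""
    parts = []
    for delta in deltas:
        if delta:  # skip chunks with no content payload
            print(delta, end="", flush=True)
            parts.append(delta)
    print()
    return "".join(parts)

reply = collect_stream(["Stream", "ing ", None, "works."])
```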

15 Likes

Awesome! :ok_hand:

For those who now get to use o1 models, and with more interactivity, just a reminder of what it does not accept as input parameters yet…

System Message

  "error": {
    "message": "Unsupported value: 'messages[0].role' does not support 'system' with this model.",

Tools

  "error": {
    "message": "tools is not supported in this model. For a list of supported models, refer to https://platform.openai.com/docs/guides/function-calling/supported-models.",
    "type": "invalid_request_error",

Functions

  "error": {
    "message": "functions is not supported in this model. For a list of supported models, refer to https://platform.openai.com/docs/guides/function-calling/supported-models.",
    "type": "invalid_request_error",

response_format

  • no strict or structured output; any formatting instructions will likely be lost through all the reasoning.
  "error": {
    "message": "Invalid parameter: 'response_format' of type 'json_schema' is not supported with this model. Learn more about supported models at the Structured Outputs guide: https://platform.openai.com/docs/guides/structured-outputs",

Not max_tokens, but max_completion_tokens

  • Under-specifying this limit can cut the AI off internally before any response is seen, because it now caps all AI-produced language, including the unseen internal reasoning.
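A small sketch of adapting an existing request: rename `max_tokens` to `max_completion_tokens`, and budget extra room for the hidden reasoning. The 4x headroom factor below is an arbitrary illustration, not an official recommendation.

```python
# Sketch: o1 models reject max_tokens; rename it to
# max_completion_tokens and scale it up, since the limit also
# covers unseen reasoning tokens (4x is a made-up headroom).

def migrate_token_limit(params, reasoning_headroom=4):
    """Return a copy with max_tokens renamed and scaled for reasoning."""
    params = dict(params)
    if "max_tokens" in params:
        params["max_completion_tokens"] = (
            params.pop("max_tokens") * reasoning_headroom
        )
    return params

old = {"model": "o1-mini", "max_tokens": 500}
new = migrate_token_limit(old)
```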

No temperature, top_p, logprobs, penalties, etc…

  • logit_bias still works - a positive value can make the AI never escape “reasoning”.

Avoid any questions about thinking processes

  • A mere mention of reasoning or internal thoughts will get your request rejected.
  • Don’t bother telling it how to think step-by-step.
  • One input, one output is the best usage.

Exception: API Error: {'message': 'Invalid prompt: your prompt was flagged as potentially violating our usage policy. Please try again with a different prompt: https://platform.openai.com/docs/guides/reasoning/advice-on-prompting', 'type': 'invalid_request_error', 'param': None, 'code': 'invalid_prompt'}
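To avoid 400s like the ones above when reusing a request built for another model, one option is to strip the rejected parameters first. A sketch, with the set mirroring this post's list; it may shrink as the beta adds support.

```python
# Sketch: drop request parameters that o1-preview/o1-mini
# currently reject, per the list in this post.

O1_UNSUPPORTED = {
    "tools", "functions", "response_format", "temperature",
    "top_p", "logprobs", "presence_penalty", "frequency_penalty",
}

def sanitize_for_o1(params):
    """Return a copy of the request without currently-unsupported keys."""
    return {k: v for k, v in params.items() if k not in O1_UNSUPPORTED}

req = sanitize_for_o1({
    "model": "o1-mini",
    "messages": [{"role": "user", "content": "Hi"}],
    "temperature": 0.2,
    "tools": [],
})
```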


(Now you’re talking to o1 – and when it talks about its own pricing, remember: “ChatGPT can make mistakes”)

  • Truth: completion_tokens billed as output also includes internally-generated unseen reasoning_tokens.
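That split can be read back from the API's usage block: `completion_tokens` is what you're billed as output, and `completion_tokens_details.reasoning_tokens` is the hidden portion of it. A sketch with made-up numbers:

```python
# Sketch: billed output tokens include hidden reasoning tokens;
# the visible reply is the difference. Numbers are illustrative.

usage = {
    "completion_tokens": 850,  # billed as output
    "completion_tokens_details": {"reasoning_tokens": 700},
}

reasoning = usage["completion_tokens_details"]["reasoning_tokens"]
visible = usage["completion_tokens"] - reasoning  # tokens you actually see
```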
11 Likes

Thanks for the reminder! And yep, not *yet*: the team’s working on adding system messages, function calling, and more. We’ll post in Announcements when they’re enabled!

5 Likes

I know this is not the right forum, but I lose nothing by asking: is streaming also supported in Azure OpenAI?

I appreciate your understanding.

Well, if functions are not working for this model, it is useless for my purposes.
Any model that still lacks function calling six months into 2025 will be obsolete.

1 Like

This topic was automatically closed after 11 days. New replies are no longer allowed.