Streaming completion in Python

mukdalwt · October 24, 2022, 4:11pm

Hi,

Does anyone have a working code snippet for how to make streaming work in python? All the discussion I’ve seen is about doing this in JavaScript.

Basically, I want the counterpart of the following where stream=True:
r = openai.Completion.create(
model=“code-davinci-002”,
prompt= prompt",
temperature=0,
max_tokens=4096,
top_p=1,
frequency_penalty=0,
presence_penalty=0,
stop=[“I:”, “O:”]
)

I tried using sseclient and urllib3, but can’t get this to work.

Thanks!

ricky · November 14, 2022, 1:56pm

I also looking the answer for this question.
Could you share with me, how they do it with JavaScript?

antonland · November 14, 2022, 2:51pm

Yea just pass stream=True and handle with a generator

ricky · November 15, 2022, 4:17am

could you give example please, or point me to tutorial, thats would be helpful.

brendandg · January 9, 2023, 10:06pm

Here’s a very quick example that streams tokens and prints out each token as it comes in:

for resp in openai.Completion.create(model='code-davinci-002', prompt='def hello():', max_tokens=512, stream=True):
    sys.stdout.write(resp.choices[0].text)
    sys.stdout.flush()

anon1353698 · March 27, 2023, 4:49pm

So, this part works with the content. But what about the token cost? It is sent via server-sent event. Any way to obtain it? Thanks

TrackZero · April 25, 2023, 1:26am

I know this is old (providing samples for the first question, not returning token cost), but I came across the post while trying to figure this out myself with the ChatCompletion api. I got it working today, it’s the one labeled oai-text-gen-with-secrets-and-streaming.py on GitHub - trackzero/openai: Experiments with the OpenAI API. You'll need your own API keys..

That particular example is authing using AWS Secrets Manager, but you can just delete the get_secret function and pull an environment variable with openai.api_key = os.getenv("OPENAI_API_KEY")

I’ll see about adding token cost on the exit function in the next day or two.

klcogluberk · April 25, 2023, 8:33am

How to count usage count while stream=True ?

TrackZero · April 28, 2023, 2:41am

you have to estimate it with OpenAI’s tokenizer, tiktoken

github.com

openai/openai-cookbook/blob/main/examples/How_to_count_tokens_with_tiktoken.ipynb

{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# How to count tokens with tiktoken\n",
    "\n",
    "[`tiktoken`](https://github.com/openai/tiktoken/blob/main/README.md) is a fast open-source tokenizer by OpenAI.\n",
    "\n",
    "Given a text string (e.g., `\"tiktoken is great!\"`) and an encoding (e.g., `\"cl100k_base\"`), a tokenizer can split the text string into a list of tokens (e.g., `[\"t\", \"ik\", \"token\", \" is\", \" great\", \"!\"]`).\n",
    "\n",
    "Splitting text strings into tokens is useful because GPT models see text in the form of tokens. Knowing how many tokens are in a text string can tell you (a) whether the string is too long for a text model to process and (b) how much an OpenAI API call costs (as usage is priced by token).\n",
    "\n",
    "\n",
    "## Encodings\n",
    "\n",
    "Encodings specify how text is converted into tokens. Different models use different encodings.\n",
    "\n",
    "`tiktoken` supports three encodings used by OpenAI models:\n",

This file has been truncated. show original

I have added an estimator to my demo repo, openai/oai-text-gen-with-secrets-and-streaming.py at main · trackzero/openai · GitHub.

klcogluberk · May 4, 2023, 9:58am

I have added an estimator to my demo repo, openai/oai-text-gen-with-secrets-and-streaming.py at main · trackzero/openai · GitHub .

thanks, but this calculates the prompt tokens, not just the completion tokens.

martin195 · July 5, 2023, 11:32pm

Are there any plans to support getting token usage when using streaming?

Topic		Replies	Views
Token usage calculation with streaming responses - is this not supported? Feedback	1	162	June 25, 2025
Chat completion "stream" API token usage API api	3	6574	May 6, 2024
OpenAi API - get usage tokens in response when set stream=True API	34	40049	August 17, 2025
How to get total_tokens from a stream of CompletionCreateRequests API	6	6396	December 19, 2023
How do you get token count when streaming API	4	4326	December 19, 2023

Streaming completion in Python

Related topics