Requests to chat models (gpt-3.5-turbo or gpt-4) can fail in two different ways:
Issue on OpenAI's side - this is the case where the model is overloaded with other requests.
Client-side request timeout - here, for example, I as a customer make a call with a request timeout of 10 seconds. The moment 10 seconds have passed, the request fails.
Typically you will be charged for everything. The billing system has no way to tell whether the message was interrupted by something external to the system that sends it out: network issues, lag, other external factors. Clearly, if the model itself fails or errors, that is not counted; but if the model has performed the required task and the compute has been used… yes, you are charged for it.
I understand what you mean, though. Typical error rates should be measured in your implementation and accounted for in your costings. I know this rate will fluctuate, but you can build in a small percentage to cover those failures, say 1%.
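As a sketch, padding an estimated spend with a small error buffer (the 1% figure above is illustrative, and the function name is mine) might look like:

```python
def budget_with_error_buffer(estimated_cost_usd: float, error_rate: float = 0.01) -> float:
    """Pad an estimated API spend to cover requests that are charged
    but fail on the client side (e.g. timeouts)."""
    return estimated_cost_usd * (1 + error_rate)

# Example: a projected $500/month spend, padded by 1%
print(budget_with_error_buffer(500.0))  # 505.0
```

The right rate is whatever your own measured timeout/failure rate turns out to be.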
OpenAI doesn’t yet provide costs segregated by API key on their billing dashboard. Say I have 4 tools that call OpenAI, and each of the 4 tools has its own API key; I cannot tell which tool incurred how much OpenAI cost based on API keys.
So what I do is:
For each tool, I calculate cost based on prompt tokens + completion tokens + the model in use.
But in cases where the model fails, I have no idea of the “completion tokens”. So my cost calculation would be inaccurate if OpenAI has charged me for that failed request on their side.
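A minimal sketch of that per-tool calculation, reading the token counts that come back in the API response's `usage` field. The per-1K-token prices below are illustrative assumptions, not authoritative; check OpenAI's current pricing page:

```python
# Illustrative per-1K-token prices in USD; verify against OpenAI's pricing page.
PRICES = {
    "gpt-3.5-turbo": {"prompt": 0.0015, "completion": 0.002},
    "gpt-4": {"prompt": 0.03, "completion": 0.06},
}

def request_cost(model: str, prompt_tokens: int, completion_tokens: int) -> float:
    """Cost of one request, computed from the `usage` counts in the
    API response. Prompt and completion tokens are priced separately."""
    p = PRICES[model]
    return (prompt_tokens / 1000) * p["prompt"] + (completion_tokens / 1000) * p["completion"]

# e.g. a gpt-4 call with 1000 prompt tokens and 500 completion tokens
print(round(request_cost("gpt-4", 1000, 500), 4))  # 0.06
```

The gap described above remains: a request that times out client-side never yields a `usage` object, so its completion tokens are unknowable from this side.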
Yes, I see your issue. You can create separate organisations within your account and use those to track usage, by implementing the following in your calling code:
```javascript
import { Configuration, OpenAIApi } from "openai";

const configuration = new Configuration({
  organization: "org-M7cflNCqTZcPZIOV2a9QrRUe",
  apiKey: process.env.OPENAI_API_KEY,
});
const openai = new OpenAIApi(configuration);
```
or, in Python:
```python
import os
import openai

openai.organization = "org-M7cflNCqTZcPZIOV2a9QrRUe"
openai.api_key = os.getenv("OPENAI_API_KEY")
```
You can create multiple organisations within an OpenAI account; GPT-4 API access is granted per organisation, so it would need to be requested for each.
If you get a “model is overloaded” error, which typically comes back within a second, you will not be charged.
If you get a “request timed out” error, which typically takes a long time, you will be charged. My understanding is that they send the request to the model, so it costs them money even if you time out. The only case where this wouldn’t hold is a local network malfunction causing the timeout, so that the request never even reaches the OpenAI gateway.
The solution to this is to make requests with much longer timeout values, so you get fewer timeout failures and actually wait for the completion instead.
(And, yes, waiting a minute for a completion isn’t great and makes interactive experiences bad, especially in use cases where you can’t just stream the response back directly but have to wait for the full thing for whatever reason.)
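A minimal sketch of the longer-timeout-plus-retry approach. The retry wrapper is generic (the helper name is mine); the commented-out call assumes the pre-1.0 `openai` Python SDK, whose `ChatCompletion.create` accepts a `request_timeout` parameter:

```python
import time

def call_with_retries(fn, max_attempts=3, backoff_seconds=1.0):
    """Call `fn`, retrying on transient errors (overload, timeout)
    with exponentially increasing waits between attempts."""
    for attempt in range(max_attempts):
        try:
            return fn()
        except Exception:
            if attempt == max_attempts - 1:
                raise  # out of attempts; surface the error
            time.sleep(backoff_seconds * (2 ** attempt))

# With the pre-1.0 SDK, you would pass a generous request_timeout, e.g.:
# call_with_retries(lambda: openai.ChatCompletion.create(
#     model="gpt-3.5-turbo",
#     messages=[{"role": "user", "content": "Hello"}],
#     request_timeout=120,  # wait up to two minutes instead of failing early
# ))
```

Retrying an overloaded-model error is cheap (no charge, per the above); retrying after a client-side timeout is the expensive case this thread is about, which is why raising the timeout itself is the main lever.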