I’m encountering an error saying that my fine-tune via the CLI will exceed my billing hard limit, even though my own estimate of what the fine-tune should cost is below that limit. I’ve reached out to OpenAI via email and their support “chat” to ask for an estimate of how much it will cost to fine-tune the davinci model, but it’s been over a week without a response.
The training file contains 11,125 lines, with a prompt and completion pair on each line; it is 23.8 MB and contains 23,426,369 characters. According to OpenAI, fine-tuning the davinci model costs $0.0300 / 1K tokens.
According to OpenAI, 1 token is approximately 4 characters (What are tokens and how to count them? | OpenAI Help Center). So if I did my math right, my training file contains roughly 5,856,592 tokens (23,426,369 / 4), which should work out to a cost of about $175.70. My soft billing limit is $200, and my hard billing limit is $500.
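For reference, here’s that back-of-the-envelope math as a quick Python check (the 4-characters-per-token figure is only an approximation, so the true token count may differ):

```python
# Rough single-pass cost estimate for fine-tuning davinci on this file.
# Assumes OpenAI's rule of thumb of ~4 characters per token.
characters = 23_426_369
price_per_1k_tokens = 0.03  # davinci fine-tuning training price (USD)

tokens = characters / 4                      # ~5,856,592 tokens
cost = tokens / 1_000 * price_per_1k_tokens  # ~$175.70

print(f"~{tokens:,.0f} tokens, ~${cost:,.2f} per pass over the data")
```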
However, when I run the fine-tune command via the command line, I receive the following error:
"[2023-05-17 13:41:53] Fine-tune failed. Fine-tune will exceed billing hard limit
Yes, the cost scales with the number of epochs, because 1 epoch is one complete cycle through the training dataset. Your $175.70 estimate covers a single pass; with the default n_epochs of 4, the estimate comes to roughly $702.80, which is over your $500 hard limit.
Although it’s worth noting that an n_epochs of 1 may not result in close adherence to the behavior defined in the training dataset, while higher values may overfit.
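If cost is the blocker, here’s a minimal sketch of the same estimate scaled by epochs, plus one way to request a single epoch with the pre-1.0 `openai` Python package; the API key and file ID below are placeholders:

```python
import openai

openai.api_key = "sk-..."  # placeholder

per_pass_cost = 175.70  # single-pass estimate from the original post

# Training cost scales with the number of passes (epochs) over the data.
for n_epochs in (1, 2, 4):
    print(f"n_epochs={n_epochs}: ~${per_pass_cost * n_epochs:,.2f}")

# Requesting a single epoch keeps the estimate under the $500 hard limit.
# Roughly equivalent CLI:
#   openai api fine_tunes.create -t train.jsonl -m davinci --n_epochs 1
job = openai.FineTune.create(
    training_file="file-abc123",  # placeholder: ID returned when uploading the file
    model="davinci",
    n_epochs=1,
)
print(job["id"])
```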
Conditional generation is a problem where the content needs to be generated given some kind of input. This includes paraphrasing, summarizing, entity extraction, product description writing given specifications, chatbots and many others. For this type of problem we recommend:
Use a separator at the end of the prompt, e.g. \n\n###\n\n. Remember to also append this separator when you eventually make requests to your model.
Use an ending token at the end of the completion, e.g. END
Remember to add the ending token as a stop sequence during inference, e.g. stop=[" END"] (see the sketch after this list)
Aim for at least ~500 examples
Ensure that the prompt + completion doesn’t exceed 2048 tokens, including the separator
Ensure the examples are of high quality and follow the same desired format
Ensure that the dataset used for fine-tuning is very similar in structure and type of task to what the model will be used for
Using a lower learning rate and only 1-2 epochs tends to work better for these use cases
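Putting the separator and ending-token recommendations together, here’s a minimal sketch using the pre-1.0 `openai` Python package; the prompts, file name, and fine-tuned model name are placeholders:

```python
import json

import openai

openai.api_key = "sk-..."  # placeholder

# One JSONL line per example: the separator "\n\n###\n\n" ends the prompt,
# and the completion starts with a leading space and ends with " END".
example = {
    "prompt": "Write a short product description.\n\nSpecs: 13-inch laptop, 16 GB RAM\n\n###\n\n",
    "completion": " A lightweight 13-inch laptop with 16 GB of RAM for everyday work. END",
}
with open("train.jsonl", "w") as f:
    f.write(json.dumps(example) + "\n")

# At inference time, append the same separator to the prompt and pass the
# ending token as a stop sequence.
response = openai.Completion.create(
    model="davinci:ft-your-org-2023-05-17-00-00-00",  # placeholder fine-tuned model name
    prompt="Write a short product description.\n\nSpecs: 27-inch 4K monitor\n\n###\n\n",
    stop=[" END"],
    max_tokens=100,
)
print(response["choices"][0]["text"])
```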