A general question regarding the classification - how do we model/train?

damir.olejar · May 27, 2021, 3:25pm

Hi

I am very new to GPT-3 and have been using it for two days only.
The reason why I am using GPT-3 is to create a classifier.

The API CURL calls to a classifier include the “examples”.
However, the classical approach to creating a classifier is to provide the training data, create a model, and then simply execute the query on the model.

The pricing is very important, and in order to avoid getting over-charged for the examples I will be using over-and-over again, I would like to somehow train the GPT-3 classifier and to simply make the next REST call as a query (without a necessity to provide the examples).

Is such a thing possible to be done, or do I really need to provide the examples each time I make the REST call to API ?

Thank you kindly for your answer, and I apologise if this question has been asked/answered somewhere else… I am still very new.

EDIT: I see the option to upload the file, but I do not see how we get charged when searching the file or per file content/size.

Damir

tabarak · May 27, 2021, 5:40pm

@damir.olejar Please take a look at our Pricing Page for more info on how pricing is calculated for endpoints that use files i.e. Answers, Search, and Classifications.

damir.olejar · May 27, 2021, 5:42pm

Already did:
…In this scenario, costs are largely based on the number of examples reranked (controlled by max_examples ) and the total length of those examples. If you pass examples in your request instead, costs are based on the total length of all those examples…

Does not state HOW the prices are calculated per file length, the number of examples, etc. The problem is, I need exact numbers and not a description.

Thank you for replying

tabarak · May 27, 2021, 5:48pm

On that page, please see the FAQ about Search. It has a formula that you can use to estimate tokens. The answers and classification use a combination of search + completion, so it should give you a ballpark. You can also try out a few examples with the ‘ada’ engine (our fastest and least expensive engine → $0.0008/1k tokens) to test how many tokens are used.

damir.olejar · May 27, 2021, 6:01pm

Thank you. I will also track the usage and do the estimate.

Topic		Replies	Views
How to make classification and information extraction task cost-effective API	4	404	March 20, 2024
Reclassify the text content/image API	0	160	November 12, 2023
Chat-Gpt3 Fine-Tuning Questions API chatgpt , fine-tuning	2	404	December 8, 2023
API & File = What is the Cost? API gpt-4	5	7026	November 12, 2023
Is it possible to reduce the cost and speed of classification endpoint while keeping the performance? API	2	270	August 24, 2021

A general question regarding the classification - how do we model/train?

Related Topics