Can anyone explain the embedding usage stats?


Can anyone explain the usage stats from this image?

I only performed three 8000 token embeddings.


That is a lot of requests and a lot of tokens. Did your API key leak? Regenerate a new one (and delete the old one).

Or is your code stuck in some error and it keeps retrying?


I created a new API key two days ago.

I’m not sure whether my code is stuck.

Here is my code:

I did get a warning on the split_list embedding assignment.


You were right!

Here is what GPT-3 said:

“To reduce the token usage, consider optimizing the code by only accessing the ‘combined’ column once per DataFrame and storing the result in a variable, rather than accessing it for each element. Additionally, consider using a more efficient algorithm for retrieving the embedding, such as using a pre-trained model.”

Could you provide the dataframe Python code that does this?

I thought this was already implemented.


I don’t think you need a dataframe; you need a database! To avoid calling the API over and over, keep track of what you’ve already done: on each run, the code checks that history and only sends the call to the API if it hasn’t been made yet.

In a nutshell, create a hash of what you have done (hash the thing sent to the API) and check for the existence of this hash before you send it. Dataframes are short-term, in-memory objects; databases are long-term.
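A minimal sketch of this hash-and-check pattern, assuming SQLite as the database and a placeholder `call_embedding_api` function standing in for the real embeddings call (which you would replace with your actual API client):

```python
import hashlib
import sqlite3

def call_embedding_api(text):
    # Placeholder for the real embeddings API call; returns a fake vector.
    return [float(len(text)), 0.0]

class EmbeddingCache:
    """Cache embeddings in SQLite so each text is only embedded once."""

    def __init__(self, path=":memory:"):
        self.conn = sqlite3.connect(path)
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS embeddings (hash TEXT PRIMARY KEY, vector TEXT)"
        )
        self.api_calls = 0  # track how often we actually hit the API

    def get(self, text):
        # Hash the exact thing that would be sent to the API.
        key = hashlib.sha256(text.encode("utf-8")).hexdigest()
        row = self.conn.execute(
            "SELECT vector FROM embeddings WHERE hash = ?", (key,)
        ).fetchone()
        if row is not None:
            # Already embedded in a past run: skip the API entirely.
            return [float(x) for x in row[0].split(",")]
        vector = call_embedding_api(text)
        self.api_calls += 1
        self.conn.execute(
            "INSERT INTO embeddings (hash, vector) VALUES (?, ?)",
            (key, ",".join(str(x) for x in vector)),
        )
        self.conn.commit()
        return vector
```

With a file-backed path instead of `:memory:`, the cache persists across runs, so re-running the script costs zero API tokens for texts it has already seen.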

Store your embeddings in memory for speed, and when you find the closest ones, retrieve the values from the database for your prompting. It looks like you keep re-embedding the same data over and over. The tutorial does everything for you, but look at what it is doing each time you run it so you understand it.
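One way to sketch that in-memory lookup, using pure-Python cosine similarity (the `embeddings` dict and document ids here are illustrative; in practice you would load them once from your database at startup):

```python
import math

def cosine_similarity(a, b):
    # Standard cosine similarity between two equal-length vectors.
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

def closest(query, embeddings):
    """Return the id of the stored embedding most similar to the query.

    embeddings: dict mapping document id -> vector, kept in memory for speed.
    """
    return max(embeddings, key=lambda k: cosine_similarity(query, embeddings[k]))

# Illustrative in-memory store; in practice, loaded from the database once.
embeddings = {
    "doc_a": [1.0, 0.0],
    "doc_b": [0.0, 1.0],
}
best_id = closest([0.9, 0.1], embeddings)
# best_id is then used to fetch the full text from the database for the prompt.
```

The point is that the similarity search runs entirely in memory; only the final lookup of the matched document's text touches the database.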