The rate limiter considers both input and output tokens.
However, your concern is really about the model's capabilities.
A model is limited in how much data it can consider at once. Every model has a context window: the maximum span of tokens it can hold, which must accommodate both your input and the response it generates.
The standard high-quality gpt-4 model has a context length of 8k tokens. gpt-4-turbo has a 128k-token context window for input, but its output is capped at 4,096 tokens.
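To see whether a prompt will fit, you can count its tokens locally before sending it. Here's a minimal sketch using the tiktoken library; the sample prompt is just a placeholder:

```python
# pip install tiktoken
import tiktoken

def count_tokens(text: str, model: str = "gpt-4") -> int:
    """Count how many tokens `text` occupies for the given model's tokenizer."""
    encoding = tiktoken.encoding_for_model(model)
    return len(encoding.encode(text))

prompt = "Summarize the attached report ..."  # your actual input here
print(count_tokens(prompt))  # compare against the model's context window
```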
You need to partition your job into smaller tasks, or load only the data that is actually relevant.
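One way to partition the work is to split the source text into chunks that each stay well under the context window, leaving room for the response. A rough sketch; the 6,000-token budget is an assumption you would tune for your model:

```python
import tiktoken

def chunk_by_tokens(text: str, max_tokens: int = 6000, model: str = "gpt-4"):
    """Split `text` into pieces of at most `max_tokens` tokens each.
    6,000 leaves roughly 2k tokens of an 8k window for the reply (an assumption)."""
    encoding = tiktoken.encoding_for_model(model)
    tokens = encoding.encode(text)
    for start in range(0, len(tokens), max_tokens):
        yield encoding.decode(tokens[start:start + max_tokens])

# Each chunk can then be sent as its own, smaller request.
```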
As for rate limits: at tier 1 (having paid less than $50 in total),
gpt-4-turbo-preview has a limit of 150,000 tokens per minute. However, it has a much more restrictive limit of 500,000 tokens per day.
gpt-4 has a limit of 10,000 tokens per minute, with no daily limit.
Both limits are sufficient to send one maximum-context-length request per minute via chat completions.
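In practice that means pacing your requests and backing off when the limiter pushes back. A minimal sketch using the openai Python package (v1 style); the retry counts and delays are assumptions, not library defaults:

```python
import time
from openai import OpenAI, RateLimitError

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def ask(messages, model: str = "gpt-4", retries: int = 5) -> str:
    """Send one chat completion, retrying with exponential backoff on 429s."""
    delay = 2.0  # initial backoff in seconds (an assumption, tune as needed)
    for _ in range(retries):
        try:
            response = client.chat.completions.create(model=model, messages=messages)
            return response.choices[0].message.content
        except RateLimitError:
            time.sleep(delay)  # wait out the per-minute window
            delay *= 2
    raise RuntimeError("still rate-limited after retries")

# Pacing to roughly one large request per minute keeps gpt-4 under its 10,000 TPM limit.
```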