Optimum model for tagging articles?

parakeet · May 31, 2023, 7:05am

Which model and method offer the best balance of cost efficiency and quality for tagging articles as follows?

Article: varying length, average 600 words
Tags: the model should choose from a predefined list of around 300 entities (roughly 550 words, or 6,000 characters). Multiple tags can be returned.
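To make the setup concrete, here is a rough sketch of the kind of thing I have in mind: build one prompt containing the tag list and the article, then keep only the returned tags that actually appear in the predefined list. The function names (`build_prompt`, `parse_tags`), the prompt wording, and the comma-separated reply format are all assumptions for illustration; the actual API call is deliberately left out.

```python
def build_prompt(article: str, allowed_tags: list[str]) -> str:
    """Assemble a single prompt: instructions, the allowed tags, then the article.

    The wording is a hypothetical example, not a tested or recommended prompt.
    """
    tag_block = "\n".join(f"- {tag}" for tag in allowed_tags)
    return (
        "Choose every tag from the list below that applies to the article.\n"
        "Reply with a comma-separated list of tags and nothing else.\n\n"
        f"Allowed tags:\n{tag_block}\n\n"
        f"Article:\n{article}\n\n"
        "Tags:"
    )


def parse_tags(raw_reply: str, allowed_tags: list[str]) -> list[str]:
    """Keep only reply items that match the predefined list (case-insensitive).

    Unknown or repeated tags are dropped; the canonical spelling is returned.
    """
    lookup = {tag.casefold(): tag for tag in allowed_tags}
    result: list[str] = []
    for piece in raw_reply.replace("\n", ",").split(","):
        key = piece.strip().lstrip("-").strip().casefold()
        tag = lookup.get(key)
        if tag is not None and tag not in result:
            result.append(tag)
    return result


if __name__ == "__main__":
    tags = ["OpenAI", "Machine Learning", "GPT-4"]
    print(build_prompt("Some article text.", tags))
    print(parse_tags("openai, GPT-4, Blockchain, openai", tags))
```

Filtering the reply against the list means a model that invents a tag cannot pollute the output, whichever model ends up being used.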

According to the Tokenizer tool, the list of predefined terms alone comes to around 2,300 tokens, before the article is even included. So I guess the combined input could exceed 4,000 tokens.

I assume the returned tags would account for very few tokens.

I’m having a brain-fart over whether the token limits quoted in the Playground apply to input and output combined or just to the output.
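For what it's worth, here is the back-of-envelope arithmetic as a small sketch. It assumes the limit is a single budget shared by the prompt and the completion, and that the figure for text-davinci-003 is 4,097 tokens; both should be checked against the current model documentation. The ratio of 133 tokens per 100 English words, the 50-token instruction overhead, and the 100 tokens reserved for the returned tags are rough guesses, and the function names are made up for illustration.

```python
def estimate_prompt_tokens(
    tag_list_tokens: int,
    article_words: int,
    tokens_per_100_words: int = 133,  # rough heuristic, not a measured value
    instruction_overhead: int = 50,   # assumed size of the instruction text
) -> int:
    """Estimate prompt size: tag list + article + instructions."""
    # Integer ceiling division avoids floating-point rounding surprises.
    article_tokens = (article_words * tokens_per_100_words + 99) // 100
    return tag_list_tokens + article_tokens + instruction_overhead


def fits_in_context(
    prompt_tokens: int,
    reserved_completion: int = 100,  # assumed room for the returned tags
    context_limit: int = 4097,       # assumed shared prompt + completion limit
) -> bool:
    """True if the prompt plus the reserved completion stays within the limit."""
    return prompt_tokens + reserved_completion <= context_limit


def max_article_words(
    tag_list_tokens: int,
    tokens_per_100_words: int = 133,
    instruction_overhead: int = 50,
    reserved_completion: int = 100,
    context_limit: int = 4097,
) -> int:
    """Largest article (in words) that still fits under the same assumptions."""
    remaining = context_limit - tag_list_tokens - instruction_overhead - reserved_completion
    return max(remaining, 0) * 100 // tokens_per_100_words


if __name__ == "__main__":
    average = estimate_prompt_tokens(tag_list_tokens=2300, article_words=600)
    print(average, fits_in_context(average))
    print(max_article_words(tag_list_tokens=2300))
```

Under these guesses an average 600-word article fits with room to spare, but anything much beyond about 1,200 words would not, so longer articles would need truncating, chunking, or a model with a larger context window.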

Does anyone think text-davinci-003 can handle this task with acceptable quality, or is it going to require GPT-4?
