Accuracy and token limits for long call transcripts

Hey, can someone help me with this? I want to summarize and categorize transcriptions of calls longer than an hour using LLMs. I've tried several approaches, sending the transcripts to GPT-3.5 and GPT-4 via the Chat Completions API, but the context limit per API call is 16k tokens for gpt-3.5-turbo-16k and 8k for gpt-4, while an hour-long conversation comes to around 25-30k tokens. So I can't process a full transcript in a single call through the OpenAI APIs (excluding GPT-4-32K). Is there any way to run this on foundation models with GPT-4-level accuracy?

I'm mainly concerned about the accuracy and quality of the classification and summarization, and I need the output in JSON format because it gets pasted into Sheets automatically. Please suggest some methods. For reference, here's roughly what I've been doing, simplified.
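This is a minimal sketch of my current single-shot attempt, assuming the pre-1.0 `openai` Python SDK; `summarize_call`, `transcript_text`, and the JSON field names are just placeholders, not my real schema:

```python
import json
import openai

openai.api_key = "sk-..."  # placeholder; loaded from an env var in practice

def summarize_call(transcript_text: str) -> dict:
    # Single-shot attempt: the whole transcript goes in one request,
    # which fails once it pushes past the model's context window
    # (~16k tokens for gpt-3.5-turbo-16k).
    response = openai.ChatCompletion.create(
        model="gpt-3.5-turbo-16k",
        temperature=0,
        messages=[
            {
                "role": "system",
                "content": (
                    "Summarize and categorize the call transcript. "
                    "Reply with JSON only, e.g. "
                    '{"category": "...", "summary": "..."}'  # illustrative fields
                ),
            },
            {"role": "user", "content": transcript_text},
        ],
    )
    # Parse the model's reply so it can be written straight to Sheets.
    return json.loads(response["choices"][0]["message"]["content"])
```

This works fine on short calls, but anything around the 25-30k-token range simply exceeds the context limit, which is why I'm stuck.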