Hi all,
I’m building a tool that lets users analyze (often quite large) texts. They may have different questions/needs regarding the same text at different points in time.
I’m worried that sending this text over and over to ChatGPT (queries include “explain X in this text”, “what are the most important points in it”, etc.) might burn my credits faster than I’d like. Please note that I’m still new to understanding tokens and cost-effectiveness in general.
Maybe there’s a way to optimize this process?
To me this seems related to maintaining “sessions” between a given user and ChatGPT via my server. Are there established practices for not having to send the entire text and chat history over and over, and instead doing something with it server-side first? For example, could I utilize embeddings in some way?
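Here’s a rough sketch of what I’m imagining, using the OpenAI Python SDK: embed the document once, keep the chunks server-side, and only send the few most relevant chunks with each question. The model names, chunk size, and top_k are placeholders I picked, not anything I’ve validated:

```python
# Sketch: embed the document once, store chunks server-side, and per question
# send only the most relevant chunks instead of the whole text/history.
import numpy as np
from openai import OpenAI

client = OpenAI()

def chunk_text(text: str, chunk_size: int = 1000) -> list[str]:
    """Naive fixed-size chunking; real splitting would respect paragraphs."""
    return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]

def embed(texts: list[str]) -> np.ndarray:
    resp = client.embeddings.create(model="text-embedding-3-small", input=texts)
    return np.array([d.embedding for d in resp.data])

# Done once per document, then stored server-side (DB / vector store) per user.
document = "...the user's large text..."
chunks = chunk_text(document)
chunk_vectors = embed(chunks)

def answer(question: str, top_k: int = 3) -> str:
    # Embed the question and pick the top_k most similar chunks (cosine similarity).
    q_vec = embed([question])[0]
    sims = chunk_vectors @ q_vec / (
        np.linalg.norm(chunk_vectors, axis=1) * np.linalg.norm(q_vec)
    )
    context = "\n\n".join(chunks[i] for i in sims.argsort()[-top_k:][::-1])

    # Only the retrieved excerpts plus the question get sent, not the full text.
    resp = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[
            {"role": "system", "content": "Answer using only the provided excerpts."},
            {"role": "user", "content": f"Excerpts:\n{context}\n\nQuestion: {question}"},
        ],
    )
    return resp.choices[0].message.content
```

Is something along these lines what people usually do, or is there a better established pattern?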
Any help would be much appreciated.