Any Suggestions to Reduce Cost and Limit Message Length of the GPT-4 Turbo Model in the Assistants API?

Hello everyone.

I am working on a project in which I try to answer our students' questions using the Assistants API. I provide the information in a JSON file via the retrieval tool.
When I use the GPT-3.5 model for the assistant, the chatbot does not give very consistent answers to the questions. But when I use the GPT-4 Turbo model, I get decent, consistent answers.

However, under the current pricing policy, the GPT-4 Turbo model costs about 20 times more than GPT-3.5. For this reason, I am trying to reduce the number of tokens the assistant uses with the GPT-4 Turbo model. Unfortunately, the assistant ignores commands such as “do not write answers longer than 100 words” in the instructions section.
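Since the instruction alone is not reliably followed, one workaround is to enforce the limit client-side after the reply comes back: clip the assistant's answer to the word budget before showing it to students. This does not reduce the tokens billed for generation, but it does keep the displayed answers short. A minimal sketch (the function name and the 100-word budget are my own, mirroring the instruction above):

```python
def truncate_to_words(text: str, max_words: int = 100) -> str:
    """Clip an assistant reply to at most max_words words.

    Appends an ellipsis when something was cut, so students can
    tell the answer was shortened.
    """
    words = text.split()
    if len(words) <= max_words:
        return text
    return " ".join(words[:max_words]) + " …"


# Example: a 5-word budget on a 7-word reply.
reply = "The enrollment deadline is Friday at noon"
print(truncate_to_words(reply, max_words=5))
```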

From my research, I have read that the Assistants API has no message-length restriction feature.
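For comparison, the plain Chat Completions endpoint does let you cap output length with its `max_tokens` parameter, so if retrieval can be handled another way, routing student questions through it is one option. A sketch of how the request could be assembled, assuming the official `openai` Python SDK (the helper name, model name, and token budget here are illustrative, not part of any API):

```python
# Hypothetical helper: build the Chat Completions request arguments
# once, so the same token cap is applied to every student question.
def build_request(question: str, max_tokens: int = 150) -> dict:
    return {
        "model": "gpt-4-turbo-preview",
        "messages": [
            {"role": "system",
             "content": "Answer student questions in at most 100 words."},
            {"role": "user", "content": question},
        ],
        # Hard cap on generated tokens; ~100 words is very roughly
        # 130-150 tokens in English.
        "max_tokens": max_tokens,
    }


# Usage (requires OPENAI_API_KEY in the environment):
#   from openai import OpenAI
#   client = OpenAI()
#   response = client.chat.completions.create(**build_request("When is the enrollment deadline?"))
#   print(response.choices[0].message.content)
```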

Do you have any suggestions for making the GPT-4 Turbo model cost less? Or, if an OpenAI official or the CEO is reading this message, could prices be reduced?


Try instructing it so that its answers are consistent.

Let me know if this does not work.