Hello. I’m testing the Assistants API and noticed that it doesn’t split my files into smaller chunks. I uploaded two files, one with 26,470 characters (11,767 tokens) and the other with 66,882 characters (29,725 tokens). My text isn’t in English, so according to the OpenAI tokenizer it averages only about 2.25 characters per token.
I asked a question (7 input tokens) and received an answer (392 output tokens). However, on the Usage page I see that this single request consumed 37,290 context tokens and generated 456 tokens.
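If I’m reading the list prices for gpt-4-1106-preview correctly ($0.01 per 1K input tokens, $0.03 per 1K output tokens), that one exchange alone comes to roughly:

```python
# Rough cost of one request, assuming gpt-4-1106-preview list prices
# of $0.01 per 1K input tokens and $0.03 per 1K output tokens.
input_cost = 37_290 / 1000 * 0.01   # ~$0.37 for the context
output_cost = 456 / 1000 * 0.03     # ~$0.01 for the answer
print(f"${input_cost + output_cost:.2f} per question")  # ~$0.39
```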
In 40 minutes of testing, 59 API requests used about 170k tokens and cost $4.70. It’s crucial to be able to customize chunk_size, chunk_overlap, and the match score. Splitting files into smaller chunks, running a semantic search, and sending only the relevant chunks to the model would be far more efficient (see the sketch at the end of this post).
The model used is gpt-4-1106-preview; Code Interpreter is off, Retrieval is on.
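For illustration, here is a minimal client-side sketch of the kind of retrieval I’m describing. The parameters chunk_size, chunk_overlap, match_score, and top_k (and the file name my_document.txt) are my own illustrative choices, not existing Assistants API options:

```python
# Client-side sketch: chunk the documents, embed the chunks, and send only
# the chunks relevant to the question instead of the whole file.
import math
from openai import OpenAI

client = OpenAI()

def split_into_chunks(text: str, chunk_size: int = 1000, chunk_overlap: int = 200) -> list[str]:
    """Split text into overlapping character-based chunks."""
    chunks, start = [], 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        start += chunk_size - chunk_overlap
    return chunks

def embed(texts: list[str]) -> list[list[float]]:
    """Embed a list of texts with the embeddings endpoint."""
    response = client.embeddings.create(model="text-embedding-ada-002", input=texts)
    return [item.embedding for item in response.data]

def cosine(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))

def retrieve(question: str, chunks: list[str], chunk_embeddings: list[list[float]],
             top_k: int = 3, match_score: float = 0.75) -> list[str]:
    """Return up to top_k chunks whose similarity to the question exceeds match_score."""
    [q_emb] = embed([question])
    scored = sorted(
        ((cosine(q_emb, emb), chunk) for emb, chunk in zip(chunk_embeddings, chunks)),
        reverse=True,
    )
    return [chunk for score, chunk in scored[:top_k] if score >= match_score]

# Chunk and embed the uploaded documents once, then answer questions
# with only the relevant chunks in the context.
document_text = open("my_document.txt", encoding="utf-8").read()
chunks = split_into_chunks(document_text)
chunk_embeddings = embed(chunks)

question = "..."  # my 7-token question
context = "\n\n".join(retrieve(question, chunks, chunk_embeddings))
answer = client.chat.completions.create(
    model="gpt-4-1106-preview",
    messages=[
        {"role": "system", "content": f"Answer using only this context:\n{context}"},
        {"role": "user", "content": question},
    ],
)
print(answer.choices[0].message.content)
```

With ~1,000-character chunks like these, a question such as mine would send only a couple of thousand context tokens instead of 37K.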