Same for me. I have a small .txt file, well under 1k tokens, and it has always worked in my assistant. But right now I can't use it in the Playground; it says "Failed to index file: Error extracting text from file file-kR8adPtDRd7EecD39EJHqEaN detail_str=', detail: File contains too may tokens. Max allowed tokens per file is 2000000.' self.error_code=<FileParsingErrorCode.TOO_MAY…". Looks like you have a bug, OpenAI…
Update: actually, it doesn't matter what file I upload; I get the same error every time. Even if I create a new vector store and attach a file to it, it always fails with the same error once I try to switch on File Search for the assistant.
There is parsing done on any uploaded file to try to determine its type, regardless of the extension you've used. This can fail if your file was uploaded as binary and contains data that is not valid UTF-8.
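If you want to rule that out before uploading, here's a minimal local check (the file path is just a placeholder for your own file): try decoding the raw bytes strictly as UTF-8 and see where it breaks.

```python
# Quick local check: does the file decode cleanly as UTF-8?
# "knowledge.txt" is a placeholder path for your own file.
with open("knowledge.txt", "rb") as f:
    raw = f.read()

try:
    raw.decode("utf-8")
    print("File is valid UTF-8")
except UnicodeDecodeError as e:
    # Report the first offending byte offset and a few bytes of context.
    print(f"Non-UTF-8 byte at offset {e.start}: {raw[e.start:e.start + 8]!r}")
```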
It is also possible to text-bomb OpenAI's token encoder and leave it looping; if processing takes too long, the error message you get back may be inaccurate.
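You can also count tokens locally before uploading, which both confirms the file is nowhere near the 2,000,000-token limit and will reproduce any tokenizer slowdown on your own machine. A minimal sketch with the tiktoken library (cl100k_base is an assumption; the error doesn't say which encoding the file indexer actually uses):

```python
import tiktoken

# cl100k_base is an assumed encoding, not necessarily what
# OpenAI's file indexer uses internally.
enc = tiktoken.get_encoding("cl100k_base")

with open("knowledge.txt", encoding="utf-8") as f:
    text = f.read()

tokens = enc.encode(text)
print(f"{len(tokens)} tokens (limit reported by the error: 2,000,000)")
```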
A first test is to add some placeholder text to the text file so that it cannot be read as another type. Prepend a few lines, such as "Here's the start of a new document, which contains new knowledge for the AI", and you can append something similar at the end. Doing some preprocessing or sanitation of your .txt files yourself can make them more acceptable to this odd inspection, as in the sketch below.
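A minimal sketch of that kind of preprocessing (the file names and marker text are just examples): read the file as bytes, replace anything that won't decode, and wrap it in plain-text markers so the type sniffer sees ordinary prose first.

```python
HEADER = "Here's the start of a new document, which contains new knowledge for the AI.\n\n"
FOOTER = "\n\nEnd of document.\n"

# Decode with errors="replace" so stray non-UTF-8 bytes become
# U+FFFD replacement characters instead of breaking the parse.
with open("knowledge.txt", "rb") as f:
    text = f.read().decode("utf-8", errors="replace")

# Write a cleaned copy wrapped in plain-text markers for upload.
with open("knowledge_clean.txt", "w", encoding="utf-8") as f:
    f.write(HEADER + text + FOOTER)
```

Then upload knowledge_clean.txt instead of the original and see whether the indexing error goes away.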