Hi, what am I missing? The docs mention a 512 MB limit, but say nothing about a token limit. What is wrong?
Basically, I was just trying to create an assistant with the Retrieval tool. I deliberately used a file smaller than 512 MB, tried it, and got this error.
Hi and welcome to the Developer Forum!
I've not seen that message before. 2M tokens would equate to approximately 8 megabytes of pure text.
I've seen that message in messages before.
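For anyone wanting to check the arithmetic behind that 8 MB figure, here is a rough sketch. It assumes about 4 bytes of plain English text per token, which is only a rule of thumb; the function names are made up for illustration and the real ratio depends on the tokenizer and the content.

```python
# Assumption: roughly 4 bytes of plain English text per token (rule of thumb only).
BYTES_PER_TOKEN = 4


def approx_bytes_for_tokens(tokens: int, bytes_per_token: int = BYTES_PER_TOKEN) -> int:
    """Estimate how many bytes of text a given token budget corresponds to."""
    return tokens * bytes_per_token


def approx_tokens_for_bytes(size_bytes: int, bytes_per_token: int = BYTES_PER_TOKEN) -> int:
    """Estimate how many tokens a text file of the given size would produce."""
    return size_bytes // bytes_per_token


if __name__ == "__main__":
    # The 2M token limit from the error message, expressed in megabytes.
    print(approx_bytes_for_tokens(2_000_000) / 1_000_000, "MB")
    # A full 512 MiB text file, expressed in tokens.
    print(approx_tokens_for_bytes(512 * 1024 * 1024), "tokens")
```

Under that assumption a 512 MB text file would be far beyond 2M tokens, which would explain why a file under the documented size limit can still be rejected.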
Hi guys, thanks for fast response.
@_j hmm, thanks, but what does it mean though? I need to upload huge files for retrieval; all 20 together will be 500 MB for sure. Is it a hard limit of the backend engine, or what?
It means that the documentation does not match the capabilities seen in practice.
If intentional, I expect that someone scratched their head hard about the backend costs of chunking and embedding half a gigabyte of data just so someone can ask some questions only informed by 0.001% of that upload.
text-embedding-ada-002 is $0.10 per megatoken without overlaps. Then they say "maximum 10 GB per assistant". Upload a DVD binary rip of "Office Space" as text files. Say hi to a chatbot. Delete. $500 of backend.
Yeah, same feeling. I don't get this monetization strategy; it also seems like they are quite weak at writing the API layer…
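A back-of-the-envelope sketch of that cost estimate, using the $0.10 per million tokens quoted above. The bytes-per-token ratios are assumptions: about 4 for plain English, and about 2 as a guess for data that tokenizes badly, which happens to reproduce the $500 figure. The helper name is hypothetical and chunk overlap is ignored.

```python
# Price quoted in the post above: $0.10 per million tokens.
PRICE_PER_MILLION_TOKENS = 0.10


def embedding_cost_usd(size_bytes: int, bytes_per_token: float) -> float:
    """Estimate the cost of embedding size_bytes of text once, with no chunk overlap.

    bytes_per_token is an assumed average, not a measured value.
    """
    tokens = size_bytes / bytes_per_token
    return tokens / 1_000_000 * PRICE_PER_MILLION_TOKENS


if __name__ == "__main__":
    ten_gb = 10_000_000_000
    # Plain English text at an assumed ~4 bytes per token.
    print(round(embedding_cost_usd(ten_gb, bytes_per_token=4), 2))
    # Poorly tokenizing data at an assumed ~2 bytes per token.
    print(round(embedding_cost_usd(ten_gb, bytes_per_token=2), 2))
```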
Same error with a 45 MB file; I got around it by breaking it up into 12 MB pieces.
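In case it helps, this is a minimal sketch of that kind of workaround for plain text: split on line boundaries so that each piece stays under a byte budget. The function name is made up, the 12 MB value is just the size that worked in the post above and not a documented limit, and a single line longer than the budget would still end up in an oversized piece of its own.

```python
def split_text_by_bytes(text: str, max_bytes: int) -> list[str]:
    """Split text into pieces of at most max_bytes (UTF-8), breaking only at line ends."""
    chunks: list[str] = []
    current: list[str] = []
    size = 0
    for line in text.splitlines(keepends=True):
        line_bytes = len(line.encode("utf-8"))
        # Start a new piece when adding this line would exceed the budget.
        if current and size + line_bytes > max_bytes:
            chunks.append("".join(current))
            current, size = [], 0
        current.append(line)
        size += line_bytes
    if current:
        chunks.append("".join(current))
    return chunks


if __name__ == "__main__":
    sample = "some line of text\n" * 1000
    # 12 MB mirrors the piece size mentioned above; a small value is used here for the demo.
    pieces = split_text_by_bytes(sample, max_bytes=4096)
    print(len(pieces), "pieces")
```

Each piece can then be written to its own file and uploaded separately. Joining the pieces back together gives the original text unchanged.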