I want to create an assistant through the API, but I find the API documentation hard to follow, so for now I create and run it visually in the Playground. At the moment I mainly use it for retrieval over my own data.
When I create an assistant online and test it in the Playground, I can upload a “file” while creating the assistant, but I can also upload a “file” while running the thread in the test. Is there any difference between the two?
Each question to the assistant seems to consume a lot of tokens. Is that expected?
Every question has to go through the queued, in_progress, and completed states. Does that make each request slow?
How do I determine a thread’s life cycle, and how do I reuse a thread?
Why didn’t I get any results back?
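To make the question about queued, in_progress, and completed concrete, here is a rough sketch of how I assume the waiting would look once I move from the Playground to code. The helper name wait_for_run, the fetch_status callable, and the set of terminal states are my own assumptions for illustration, not something taken from the documentation; the run is simply polled until it reaches a state that looks final.

```python
import time
from typing import Callable

# Assumed set of "final" run states; check the current API docs for the full list.
TERMINAL_STATES = {"completed", "failed", "cancelled", "expired"}


def wait_for_run(
    fetch_status: Callable[[], str],
    interval: float = 0.5,
    timeout: float = 60.0,
) -> str:
    """Poll fetch_status() until it returns a terminal state or the timeout elapses.

    fetch_status is a hypothetical callable that returns the run's current
    status string, e.g. "queued", "in_progress", or "completed".
    """
    deadline = time.monotonic() + timeout
    while True:
        status = fetch_status()
        if status in TERMINAL_STATES:
            return status
        if time.monotonic() >= deadline:
            raise TimeoutError(f"run still {status!r} after {timeout} seconds")
        time.sleep(interval)


if __name__ == "__main__":
    # Simulated run that moves through the three states from the question.
    fake_statuses = iter(["queued", "in_progress", "completed"])
    print(wait_for_run(lambda: next(fake_statuses), interval=0))
```

With the official Python SDK, I believe fetch_status could wrap something like client.beta.threads.runs.retrieve(thread_id=..., run_id=...).status, but I have not verified this against my own account. If this is the right pattern, the total wait is the number of polls times the interval, which is part of why I am asking whether it will be time-consuming.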