The Assistants API has been in beta for almost a year, and the new structured response feature is critical for anyone building an application. Yet the entire Assistants ecosystem seems to be in disarray. For example, the rate limits are undocumented. After weeks of going back and forth with a support bot, a human finally responded with these numbers:
- GET: 1000/min
- POST: 300/min
- DELETE: 300/min
- POST /v1/threads/{thread_id}/runs: 200/min
- POST /v1/threads/runs: 200/min
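
Until official numbers are published, one way to stay under these limits is to throttle on the client side. Below is a minimal sketch in Python, not an official client. The limit values are the ones support quoted above. The `SlidingWindowLimiter` class, the dictionary keys, and the choice of a sliding 60-second window are my own assumptions, since it is not documented how the server counts requests or whether the two run endpoints share a budget.

```python
import time
from collections import deque

# Per-minute limits as quoted by support above; not officially documented.
# The keys are illustrative labels, not identifiers from any SDK.
LIMITS_PER_MIN = {
    "GET": 1000,
    "POST": 300,
    "DELETE": 300,
    "POST /v1/threads/{thread_id}/runs": 200,
    "POST /v1/threads/runs": 200,
}


class SlidingWindowLimiter:
    """Allow at most max_calls per period seconds (hypothetical helper)."""

    def __init__(self, max_calls, period=60.0, clock=time.monotonic):
        self.max_calls = max_calls
        self.period = period
        self.clock = clock
        self.calls = deque()  # timestamps of calls inside the current window

    def wait_time(self):
        """Return how many seconds to wait before the next call is allowed."""
        now = self.clock()
        # Drop timestamps that have aged out of the window.
        while self.calls and now - self.calls[0] >= self.period:
            self.calls.popleft()
        if len(self.calls) < self.max_calls:
            return 0.0
        return self.period - (now - self.calls[0])

    def acquire(self, sleep=time.sleep):
        """Block until a call is allowed, then record it."""
        delay = self.wait_time()
        while delay > 0:
            sleep(delay)
            delay = self.wait_time()
        self.calls.append(self.clock())


# One limiter per category; call acquire() right before sending a request.
limiters = {name: SlidingWindowLimiter(n) for name, n in LIMITS_PER_MIN.items()}
```

For example, calling `limiters["POST /v1/threads/runs"].acquire()` before each run creation keeps a single process under 200 calls per minute for that endpoint. This does not coordinate across multiple processes or API keys, and it is no substitute for handling 429 responses.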
Can someone from OpenAI give us some insight into the Assistants roadmap? Some of us are building platforms on it that are meant to serve clients.