20, 30 sec assistants API answer

There’s always the option of converting your project to Completions. My bot uses it’s own chain of thought loop and uses only Completions. Answers using smaller models even involving functions are almost always sub 5 seconds or less. And humans probably don’t need a response in less than one second :slight_smile:

For example, see this 2 second response:

2 Likes