your main problem is going over your tier limits. depending on the model, check your request per day, request per minute, tokens per minute. you can certainly use assistants api. it will not necessarily use less tokens. it actually has the potential to increase your token usage if you use file search. though there are token control properties that you can use.
1 Like