I have written an application using the Assistants API that takes a text file as input and runs searches over it. For one query, where the JSON response is large, the response gets truncated partway through.
What can be done to avoid this? As far as I can see, max_tokens is only available in the Chat Completions API.
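
For reference, here is a minimal sketch of how the truncation could be detected on the client side. It only checks whether the returned text parses as complete JSON. The helper name `is_complete_json` and the sample payload are made up for illustration, and the snippet does not call the Assistants API itself.

```python
import json


def is_complete_json(text: str) -> bool:
    """Return True if `text` parses as a complete JSON document."""
    try:
        json.loads(text)
    except json.JSONDecodeError:
        # A response cut off mid-way fails to parse (unterminated string,
        # missing bracket, and so on).
        return False
    return True


# Hypothetical payload standing in for the assistant's JSON reply.
full = '{"results": [{"id": 1}, {"id": 2}]}'
cut = full[:20]  # simulate a reply that was cut off partway through

print(is_complete_json(full))
print(is_complete_json(cut))
```

A check like this only reports that the reply is incomplete; it does not explain why the output was cut off or prevent it.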