I applied the change in the code and removed max_tokens, but it still gives me the same error, and the response even appears shorter.