I’ve noticed that short, specific requests work very well. Bigger requests are less reliable: they sometimes fail with “network errors” and the body stream gets closed, so the whole request fails.
Yeah, the shorter the prompt and completion, the faster it goes. Fine-tuning a smaller model can sometimes help too.
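If it helps, here is a rough sketch of one way to keep requests small: split a big prompt on paragraph breaks and retry each piece when the connection drops. Everything here is an assumption for illustration. `send` is a hypothetical stand-in for whatever function actually calls the AI, and the `max_chars` limit and retry counts are made-up defaults, not values from any particular API.

```python
import time
from typing import Callable, List


def split_into_chunks(text: str, max_chars: int = 2000) -> List[str]:
    """Group paragraphs into chunks of at most max_chars characters.

    A single paragraph longer than max_chars is kept whole as its own chunk.
    """
    chunks: List[str] = []
    current = ""
    for paragraph in text.split("\n\n"):
        candidate = f"{current}\n\n{paragraph}" if current else paragraph
        if len(candidate) <= max_chars:
            current = candidate
        else:
            if current:
                chunks.append(current)
            current = paragraph
    if current:
        chunks.append(current)
    return chunks


def send_with_retry(
    send: Callable[[str], str],  # hypothetical: your own function that calls the AI
    prompt: str,
    retries: int = 3,
    base_delay: float = 1.0,
) -> str:
    """Call send(prompt), retrying with exponential backoff on ConnectionError."""
    for attempt in range(retries):
        try:
            return send(prompt)
        except ConnectionError:
            if attempt == retries - 1:
                raise  # out of attempts, let the caller see the error
            time.sleep(base_delay * 2 ** attempt)
    raise ValueError("retries must be at least 1")


def process_large_request(send: Callable[[str], str], text: str) -> List[str]:
    """Send each chunk as its own small request and collect the replies."""
    return [send_with_retry(send, chunk) for chunk in split_into_chunks(text)]
```

You would call it as `process_large_request(my_send, long_text)`, where `my_send` wraps your real request. Note that this only works when the pieces can be handled independently; if the AI needs the whole text at once, splitting will change the results.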