With the default max_tokens of 64, responses can come back incomplete, i.e. completion.data.choices[0].text is cut off before the explanation finishes. I’ve been playing around with the Explain Code example; here’s an incomplete response with max_tokens: 64:
It’s exporting two functions: activate and deactivate.
The activate function is called when the extension is activated.
The deactivate function is called when the extension is deactivated.
The activate function is registering a command called sayHello.
The command
Is there a way to generate complete responses that fit a given max_tokens limit? Or do I need to increase the limit to cater to long responses?
Auto-fit could be counterproductive, since the length and purpose of the code determine how long the explanation needs to be. Explanations that should be short would be padded out and consume more tokens, while ones that should be long would lack detail.
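One possible middle ground, as a rough sketch rather than a definitive fix: each choice in the response carries a finish_reason, and a value of "length" means generation stopped because it hit max_tokens. You could check that and drop the trailing partial sentence. The Choice interface and the trimTruncated helper below are names I made up for illustration; they are not part of the OpenAI library.

```typescript
// Shape of one entry in completion.data.choices (only the fields used here).
interface Choice {
  text: string;
  finish_reason: string | null;
}

// Hypothetical helper: if the completion stopped because it hit max_tokens,
// drop the trailing partial sentence; otherwise return the text unchanged.
function trimTruncated(choice: Choice): string {
  if (choice.finish_reason !== "length") {
    return choice.text;
  }
  // Greedy match up to the last '.', '!' or '?' that is followed by
  // whitespace or the end of the string.
  const match = choice.text.match(/^[\s\S]*[.!?](?=\s|$)/);
  // If no complete sentence exists, fall back to the raw text.
  return match ? match[0] : choice.text;
}

// Example with the tail of the truncated response from the first post.
const sample: Choice = {
  text: "The activate function is registering a command called sayHello.\nThe command",
  finish_reason: "length",
};
const cleaned = trimTruncated(sample);
// cleaned === "The activate function is registering a command called sayHello."
```

This only hides the cut-off; it does not recover the missing explanation. If the full text matters, you would still need to raise max_tokens or send a follow-up request when finish_reason is "length".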
A VS Code extension is a great idea. Is the Explain Code functionality the only thing you are going for?