I'm getting this error ''Selected model is at capacity. Please try a different model.''

I’m getting this error when calling the API:

“Selected model is at capacity. Please try a different model.”

how to handle this

Hi, and welcome to the developer community. I had it too like one or two times.

It means that the model is at capacity, not enough compute for so many users at that time.
I would assume it happens when a new peak level of usage is reached and they need to find a way to scale it.

Like I said it happens .. but not very often. You just have to wait or you could try to restart vs code if you use that. For me it looks like they reserve a spot for you on a model .. I think that because when i restart vs code i normally get an error message and I have to restart it again.

Many times turning it off and on again seems to work haha… I should become a 3rd level IT support guy…

I don’t know quite where I got this idea from, but… please try a different model when encountering an error that explains that you should handle this by trying a different model.

WIthout more GPU compute to offer: its either degrading the AI quality, or having some users try back later.

Thanks for the report that you see this - such is all over social media.

IMO, a better message would be:

Selected model is at capacity due to compute issues. Please try back later.