Yes.
This also helps folks understand that the 32K token limit of gpt-4-32k works out to roughly 24K words (give or take a lot, of course).
So, when someone tries to send a 24K word prompt (messages array), it will likely fail, because the token limit covers both the messages array and the completion (response), and the user's max_tokens setting reserves part of that budget when they submit the chat completion.
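To make the arithmetic concrete, here is a rough sketch of that budget check. The 0.75 words-per-token ratio is only the rule of thumb mentioned above, not a real tokenizer count, and the function names are made up for illustration; for actual numbers you would count tokens with a tokenizer instead.

```python
# Rough budget check for gpt-4-32k, assuming ~0.75 words per token.
# This ratio is a rule of thumb only; real token counts vary a lot by text.
CONTEXT_WINDOW = 32_768  # total tokens shared by prompt and completion


def estimate_tokens(word_count: int) -> int:
    """Estimate tokens from a word count (hypothetical helper, 4 tokens per 3 words)."""
    return (word_count * 4 + 2) // 3  # integer ceiling of word_count * 4 / 3


def fits_in_context(prompt_words: int, max_tokens: int,
                    context_window: int = CONTEXT_WINDOW) -> bool:
    """True if the estimated prompt plus the reserved completion fits the window."""
    return estimate_tokens(prompt_words) + max_tokens <= context_window


if __name__ == "__main__":
    # A 24K word prompt is already about 32,000 tokens, so reserving
    # even 1,000 tokens for the completion pushes it over the limit.
    print(estimate_tokens(24_000))
    print(fits_in_context(24_000, max_tokens=1_000))
    print(fits_in_context(20_000, max_tokens=4_000))
```

In other words, the prompt has to leave room for whatever you ask for in max_tokens, so the practical word limit for the messages array is well under 24K.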
For the past week I have been thinking about starting a wiki page here (one that everyone can edit) so we can hopefully consolidate these facts, which seem to confuse so many people here.