I started getting this on the first request too.
Error code: 429 - {'error': {'message': 'Rate limit reached for gpt-4-vision-preview in organization org-ptXXX on tokens per min (TPM): Limit 10000, Used 6870, Requested 3745. Please try again in 3.69s. Visit https://platform.openai.com/account/rate-limits to learn more.', 'type': 'tokens', 'param': None, 'code': 'rate_limit_exceeded'}}
The Used token count and the cool-off time seem to be random (only the Requested value looks right). My billing page does not show any usage of the model.
Either the returned error message is misleading, or there is some bigger problem.
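For anyone wanting to check the numbers in a message like the one quoted above, here is a minimal Python sketch that pulls them out and compares them. `parse_rate_limit_message` is a hypothetical helper, not part of any OpenAI library, and the regex assumes the message keeps exactly this wording. The `refill_seconds` value also assumes the limit refills linearly over a minute, which is a guess and not something the error message states.

```python
import re
from typing import Optional

# Hypothetical helper: matches the "Limit ..., Used ..., Requested ..." part of
# the 429 message and the suggested wait. The wording is assumed to be stable.
_PATTERN = re.compile(
    r"Limit (?P<limit>\d+), Used (?P<used>\d+), Requested (?P<requested>\d+)\. "
    r"Please try again in (?P<wait>[\d.]+)s"
)


def parse_rate_limit_message(message: str) -> Optional[dict]:
    """Return the numbers from a rate-limit message, or None if it does not match."""
    match = _PATTERN.search(message)
    if match is None:
        return None
    limit = int(match["limit"])
    used = int(match["used"])
    requested = int(match["requested"])
    # How far the request would push the counter past the per-minute limit.
    over_by = used + requested - limit
    return {
        "limit": limit,
        "used": used,
        "requested": requested,
        "wait_seconds": float(match["wait"]),
        "over_by": over_by,
        # Assumption: tokens refill linearly at `limit` per 60 seconds.
        "refill_seconds": over_by / limit * 60,
    }
```

For the message above this gives `over_by` = 615 (6870 + 3745 - 10000), and 615 tokens at 10000 per minute is about 3.69 seconds, the same as the suggested wait. So the figures in the message may be consistent with each other, but that still does not explain why Used is non-zero on a first request.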