Looking for Recommendations on Hosting a High-Traffic AI App

My app is scaling quickly, and I’m unsure which hosting approach pairs best with OpenAI API usage.
Should I go with serverless, container hosting, or dedicated servers?
Would love to hear what setups others are successfully using under heavy load.