Hi folks, I need your opinion on configuring an AWS machine for deploying Hugging Face’s Whisper large model (JAX version) and data storage for both audio and streamed textual data.
My end goal is to deploy the stream output model, but for now, I am setting up the current model without steam functionality. What would be the optimal AWS configuration to consider the future scope of the project?
- If I decide to use HF’s Whisper jax version, what would be the best configuration for large, taking into account the future streaming component?
- If I choose to use other implementations with streaming support, what would be the optimal configuration for large?