Is there a way for me to send the system instructions to OpenAI just once (assuming they are the same for each item in the batch) so that I can avoid using tokens that will simply be duplicates?
So essentially, even if there were optimizations by sending system prompts and input that you have sent before, there is no additional discount offered to you of a “cache hit”.
The full text for understanding is always required.