I’m utilizing the GPT-4 Turbo API for a study involving image-text verification. My objective is to evaluate whether a given sentence accurately describes an image. The process involves the model providing a detailed rationale for its decision before concluding with a final answer in the format of “Final Answer: Yes/No.” I have a dataset comprising over 16,000 image/sentence pairs.
The prompt I intend to use is as follows: “Analyze the image step by step using available information and determine if the sentence accurately describes it. Your final answer will be Final Answer: Yes/No. Sentence: There are no dogs in the room.”
I will use the code shared in the OpenAI’s website. What’s the most efficient approach to achieve this using the API? Additionally, do you have any suggestions or cautions regarding this usage? My aim is to ensure that my usage of credits is optimized for efficiency.
I’m not entirely sure on the technical side, but openAI has documentation on prompt efficiency for achieving results, and the compromise is possibly swapping accuracy for credits, since consecutive prompts seem to provide more accurate end results