I’m using the vision api and whatever I do it only generates a short caption, and the text is cut off midway with incomplete sentences. Is there some way to relax the output token limit or tell it to provide a longer output?
I tried these two prompts
"write a short caption for this image",
// vs
Describe this image in detail including the artist\\'s intent
and the techniques used, who is featured in the image,
and what the image is about.
Be descriptive about the specifics of the images.
Write at least 100 words and up to 500 words about the image.
and get results that are about the same length in both cases:
A Tranquil Path Through Verdant Fields: Embracing Nature's Serenity
This image captures a serene outdoor landscape, focused on a wooden boardwalk that me
so ‘meanders’ is cut off I guess.
when i tried the you are an expert art critic and expert in
trick I get a response of:
I'm sorry, but it seems there has been a misunderstanding. I am not
still cut off
sauce taken pretty much right off their site:
const response = await this.openai.chat.completions.create({
model: "gpt-4-vision-preview",
messages: [
{
role: "user",
content: [
{
type: "text",
text: prompt,
},
{
type: "image_url",
image_url: {
url: imageUrl,
detail: "high",
},
},
],
},
],