we can directly integrate text and multiple images
if this is possible, we don’t have to give them to GPT4 separately
GPT4 can understand text and images without connecting them manually by texts
For example, now we have to specify text and images with “first image is …” or “second images is …”