API for image generation for gpt-4o model

I think it’ll able to the reason is because it works on natural text, unlike dall e where you need to inpaint…
But I’m not sure that will it be able to retain memory with unique ID or each time we need to provide input a image to manipulate it!
Also will the input needs to be in base64 or it’ll be able to process public url