When do we get an image embedding model?

So now we have DALL-E 3 and vision. That’s awesome, huge, great, super!! But we need some glue for all that stuff, don’t you think?

An image embedding model could be that special sauce. I just can’t wait…

(Off topic: the TTS model is also great, congrats on the wise choice of Brazilian Portuguese as the PT standard. But it still sounds like a US person speaking Portuguese: one who is fluent as fu** in Portuguese, but still a US person. So… any plans to improve the multilingual side of the model?)


Is an image embedding model available through the API now?

You can generate image embeddings through existing APIs.

You can refer to this for further reading: "Do image retrieval using multi-modal embeddings - Image Analysis 4.0" (Azure AI services, Microsoft Learn).
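
Once you have embeddings, the "glue" part is mostly a similarity search: embed the query (text or image) into the same vector space and rank your images by cosine similarity. Here is a minimal sketch of that step in Python. It assumes you already got the vectors back from whichever embedding API you use; the 3-dimensional vectors, file names, and the `rank_images` helper are made up for illustration, and real embeddings have hundreds of dimensions or more.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("a zero vector has no direction")
    return dot / (norm_a * norm_b)


def rank_images(query_vector, image_vectors):
    """Return (name, score) pairs sorted from most to least similar."""
    scored = [
        (name, cosine_similarity(query_vector, vec))
        for name, vec in image_vectors.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy vectors standing in for real embeddings (hypothetical values).
image_vectors = {
    "cat.jpg": [0.9, 0.1, 0.0],
    "dog.jpg": [0.7, 0.6, 0.1],
    "car.jpg": [0.0, 0.2, 0.9],
}

# Pretend this is the embedding of the query text or query image.
query_vector = [1.0, 0.0, 0.0]

ranking = rank_images(query_vector, image_vectors)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

For more than a few thousand images you would probably swap the linear scan for a vector index, but the scoring idea stays the same.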
