Integrating Vision with Assistant API

My PR (albeit in RoR) might help: