Optimizing GPT-4o's Vision Performance?

I’m not sure what you specifically mean that we are “building in parallel,” but we are definitely not just replicating what is shown in the demos. Furthermore, I’m developing a domain-specific product that is integrated into our users’ workflows, so a GPT would not suit my organization’s needs. I agree, though, that there’s more going on behind the scenes in the demos that we aren’t seeing. I’m hoping we can uncover how everything was done.