How can I integrate a video stream with GPT-4o and monitor for an object of interest when it shows up?

I would like to run the GPT-4o model on a CCTV camera feed for security, and raise an alert if a threat such as a firearm or other weapon shows up. Any advice or guidance on how to go about implementing this?

