Video analysis with OpenAI

lreinhard7 · July 3, 2024, 4:20pm

Hi,
I am trying to analyze a video that I recorded with my drone.
The video has no audio; it only shows activity.
What I am trying to get is a timestamped summary of what happens in the video.
Example:
08:01 Person leaves house and walks to garden shed
08:05 Person takes equipment out of the garden shed and walks into garden area
08:10 Person does some work in the garden
etc.
Is it feasible to get this information?

_j · July 3, 2024, 5:24pm

Here’s a cookbook example. It is overly optimistic about current vision models and how many images they can accept, and it uses a method that has been retired in the newest models, but it gives an overview of extracting frames and asking about them.

Many more frames need to be discarded to fit a budget, and timestamping is not a feature: the only timing information is what you already know about the segment whose frames you sent for analysis.
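
As a rough illustration of that point, here is a minimal sketch of picking frames under a budget and attaching your own timestamps, since the model will not supply them. Everything in it is an assumption for illustration: the function name `sample_timestamps`, the even spacing, and the idea that you know the wall-clock time at which the recording started. It only computes which offsets to sample; decoding the frames and sending them to a model is left out.

```python
def sample_timestamps(duration_s, max_frames, start_clock_s=0):
    """Pick at most max_frames evenly spaced offsets (hypothetical helper).

    Returns (offset_in_seconds, "HH:MM" label) pairs. The label is derived
    from start_clock_s, the assumed wall-clock start of the recording in
    seconds since midnight.
    """
    if duration_s <= 0 or max_frames <= 0:
        return []
    step = duration_s / max_frames
    samples = []
    for i in range(max_frames):
        offset = (i + 0.5) * step  # middle of each equal-length segment
        clock = start_clock_s + int(offset)
        label = f"{clock // 3600:02d}:{clock % 3600 // 60:02d}"
        samples.append((offset, label))
    return samples


# A 10-minute clip that started at 08:00, with a budget of 3 frames.
for offset, label in sample_timestamps(600, 3, start_clock_s=8 * 3600):
    print(label, offset)
```

Each label would then be sent alongside its frame, or kept on your side and joined back to the model's description of that frame.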

A second round of AI processing may be required to remove redundancy in what is reported for each image, since the model has no actual long-term view of the video, only whatever images you choose to provide.
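
To make the redundancy problem concrete, here is a small sketch of a cheap, non-AI pre-pass that could run before that second round. It is not the second AI pass itself: it only drops consecutive per-frame descriptions whose text is identical after normalizing case and whitespace. The function name `collapse_timeline` and the `(label, text)` input shape are assumptions for illustration.

```python
def collapse_timeline(observations):
    """Drop consecutive duplicate descriptions, keeping the first timestamp.

    observations: list of (label, text) pairs in chronological order
    (a hypothetical shape for per-frame model output).
    """
    timeline = []
    previous = None
    for label, text in observations:
        normalized = " ".join(text.lower().split())
        if normalized == previous:
            continue  # same activity as the previous frame, skip it
        timeline.append((label, text))
        previous = normalized
    return timeline


frames = [
    ("08:01", "Person walks to garden shed"),
    ("08:02", "person walks to  garden shed"),
    ("08:05", "Person takes equipment out of the shed"),
]
for label, text in collapse_timeline(frames):
    print(label, text)
```

Exact matching is crude. Model descriptions of near-identical frames usually differ in wording, so a language-model summarization step over the remaining entries would likely still be needed.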

lreinhard7 · July 4, 2024, 5:59am

Many thanks for the answer. Seems that I am asking for something that is currently not easily achievable. Let’s wait a couple of months and then see what is possible.

Capital · February 14, 2025, 6:45am

This should do what you’re looking for. You can connect it to the OpenAI API.

byjlw/video-analyzer on GitHub. I can’t share the link, apparently.
