Using GPT (via the API) to analyse changes in facial emotion in a video

I read this cookbook, which explains that you can do video analysis by extracting the video's frames and then applying image analysis to each frame.

While this allows me to do video-to-text analysis, I wonder how you can tell whether and how things have evolved or changed over the course of the video.

For example, I want to analyse a video recording of an interview and understand how the person's emotional state changes over time. How would you approach this?

My thought:

  • chunk the video into frames
  • ask GPT to analyse the emotion in each frame
  • place the per-frame outputs from step 2 into sliding windows, say 5 frames at a time, then ask GPT how the emotion has changed based on the descriptions it generated (see the sketch after this list)
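
To make the idea concrete, here is a minimal sketch of that three-step pipeline using OpenCV and the OpenAI Python SDK. The model name (`gpt-4o`), the file path `interview.mp4`, the sampling rate, and the helper names are all illustrative assumptions, not anything from the cookbook, and the prompts would need tuning for real use:

```python
import base64
import cv2  # pip install opencv-python
from openai import OpenAI  # pip install openai

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def extract_frames(video_path, every_n_seconds=2):
    """Step 1: sample one frame every `every_n_seconds` as base64-encoded JPEGs."""
    video = cv2.VideoCapture(video_path)
    fps = video.get(cv2.CAP_PROP_FPS)
    step = max(int(fps * every_n_seconds), 1)
    frames, index = [], 0
    while True:
        ok, frame = video.read()
        if not ok:
            break
        if index % step == 0:
            _, buffer = cv2.imencode(".jpg", frame)
            frames.append(base64.b64encode(buffer).decode("utf-8"))
        index += 1
    video.release()
    return frames

def describe_emotion(frame_b64):
    """Step 2: ask a vision-capable model to label the emotion in a single frame."""
    response = client.chat.completions.create(
        model="gpt-4o",
        messages=[{
            "role": "user",
            "content": [
                {"type": "text",
                 "text": "Describe the facial emotion of the person in this frame "
                         "in one short sentence."},
                {"type": "image_url",
                 "image_url": {"url": f"data:image/jpeg;base64,{frame_b64}"}},
            ],
        }],
    )
    return response.choices[0].message.content

def summarise_change(window):
    """Step 3: given consecutive per-frame descriptions, ask how the emotion evolved."""
    numbered = "\n".join(f"Frame {i + 1}: {d}" for i, d in enumerate(window))
    response = client.chat.completions.create(
        model="gpt-4o",
        messages=[{
            "role": "user",
            "content": ("These are emotion descriptions of consecutive frames "
                        "from an interview:\n"
                        f"{numbered}\n\n"
                        "How does the person's emotional state change across these frames?"),
        }],
    )
    return response.choices[0].message.content

frames = extract_frames("interview.mp4")  # hypothetical input file
descriptions = [describe_emotion(f) for f in frames]
for start in range(0, len(descriptions) - 4):  # overlapping windows of 5
    print(summarise_change(descriptions[start:start + 5]))
```

One caveat with the sliding-window step: each description gets re-sent in up to five windows, which multiplies token cost. If the per-frame outputs are short, it might be cheaper to send the whole sequence in a single prompt and ask for a timeline of emotional changes instead.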

Would this work? Any other ideas?

Thank you