Khan academy GPT-4o Math tutor demo - How to

One possible method might be that the canvas in the Khan Academy app detects when the pencil draws something and possibly sends a few images as the pencil moves to provide enough context. An example of this kind of analysis can be seen in the video cookbook from OpenAI:
https://openai_com/examples/gpt_with_vision_for_video_understanding.