Khan academy GPT-4o Math tutor demo - How to

confi13 · May 14, 2024, 6:21am

In today’s Khan Academy video of Math tutoring with his son
the interactivity between annotating the triangle and the assistant is impressively synchronized.

Does anyone know what APIs are used to create this interactive multi-modal experience? Is the whiteboard converted to video and sent to the model?

_j · May 14, 2024, 9:10am

That’s part of the unprecedented publicity push of this new model. Filmed in the OpenAI setup as were a bunch of others in the publicity package, released with perfect timing. Plausible “nice” application of the technology, instead of replacing phone trees for corporations.

The app likely has the ability to “look” when you say look, as you notice the speech slowing down while obtaining the initial image. The model can process over an image a second, but one can listen, that a lot of the discussion wouldn’t need visual cues. The actual context loading of captured images, context length management of chat with images, aborting output with speech, you can imagine them all with generation and API methods. Continuation of generation upon a dynamic context is something that we don’t get.

The ChatGPT app is its own product with methods not available to consume by API even now.

fluxtah · May 14, 2024, 10:19am

Since gpt4-o is multimodal expect more modes of input to be released as time goes on. How you plug audio, video, etc into the API I guess will come with an API update soon.

confi13 · May 14, 2024, 2:09pm

One possible method might be that the canvas in the Khan Academy app detects when the pencil draws something and possibly sends a few images as the pencil moves to provide enough context. An example of this kind of analysis can be seen in the video cookbook from OpenAI:
https://openai_com/examples/gpt_with_vision_for_video_understanding.

Diet · October 6, 2024, 11:28pm

I mean sorta, but it’s ridiculously expensive.

burukutugagaranga · April 1, 2025, 2:59pm

This AI-powered tool provides step-by-step explanations, personalized feedback, and interactive problem-solving, making complex topics easier to understand. Whether you’re struggling with algebra, calculus, or geometry, this tutor adapts to your learning pace. For those who need additional academic support, check out https://quickassignmenthelp.info/ for expert assignment help. With AI-driven education evolving rapidly, tools like this are revolutionizing the way we learn, offering accessible and effective tutoring for everyone.

Topic		Replies	Views
Can GPT understand jqMath formulas? API	5	856	May 27, 2023
GPT ,generate math equation in latex format problem API gpt-4 , api	8	4046	March 14, 2024
Educational Chatbot Documentation	15	3083	September 13, 2024
Model selection problem. Mainly used to solve mathematics and physics problems API	4	3222	June 29, 2024
Knowledge base or prompt words, which one is more efficient in solving elementary mathematics problems? API gpt-4	6	437	June 30, 2024

Khan academy GPT-4o Math tutor demo - How to

Related topics