How does GPT-4 multimodal input interface work?

benslinux · August 14, 2023, 12:43pm

I have been using ChatGPT on a daily basis since December 2022, primarily to help me as I advance in C Programming - it has proved invaluable for this. I’m considering trying out GPT-4 due to its support for multimodal input. When feeding information to GPT-4, such as text and images - how is this actually done by the user? Are images, documents, code snippets, etc. directly pasted into the interface?

novaphil · August 14, 2023, 9:35pm

Regular GPT4 doesn’t yet support images/video. Generally any other content is just pasted into the text box. If you are using Code Interpreter mode you can upload files (and zip of files) by clicking the (+) icon in the text box.

Topic		Replies	Views
GPT-4 API multimodal access (images) API	8	13681	July 2, 2024
How to use ChatGPT-4 to analyze images as Openai said Plugins / Actions builders	3	17138	December 13, 2023
GPT-4 with image input documentation API	3	3710	December 15, 2023
How do I use images with the gpt-4 api? API gpt-4	1	1008	August 27, 2023
Multimodal (image) fine tuning with GPT-4 API gpt-4 , fine-tuning	17	7288	October 3, 2024

How does GPT-4 multimodal input interface work?

Related topics