If I fine-tune a GPT-4o model on specific text examples, will I still be able to pass images to the model for inference? Also, will the fine-tuning on text examples impact the model’s performance with images?
Welcome to the community!
I believe so if the model is multi modal.
I do not believe so. I’ve heard fine-tuning for images might be coming eventually, though.