Using gpt-4-vision-preview in Langchain

adeshg · December 8, 2023, 6:37am

I am trying to create a Python example of a conversational chatbot that uses, say, ConversationBufferWindowMemory from the LangChain library. The user will enter a prompt asking for some images, and I need to add a hook in the chatbot flow that runs a text-to-image search and returns the matching images from a local vector DB instance.

I have two questions on this:

Since it is related to images, I am trying to use the gpt-4-vision-preview model in my code. My code sample looks like this:

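from langchain.chains import ConversationalRetrievalChain
from langchain.chat_models import ChatOpenAI

# retriever and memory are created earlier in my code
llm = ChatOpenAI(model="gpt-4-vision-preview", max_tokens=1024)
qa = ConversationalRetrievalChain.from_llm(llm, retriever, memory=memory, chain_type="stuff")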

It gives me the error: The model gpt-4-vision-preview does not exist or you do not have access to it. Learn more: How can I access GPT-4? | OpenAI Help Center.

However, in my other code sample, if I do it like this:

import openai

# image_base64 holds the base64-encoded JPEG, prepared earlier in my code
response = openai.chat.completions.create(
    model="gpt-4-vision-preview",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe the image in detail"},
                {
                    "type": "image_url",
                    "image_url": {
                        "url": f"data:image/jpeg;base64,{image_base64}",
                    },
                },
            ],
        }
    ],
    max_tokens=4096,  # default max tokens is low so set higher
)

In this case it works with my OpenAI key.
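For completeness, the snippet above does not show how image_base64 is created. A minimal sketch of how it could be produced, assuming the image is a local JPEG file. The helper names encode_image and build_vision_messages are hypothetical and not part of the openai library; they only use the Python standard library and rebuild the same messages structure as above.

```python
import base64


def encode_image(path):
    """Read a local image file and return its contents as a base64 string."""
    with open(path, "rb") as f:
        return base64.b64encode(f.read()).decode("utf-8")


def build_vision_messages(image_base64, prompt="Describe the image in detail"):
    """Build a messages list with one text part and one inline JPEG image part."""
    return [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": prompt},
                {
                    "type": "image_url",
                    # Assumption: the file is a JPEG, hence the image/jpeg data URL.
                    "image_url": {"url": f"data:image/jpeg;base64,{image_base64}"},
                },
            ],
        }
    ]
```

The returned list could then be passed as the messages argument of the call above, for example messages=build_vision_messages(encode_image("photo.jpg")), where photo.jpg is a placeholder path.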

So is the gpt-4-vision-preview model supported only through openai.chat.completions.create()?

My other question is whether what I am trying to achieve is really feasible. When I go through the OpenAI documentation, I find many samples where the user reads and attaches PDFs or other documents to a LangChain ConversationalRetrievalChain and runs a question-answering session.

But when it comes to, say, text-to-image search against an existing Weaviate schema (with images already vectorized), can I allow text-to-image search in a conversational chatbot?
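To make the idea of a "hook" concrete, here is a rough sketch of the kind of routing described above. It is only an illustration under stated assumptions: search_images stands in for a function that would query the Weaviate schema and return image paths, answer_with_chain stands in for a call into the ConversationalRetrievalChain, and the trigger phrase is an arbitrary choice. All of these names are hypothetical and nothing here calls Weaviate or LangChain directly.

```python
def make_chat_turn(search_images, answer_with_chain, trigger="find images of"):
    """Return a function that routes a single user prompt.

    search_images: hypothetical callable, text query -> list of image paths
    answer_with_chain: hypothetical callable, prompt -> text answer
    trigger: phrase that marks a prompt as an image search request
    """

    def chat_turn(prompt):
        lowered = prompt.lower()
        if trigger in lowered:
            # Everything after the trigger phrase is treated as the search query.
            query = lowered.split(trigger, 1)[1].strip()
            return {"type": "images", "results": search_images(query)}
        # Any other prompt goes through the normal conversational path.
        return {"type": "text", "results": answer_with_chain(prompt)}

    return chat_turn


# Usage with stub callables in place of the real search and chain:
turn = make_chat_turn(
    search_images=lambda query: [f"{query}.jpg"],
    answer_with_chain=lambda prompt: "answer: " + prompt,
)
```

A keyword trigger is the simplest possible router; deciding the route with the language model itself would be another option, but that is outside this sketch.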
