Image to text description

holdensrstips · October 25, 2023, 4:19pm

im trying to incorporate image detection into my discord bot . i want it to function how it does in the gpt-4 browser page. you give it a image and it tells you what the image is .

here is my current gpt-4 discord bot , very simply , how do incorporate the users being able to give the bot a image to describe?

import os 

import discord
import openai
from discord.ext import commands
from dotenv import load_dotenv

load_dotenv()

TOKEN = os.getenv('DISCORD_BOT_TOKEN')
intents = discord.Intents.default()
bot = commands.Bot(command_prefix='!', intents=intents)
OPENAI_API_KEY = os.getenv("OPENAI_API_KEY")

openai.api_key = OPENAI_API_KEY

@bot.event
async def on_ready():
    print(f'Logged in as {bot.user.name}')

@bot.event
async def on_message(message):
    #check if the message is from a user
    if not message.author.bot and bot.user.mentioned_in(message):

        user_message = message.content.split(' ', 1)[1]

        response = 'Nothing yet!'

        if user_message.startswith('!ask'):
            question = message.content[5:] #remove !ask from last message content 

            response = openai.ChatCompletion.create(
                model="gpt-4",
                messages=[
                    {"role": "system", "content": "you are a helpful assistant."},
                    {"role": "user", "content": question}
                ],
            )
    await message.channel.send(response['choices'][0]['message']['content'])



    await bot.process_commands(message)
def run_chat_gpt_bot():
    bot.run(TOKEN)

holdensrstips · October 26, 2023, 3:47pm

still looking for a solution , im currently using tesseract OCR to at the vary least detect text and pass that to GPT4 so it kinda of works . but i want it to be able to describe a image like how gpt-4 does in the openai gpt4 chat page

_j · October 26, 2023, 4:16pm

Well, besides needing the “intents” specifically for the message contents, you’ll need to wait until there is an OpenAI API product available that has computer vision.

Or use something different:

raoulg · October 27, 2023, 1:03am

I as well am wanting to upload an image to ChatGPT4 using the API, as it allows in the web version. Any help on this?

_j · October 27, 2023, 1:07am

The API does not have an AI model that allows image upload or computer vision.

That may come in the future.

Before even asking how to do it, you’d want to know the price.

raoulg · October 27, 2023, 10:57am

I don’t care about the prices, I want to be able to do it

Topic		Replies	Views
Do ChatGPT API supports image describe functionalities via API? API gpt-4 , gpt-35-turbo , chatgpt , api	1	728	July 16, 2024
What are the APIs for image analysis? API gpt-4 , api	2	10432	May 17, 2024
Make OpenAI Vision API Match GPT4 Vision API chatgpt	4	3926	December 6, 2023
I need help with image recognition for my Discord bot using GPT-4 Community gpt-4 , chatgpt , api , image-reading	0	665	December 23, 2023
Describing images with GPT3 API	5	11498	August 3, 2023

Image to text description

Related topics