GPT-4: 32k and Image recognition

Thiago · July 6, 2023, 10:18pm

Anyone’s got news on this? Did the public release of these get cancelled or should we expected a similar roll out to gpt-4 API - with early developer access - later this year?

Any news on this is appreciated it! Also, what would you use it for? 32k is easy to imagine, but the image recognition? so many different ways to go about it

Foxalabs · July 6, 2023, 10:24pm

32k will be rolled out, but it is super heavy on resources so it will be a gradual one as more compute gets put online, image input will (I imagine) follow a similar rollout to GPT-4 in that respect. Slow initial alpha with a slow beta after that and then a full general release.

100% speculation but just following from prior offerings.

_j · July 6, 2023, 11:12pm

What can you do with an AI computer vision system?

Bulk CAPTCHA solver. My theory why you might not see it for a while.

anon22939549 · July 6, 2023, 11:31pm

It just means we need a new method of determining who is human, but existing CAPTCHAs have been broken for a while now by anyone moderately sophisticated and properly motivated.

david23 · July 6, 2023, 11:50pm

There is a lot of cool and scary things we can do with GPT Vision. CAPTCHA solver is on the low hanging fruit rung. How about build out software from a design mockup which is only one or two rungs above a CAPTCHA solver

Foxalabs · July 6, 2023, 11:56pm

Reading schematics and PCB layouts and designing electronics is one of my “would like to try” items

anon22939549 · July 7, 2023, 2:27am

I mean… I’m guessing half of those would get you banned from from all OpenAI services in a digital heartbeat.

anon22939549 · July 7, 2023, 2:32am

Now this… This is interesting…

I know machine learning has been used in chip design, but it would be a boon to makers everywhere if there were a readily-available, affordable system that could from an image of a PCB, redesign it to be more compact, use fewer parts, etc…

I think this is one of the best ideas I’ve heard for this yet.

supershaneski · July 7, 2023, 2:34am

If GPT-4 includes vision, does this mean it will be possible to do face recognition, maybe by doing “facial data embedding” or “facial data fine tuning”?

Thiago · July 7, 2023, 10:53pm

Anything is speculation at this point regarding this, we need it in our hands to test it out. Facial recognition is already achievable with other AI methods, so I don’t see why it would be a problem. Fine tuning it would probably get it to be more accurate, but given that we don’t have GPT-4 fine tuning out, you can expect for different solutions around this to pop up, such as using databases in production and variables/lists/arrays in a experimental/developmental environment.

Foxalabs · July 7, 2023, 11:08pm

Further to this, facial recognition is a trivial task with mature technology, it could be that it will be part of some subsystem, but the point of image ingestion is to augment the models information input ability, it may be able to identify if the human in the image is wearing a shirt and tie or a t-shirt, but it won’t know the persons name.

Thiago · July 7, 2023, 11:25pm

here’s some good ones I can think of:

Locate cancer cell(s) and destroy it (virtual simulation done on unreal engine)
Video game npc village, with real time reaction to events
Based on medical exams, propose a diagnosis
Fully autonomous surgery with multiple agents on different focal view points (microscopic precision with macro perspective range of understanding)
Take out the garbage bot lol

Also, careful with sharing harmful ideas, it’s always best to not propagate them in my opinion

anon22939549 · July 7, 2023, 11:41pm

One thing I think will be interesting is to include it in other image pipeline tasks.

For instance, take Meta’s Segment Anything model. A natural next step in the pipeline could be to meaningfully label the segments with GPT-4.

Then, I’m immediately imagining a content-aware segmentation model. Where knowing what the objects are it will be better able to determine segments, particularly partially obscured or occluded objects. Then using GPT-4 to label those segments.

Perhaps looping iteratively to either fine-tune the segmentation map, or possibly creating nested segmentations.

Thiago · July 8, 2023, 11:30am

I found information on the 32k model, no word on the image recognition functionality:

API Access

On July 6, 2023 , we gave all API users who have a history of successful payments access to the GPT-4 API (8k). We plan to open up access to new developers by the end of July 2023, and then start raising rate-limits after that depending on compute availability.

We are not currently granting access to GPT-4-32K API at this time, but it will be made available at a later date.

andres1 · July 20, 2023, 3:28am

I need image recognition for some of my use cases. It’s advertised in OpenAI’s web site and apparently available through Bing, but it is still not available through the API. I found a video some time ago with someone from OpenAI teaching how to use it, but I tried and received an error message back. In summary, you guys from OpenAI created the high expectation, now please give us a chance to test it even if in beta.

alishalash.mansoori · July 20, 2023, 10:00am

how can to swich ChatGPT version 3.5 to version 4 ?

_j · July 20, 2023, 11:23am

Bing Chat has been occasionally been rotating the availability of GPT-4-powered machine vision to some users.

Those who get access first: those who put $10 billion into OpenAI. Microsoft. Consider they also have the AI power to recognize and blur faces before submitting for AI answering analysis.

Hi. You have found the wrong forum category and posted to the wrong topic. However I can answer that ChatGPT with GPT-4 enabled is a feature that is available to plus subscribers at $20 per month, an upgrade available within the chatgpt web interface at chat.openai.com.

Topic		Replies	Views
How to get access to gpt-4-32k? API	27	88520	December 12, 2023
Gpt-4-vision! New model name is out but not the access to it! API gpt-4-vision	9	7114	November 27, 2023
GPT4 OCR/Image Recognition API gpt-4	3	20680	December 18, 2023
Any update on GPT-4 vision? API	6	3122	December 17, 2023
ChatGPT goes Multimodal! Sound and vision is rolling out on ChatGPT Community chatgpt , multimodal	34	12052	December 10, 2023

GPT-4: 32k and Image recognition

API Access

Related topics