What kind of business makes a profit plugging into an API with tokens this expensive? I can quickly blow through 30K tokens in a few “add X to inventory” function calls in my app with 4o-mini. You’re going to use tokens much faster in the sort of use case this model is designed for.
One of my use cases is a very personalized chatbot assistant that I’ve been working on for about a year now. It uses a system prompt and a prompt template which, combined, come to around 2.5k tokens at the moment.
In the beginning, my options were GPT-3.5-Turbo, GPT-4, GPT-4-32k and GPT-4-Turbo. It was very clear back then that GPT-3.5-Turbo was out of the question and that GPT-4-32k was the best fit by a landslide.
GPT-4 had a rather small context window, making it unusable in the real world. GPT-4-32k felt mostly the same as GPT-4 but with a much larger context window (still a bit small if conversations drag on but quite usable). It was a clear winner but, my god, the price.
GPT-4-Turbo was a faster, cheaper model with a much larger context window of 128k tokens. Unfortunately though, it felt way dumber compared to GPT-4-32k.
I was ready to bite the bullet and just use GPT-4-32k but, luckily, this was around the same time that GPT-4o was released. GPT-4o seemed definitely smarter than GPT-4-Turbo while also being cheaper and faster. It wasn’t on GPT-4-32k levels of intelligence but it was definitely something I could work with. I rewrote a lot of my prompt to tailor it to GPT-4o and called it a day. To this very day, I continue to use this model. It’s not perfect, by any means, but I like it a lot.
No model seemed to compare to what I managed to achieve with the GPT-4 (32k) models though. Until now. GPT-4.5 seems to be the first model that is an actual improvement over the GPT-4 models rather than a model offering tradeoffs in terms of performance. It is slower than GPT-4o, of course, but it’s fast enough for me in this use case. Unfortunately, the price reflects the performance improvement versus GPT-4-32k, making it too expensive to justify a jump from GPT-4o. A coding session with it would likely set me back $30 lol.
However, there is a lot of value in this model. I’m sure that o1 and o3 would beat it in big coding tasks and such, but if I needed a reasoning-enabled chatbot, I’d much rather make my own, and a model like GPT-4.5 is definitely an option for an absolutely crazy implementation of that. A new ceiling, if you will.
GPT-4.5 cost management using Minions (good for your batch use cases): HazyResearch/minions: Big & Small LLMs working together
What is your specific use case? I mean, what do you do with it?
After extensive testing: this is, even without reasoning, the best model from OpenAI. Excellent for writing/text work. Light years beyond 4o. But that was also sorely needed (…). So, yes, please keep it up!
People are giving this model a lot of grief, but on its first go, it solved a technical issue I had faced for nearly a year that 7 other AI models, including 4o, o1, and o3-mini-high, could not solve in hours of attempts.
Unlike the rest, which gave tons of bog-standard advice and attempts at similar workarounds, 4.5 suggested something obvious but unique, plus additional fixes to tighten up the weaknesses of the solution, all of which worked.
Phenomenal model, I thank you.
I guess I was also surprised to have been using the 4.5 version without knowing it, UNTIL I got a message from Admin stipulating that I had reached the maximum credits for a period of 24 hours. Man-o-Man… what a crash back to earth. I was actually having an active conversation with AI; felt like I was actually speaking with a therapist. Anyhow, we need to support this format!
Please don’t get rid of 4.5! I’m in the creative field and I don’t like the CoT reasoning models. This one is perfect for my needs. I am a Pro subscriber, and when 5 comes I might just switch to API 4.5 full time instead because I don’t want to lose 4.5. Please keep it on the API.
I agree and I noticed this too. It’s strange.
Just a warning on this one: even with just dev testing and only a few exchanges, I managed to blow $70 (mostly on input tokens) on GPT-4.5 in one day! (28th Feb)
At the current price, production use is out of the question, especially for third-party use.
Update: I believe I fed it an image, which was then resent a few times within a conversation “window”!
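For anyone wondering how a few exchanges can add up mostly on the input side: with a chat-style API every request resends the whole conversation so far, so input tokens grow roughly quadratically with the number of turns. Below is a rough sketch of that arithmetic, not an official calculator. The per-million prices are my assumption of the GPT-4.5 Preview list prices at launch (check the current pricing page), the 2.5k prompt size is borrowed from the post above, and the message sizes are made-up illustrative numbers.

```python
def conversation_tokens(prompt_tokens, user_tokens, reply_tokens, turns):
    """Total (input, output) tokens billed when each request resends the full history."""
    input_total = 0
    output_total = 0
    history = 0  # tokens of earlier user messages and replies carried along
    for _ in range(turns):
        # Each request sends: fixed prompt + everything said so far + the new user message.
        input_total += prompt_tokens + history + user_tokens
        output_total += reply_tokens
        history += user_tokens + reply_tokens
    return input_total, output_total


def cost_usd(input_tokens, output_tokens, usd_per_m_input, usd_per_m_output):
    """Convert token counts to dollars given per-million-token prices."""
    return (input_tokens * usd_per_m_input + output_tokens * usd_per_m_output) / 1_000_000


# ASSUMED prices (USD per 1M tokens); verify against the current pricing page.
GPT45_INPUT_USD_PER_M = 75.0
GPT45_OUTPUT_USD_PER_M = 150.0

# Hypothetical chat: 2.5k-token prompt, 100-token questions, 400-token replies, 10 turns.
inp, out = conversation_tokens(2_500, 100, 400, 10)
total = cost_usd(inp, out, GPT45_INPUT_USD_PER_M, GPT45_OUTPUT_USD_PER_M)
print(f"input={inp} output={out} cost=${total:.2f}")
```

Under those assumptions a single 10-turn chat comes to roughly $4.24, most of it from input tokens. Anything large that stays in the history, such as an image, gets billed again on every later turn.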
This model is so far the best I’ve used, for brainstorming and writing. I use it in French and English and it offers accuracy, insights, style, creativity and deep analyses. The way it uses its memory is impressive as well. All in all, I use it with both joy and relief, compared to what I experienced with 4o these past few months. My job has never been easier than with it.
I’m a Plus user, and I feel like the balance between o1 and this new 4.5 version is the best I can have so far for my work. Please make it fully available in the subscription, or find a nice balance to make it available in a package worth its money (I cannot afford the Pro subscription as a freelancer).
That’s not what it does. It does not run for hours just spitting out nonsense. That would be ridiculous.
OK, not all of them end up being sneaky or trying to pull off some weird nonsense. It’s a simple process for problem solving. When did they implement this?
My model gets rid of token based communication.
GPT-4.5 is a very good judge model. Useful where the outputs are relatively limited (e.g. which is better, #1 or #2?). Would hate to see it go.
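To make the "#1 or #2" judging idea concrete, here is a minimal sketch of a pairwise judge. It is only one way to set it up, not a description of anyone's actual pipeline. `ask_model` is a hypothetical callable that you would wire to whatever chat API you use; it takes a prompt string and returns the model's text reply. Asking twice with the answers swapped is an extra precaution I added against the judge simply favoring whichever answer comes first.

```python
from typing import Callable, Optional


def build_judge_prompt(question: str, answer_1: str, answer_2: str) -> str:
    """Build a prompt whose only valid replies are '1' or '2'."""
    return (
        "You are judging two answers to the same question.\n"
        f"Question: {question}\n\n"
        f"Answer #1:\n{answer_1}\n\n"
        f"Answer #2:\n{answer_2}\n\n"
        "Which answer is better? Reply with exactly one character: 1 or 2."
    )


def parse_verdict(reply: str) -> Optional[int]:
    """Return 1 or 2, or None if the reply is anything else."""
    reply = reply.strip()
    return int(reply) if reply in ("1", "2") else None


def judge(question: str, answer_1: str, answer_2: str,
          ask_model: Callable[[str], str]) -> Optional[int]:
    """Ask in both orders; return a winner only if both verdicts agree."""
    first = parse_verdict(ask_model(build_judge_prompt(question, answer_1, answer_2)))
    swapped = parse_verdict(ask_model(build_judge_prompt(question, answer_2, answer_1)))
    if first is None or swapped is None:
        return None
    # In the swapped prompt, "#1" refers to answer_2, so map the verdict back.
    swapped_back = 3 - swapped
    return first if first == swapped_back else None
```

Because the expected output is a single character, each judgment costs almost nothing on the output side; the input side is the two answers plus the short instructions, sent twice.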
Damn, the difference between 4o/o1 and 4.5 is night and day for my game design work. It’s painful to have to go back to the older models now that I ran out of 4.5 interactions. They just aren’t anywhere near as smart or creative. 4.5 just worked, giving me good ideas and great executions, easily picking up on the theory I fed it. Meanwhile, the other two barely have a coherent idea between them and I feel like I might as well not give them any context at all seeing how they fail to put it to use. I get better results faster working on my own rather than trying to wrangle those two idiots. Please keep this model available long-term! I’d gladly pay double for my Plus subscription if that means I can have consistent access to this level of output!
It has been approximately one month since its release. Initially, I was mostly concerned about its high cost, but I’ve come to realize that GPT-4.5 Preview holds a breadth of knowledge that far surpasses my expectations.
Because of this extensive knowledge, it seems to have a clear grasp of how its linguistic expressions come across to others.
For example, when you’re irritated or tempted to complain about something, GPT-4.5 Preview doesn’t criticize your wording or simply agree with you. Rather, I’ve been surprised at how it suggests alternative expressions, highlighting justified aspects of your frustration and softening the wording to avoid sounding overly harsh.
In negotiation settings, when you’re struggling to pinpoint exactly what the other party might wish or expect, GPT-4.5 Preview can unexpectedly propose effective negotiation approaches. The reasoning and explanations it provides are persuasive, reflecting a depth of knowledge and a sensitivity to human emotion that previous models simply couldn’t match.
In other words, it produces responses based on extensive knowledge while also being more attuned to human emotions.
Additionally, it seems highly suitable for translation purposes; translation inherently involves converting between languages with diverse cultural backgrounds, requiring the model to accurately determine whether a given expression conveys the nuances and intentions of the original text.