I’m sorry, I’m a little slow here.
I thought we’d finally be getting partial assistant output in prompts, but this is not that.
In fact, it looks like we’re billed normally for all output tokens, predicted or not. Is that right?
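For context, here’s roughly how I’m reading it, sketched against the chat completions API. The `prediction` parameter is from the docs; the exact `completion_tokens_details` field names are my assumption about where the accepted/rejected counts show up, and `app.py` is just a stand-in file:

```python
from openai import OpenAI

client = OpenAI()

original_code = open("app.py").read()  # hypothetical file being edited

resp = client.chat.completions.create(
    model="gpt-4o",
    messages=[
        {
            "role": "user",
            "content": "Rename the function `foo` to `bar` in this file:\n\n"
            + original_code,
        },
    ],
    # The mostly-unchanged file doubles as the predicted output.
    prediction={"type": "content", "content": original_code},
)

details = resp.usage.completion_tokens_details
# As far as I can tell, both of these are billed at the normal
# output-token rate, so a bad prediction can cost *more* than none.
print("accepted:", details.accepted_prediction_tokens)
print("rejected:", details.rejected_prediction_tokens)
```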
I mean, it’s cool tech, and I wonder how you guys are doing it (parallel generation? multi-token prediction plus skip-ahead? quantization?).
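My guess would be something like speculative decoding with the user-supplied prediction as the draft: score the prefix plus the draft in one forward pass, keep the longest run of draft tokens the model agrees with, and emit one corrected token at the first disagreement. This is purely my speculation, not anything OpenAI has confirmed; the toy below stubs out the model with random logits just to show the verify-and-accept shape:

```python
import numpy as np

def model_logits(tokens: list[int]) -> np.ndarray:
    # Stand-in for a real LM forward pass: one pass returns logits
    # for every position. Deterministic toy, not a real model.
    rng = np.random.default_rng(sum(tokens))
    return rng.standard_normal((len(tokens), 50_000))

def speculative_step(prefix: list[int], draft: list[int]):
    """Verify a user-supplied draft in a single forward pass.

    Assumes a non-empty prefix. Returns (accepted draft tokens,
    next token): either the model's correction at the first
    disagreement, or a free "bonus" token if the whole draft holds.
    """
    logits = model_logits(prefix + draft)
    accepted: list[int] = []
    for i, tok in enumerate(draft):
        # The model's prediction for position len(prefix)+i lives in
        # the logits of the position just before it.
        predicted = int(np.argmax(logits[len(prefix) + i - 1]))
        if predicted != tok:
            return accepted, predicted  # first disagreement
        accepted.append(tok)
    # Entire draft accepted; the last position predicts one more token.
    return accepted, int(np.argmax(logits[-1]))

# e.g. accepted, next_tok = speculative_step([1, 2, 3], predicted_tokens)
```

If that’s roughly the mechanism, it would explain the billing: every draft token still has to be scored by the big model, whether or not it gets accepted.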
But for this particular use case, aren’t diffs or fuzzy diffs much faster and cheaper?
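The back-of-envelope for a one-line rename looks compelling. Using `difflib` just to illustrate the size gap (the real win would be having the model emit the diff instead of the whole file; `app.py` is again hypothetical):

```python
import difflib

old = open("app.py").read()
new = old.replace("def foo(", "def bar(")  # the one-line edit

diff = "".join(difflib.unified_diff(
    old.splitlines(keepends=True),
    new.splitlines(keepends=True),
    fromfile="app.py",
    tofile="app.py",
))

# The diff is a handful of lines; a full rewrite is the whole file,
# so for a large file the output-token count drops by orders of magnitude.
print(len(diff), "chars of diff vs", len(new), "chars of full rewrite")
```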