In the current syntax for generating a model response, we use the chat.completions.create method and send our input through messages, functions, and other arguments in a structured format.
My understanding is that all of this structured input gets converted into a single string, which is then passed to the LLM for prediction. How can you access the final string input that gets passed to the model?
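For context, here's a minimal sketch of the kind of call I mean (using the v1 Python client; the model name and the get_weather tool are placeholders I made up for illustration):

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Hypothetical tool definition, for illustration only
tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Get the current weather for a city",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string", "description": "City name"}},
            "required": ["city"],
        },
    },
}]

response = client.chat.completions.create(
    model="gpt-4o-mini",  # placeholder model choice
    messages=[{"role": "user", "content": "What's the weather in Paris?"}],
    tools=tools,
)
```

Everything here is sent as structured JSON in the request body; what I'd like to see is the single string it presumably becomes on the model's side.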
So, if everything goes according to plan, it should be sending a serialized JSON string in the API call.
May I ask why you need the final input? Oftentimes, you do your work before you serialize, and you typically don't mess with the serialized form afterwards. I thought the client calls handle this for you automatically, which makes the question seem unnecessary.
You cannot access that "final string"; it is created by the internals of the API endpoint after you pass a list of messages (each with a role and content) in an API request along with the other parameters.
You can get a hint of the special tokens that are used to enclose messages in the model's actual context by looking at the GPT-4 template here.
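As a rough illustration only (my own approximation based on the publicly documented ChatML format, not anything the API exposes), a messages list gets flattened into something like this before the model sees it:

```python
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"},
]

def to_chatml(msgs):
    # Approximate ChatML rendering; the real serialization is internal to the
    # endpoint and may differ between models.
    parts = [f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>" for m in msgs]
    return "\n".join(parts) + "\n<|im_start|>assistant\n"  # open the assistant turn

print(to_chatml(messages))
```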
As others have said, you can't access the final input string, but you can get the total number of tokens it consumed by looking at the response object.
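For example (a sketch with the v1 Python client; the model name is arbitrary):

```python
from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Hello!"}],
)

# usage reflects whatever the endpoint actually fed the model, including any
# serialized tool/function definitions you passed along.
print(response.usage.prompt_tokens)
print(response.usage.completion_tokens)
print(response.usage.total_tokens)
```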
This was more for my understanding of how the LLM handles such inputs, specifically the function calls.
I'm assuming the models with function-calling capability were fine-tuned on a very specific prompt template to handle function calls, and a different prompt structure would probably not give the same level of results.
I don't know if this info has been shared elsewhere in a paper, or if people have been able to fine-tune models to work with function calls.
Thanks. This was pointed out to me recently as well. For chat.completions.create, including additional arguments like functions, tools, etc. increases the number of prompt tokens, so these arguments are definitely getting converted into a string as part of the prompt.
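A quick way to see this (a sketch, assuming the v1 Python client and a made-up get_weather tool) is to compare prompt_tokens for the same messages with and without tools:

```python
from openai import OpenAI

client = OpenAI()

messages = [{"role": "user", "content": "Hello!"}]
tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",  # hypothetical tool, for illustration
        "description": "Get the current weather for a city",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

bare = client.chat.completions.create(model="gpt-4o-mini", messages=messages)
with_tools = client.chat.completions.create(
    model="gpt-4o-mini", messages=messages, tools=tools
)

# The difference is roughly the token cost of the serialized tool definitions.
print(with_tools.usage.prompt_tokens - bare.usage.prompt_tokens)
```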
Early on, someone got the model to dump its internal prompt for functions, but they've since plugged that hole, and the last time I searched for the dumped inner prompt text I couldn't find it.
It's not rocket science what they're doing, though. They basically show the model a trimmed-down version of the JSON schema you passed in.
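For what it's worth, community reverse-engineering suggests the injected text is a TypeScript-style declaration built from your schema. The sketch below is my approximation for a single hypothetical get_weather function; the exact wording is internal to OpenAI, not verbatim, and may vary by model:

```python
# Approximate shape of the text added to the first system message (assumption,
# reconstructed from community reports; not official and not verbatim):
APPROXIMATE_FUNCTION_PROMPT = """\
## functions

namespace functions {

// Get the current weather for a city
type get_weather = (_: {
// City name
city: string,
}) => any;

} // namespace functions
"""

print(APPROXIMATE_FUNCTION_PROMPT)
```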
There is no "hole" to be patched. The AI can be made to do whatever a determined person wants, including repeating anything in its context.
Here's the function language as it is received by the AI, as part of the first system message.
Parallel tool calls are handled by an additional tool called multi_tool_use, which wastes more tokens and tells the AI to place multiple function calls inside that wrapper; later models are trained to use it.
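On the client side you never see that wrapper; parallel calls just come back as a list of tool_calls on the returned message. A sketch (v1 Python client, made-up get_weather tool):

```python
import json

from openai import OpenAI

client = OpenAI()

tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",  # hypothetical tool, for illustration
        "description": "Get the current weather for a city",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

resp = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "What's the weather in Paris and in Tokyo?"}],
    tools=tools,
)

# If the model calls the function for both cities in parallel, the endpoint
# surfaces them as separate entries here; the internal wrapper is not exposed.
for call in resp.choices[0].message.tool_calls or []:
    print(call.id, call.function.name, json.loads(call.function.arguments))
```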