My app involves a user in an ongoing conversation with an AI (using the Assistants API). The user holds a large amount of “state-data”, which we represent as a big JSON structure. The AI can manipulate this structure through function calls.
I’m trying to understand the best way (in terms of both token use and efficient use of the context window) to pass this “state-data” to the Assistant…
-
The obvious choice is “add it to the prompt”. But my concern is that this will quickly eat up tokens and fill the context window with largely redundant information, because on each turn of the conversation the AI (i.e. the underlying transformer) sees not only the current state-data, but also the state-data from every past turn.
-
Currently, we pass the state-data via “additional instructions”, since my understanding is that this way the AI (i.e. the underlying transformer) sees only one copy of the state-data on each turn, instead of one copy per past turn of the conversation.
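Concretely, what we do today looks roughly like this (a minimal sketch using the Python SDK; the IDs and the state_data contents are placeholders):

```python
import json

from openai import OpenAI

client = OpenAI()

# Our app's state-data; the contents here are placeholder values.
state_data = {"inventory": ["sword", "map"], "gold": 12}

# Thread and assistant IDs are placeholders.
run = client.beta.threads.runs.create(
    thread_id="thread_abc123",
    assistant_id="asst_abc123",
    # Injected per-run rather than stored as a thread message, so the
    # context should carry only one copy of the structure per turn.
    additional_instructions="Current state-data:\n" + json.dumps(state_data),
)
```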
-
We could also send a summarized/truncated version of the state-data in the prompt, and make the full structure queryable via function calls.
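Something like this tool definition, where get_data_structure and its key parameter are names I’m inventing for illustration:

```python
# Hypothetical tool the Assistant could call to fetch the full structure
# (or one slice of it) on demand, instead of seeing all of it every turn.
get_data_structure_tool = {
    "type": "function",
    "function": {
        "name": "get_data_structure",
        "description": "Return the current state-data, optionally scoped "
                       "to a single top-level key.",
        "parameters": {
            "type": "object",
            "properties": {
                "key": {
                    "type": "string",
                    "description": "Top-level key to fetch; omit to get "
                                   "the whole structure.",
                },
            },
        },
    },
}
```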
Specific questions:
-
I understand that token billing is somewhat opaque, but I’m trying to get a rough idea here. Suppose we have a conversation with average message size M, structure size S, T turns, and token limit L. Is my understanding correct that my input token use over the whole conversation will be
O(min(L, (S + M) * T) * T)
if I put my structure in the prompt, but just
O(min(L, S + M * T) * T)
if I put it in additional_instructions?
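To make the comparison concrete, this is the back-of-envelope arithmetic I have in mind (made-up numbers for M, S, T, L):

```python
# Cumulative input tokens over a conversation, under the two strategies.
M, S, T, L = 200, 2_000, 50, 128_000  # message, structure, turns, context limit

# Structure repeated in every user message: turn t replays t copies of (S + M).
in_prompt = sum(min(L, (S + M) * t) for t in range(1, T + 1))

# Structure sent once per run via additional_instructions: turn t sees S + M*t.
in_additional = sum(min(L, S + M * t) for t in range(1, T + 1))

print(in_prompt)      # 2,805,000
print(in_additional)  #   355,000
```
-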
Does the underlying transformer actually receive the additional instructions on every prompt? I’ve seen people claim that they get ignored.
-
Suppose the model can fetch the current structure by calling a get_data_structure function. Will the response to that call still be visible to the AI in future turns of the conversation?
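For context, this is the round trip I’m picturing, continuing from the sketch above (serve_data_structure is a stand-in for our app’s lookup):

```python
import json

def serve_data_structure(key=None):
    # Stand-in for our app's actual state lookup.
    return state_data if key is None else state_data.get(key)

# When the run pauses with status "requires_action", we answer the tool
# call; the output is recorded in the run's steps on the thread.
if run.status == "requires_action":
    tool_outputs = []
    for call in run.required_action.submit_tool_outputs.tool_calls:
        if call.function.name == "get_data_structure":
            args = json.loads(call.function.arguments)
            tool_outputs.append({
                "tool_call_id": call.id,
                "output": json.dumps(serve_data_structure(args.get("key"))),
            })
    run = client.beta.threads.runs.submit_tool_outputs(
        thread_id=run.thread_id,
        run_id=run.id,
        tool_outputs=tool_outputs,
    )
```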