How to get a raw list as assistant's output?

ravdar · December 28, 2023, 9:36pm

Hi, is there any way to configure the assistant to provide outputs as a raw list without any additional text? This is what the instructions for my assistant currently look like:

assistant = client.beta.assistants.create(
    name="Books Adviser",
    instructions='You are an expert on books. Your role is to recommend some books based on the data provided by a user. The output should be a Python list of dictionaries with keys ["Title"], ["Author"], and ["Topic"]. The list should be named "books". Do not write anything else; provide just a list.',
    model="gpt-3.5-turbo-1106",
    tools=[{"type": "retrieval"}],
    file_ids=[file.id]
)

Everything worked fine until I enabled ‘Knowledge Retrieval’ and uploaded a file. However, now it is quite common to see outputs such as “Based on your data, here are some books you may enjoy” or “Enjoy reading!”. How can I configure the assistant to output only the raw list as specified in the instructions?

ravdar · December 28, 2023, 9:53pm

I guess it can be done with fine-tuning or function calling, but I’ve not used any of these before. If it is possible, what solution will be more effiecent (cost, speed)? Or maybe there is a better option?

Macha · December 29, 2023, 6:05pm

Welcome to the community!

Have you considered forcing its response as a json format?

My thinking here is that you might be able to force it into providing a text list represented as a JSON file, and then use that format to programmatically turn it into the kind of data structure you need. This is the quickest and easiest solution that comes off the top of my head.

It’s not able to directly output raw data structures from a prompt directly. If it does, said data structure would be coming from its code interpreter/advanced data analysis capabilities. What it would do would be to write the script and execute the script to create some kind of data structure like that, but again, that still requires more leg work on the programming end than direct prompt → data structure. Hence why I think working with the json format would be the best solution here.

Let me know if you need more help or if that doesn’t work for you.

vb · December 29, 2023, 6:10pm

If you do get the dict with the list consistently then consider to simply remove all extra, unnecessary content before and after the list.
This should be a simple and robust solution but I admit that making sure you really get only what you want is more cost effective.

PhilFonseca · December 29, 2023, 6:14pm

Quick question, why not JSON? Can you please test this and see if it works for you?
||
Please answer with a JSON response in the following format. No salutes, no explanations, no thank you, nothing other than the specified JSON.
[
{
“Title”: “Placeholder Title 1”,
“Author”: “Placeholder Author 1”,
“Topic”: “Placeholder Topic 1”
},
{
“Title”: “Placeholder Title 2”,
“Author”: “Placeholder Author 2”,
“Topic”: “Placeholder Topic 2”
}
]
Make sure the JSON is valid!
||

edit: Just saw Macha’s reply. +1 there.

Topic		Replies	Views
Make the API return a python list or something I can rely will be a python list API gpt-4	14	6005	January 1, 2024
How to get an answer in in a predetermined without additional commentary from GPT4? Prompting gpt-4	3	1242	December 20, 2023
How to let GPT do not return any accompanying text? Prompting gpt-4 , gpt-35-turbo , chatgpt	11	13030	December 15, 2023
Limit knowledge retrieval to only pull from a list Prompting gpt-4	4	637	December 16, 2023
JSON Response format with assistant runs API	17	15120	February 27, 2025

How to get a raw list as assistant's output?

Related topics