Large context document and finetuning

betim · September 6, 2024, 2:51pm

Hey there,

We have a case we want to finetune a gpt-4o or mini with a considerable dataset (~500 rows) that’s aiming to give users ability to generate a (figma) design as json object. Json object is designed by our team and might contain sub-objects that just our plugin knows to read.
All design construction (json product) is based on a guide, for the sake of easiness let’s call it manifesto (or context) which talks about structure, components relation and what component could be parent, what could be child, and on, this document is around 20 pages long.

So model needs to be finetuned to answer to a user prompt on generating a web screen with our own json structure.

Do we have to send on every dataset record this long document, together with question and answer?

Or do we have just to mention and then send it as context when we do api call? We followed this practice but we are hitting max_tokens (4096) and we don’t get a full generation for a screen. Or there’s a solution for this, which we don’t know?

I’d really like to hear your guidance on how to approach this problem.

Thanks in advance!

icdev2dev · September 6, 2024, 4:55pm

I am not really sure that I completely understood the context. Here’s what I got out of the post above:

Your final objective is to get a custom json object as a figma design; so that you can render off of it.
You would get to that figma design on an interative basis through user interactions.
you have guidelines (aka context) that the json object representing the figma design must follow.

Those guidelines contains instructions of how components must be structured within the json object (such as where a component can appear, parent-child relationship and cross reference between components).

If my above understanding is correcting, can you please describe the function of fine-tuning and what steps you have taken to come to the conclusion that fine-tuning is the best option.

It seems that you can develop this figma design by introducing one component at a time and injecting the necessary knowledge into that one component; analogous to developing outline of a story plot, outline-of-story chapters and then building out chapters. Similar to something like this here for illustrative purpose (https://youtu.be/KJ-J5AYGG7g)

Topic		Replies	Views
Fine-tuning a model without using prompt-completion API fine-tuning	1	920	July 4, 2023
Is Fine Tuning the right approach for me? Community fine-tuning , fine-tuning-problems	0	559	November 30, 2023
Training GPT with a Context API fine-tuning , api , data-preparation , context-elements , memory-issues	0	606	March 21, 2024
Fine tuning GPT-4o with large data source in system prompt API gpt-4 , fine-tuning , api , data-preparation , api-structured-data	0	130	January 8, 2025
Struggling with fine-tuning GPT for generating JSON API fine-tuning , fine-tuning-problems	1	361	July 9, 2024

Large context document and finetuning

Related topics