Training GPTs through uploaded docs

I create GPTs that help with multi-step professional writing through prompt chaining. I am experimenting with a few for the news media domain, equipped with various actions. Where I have been struggling is maintaining consistency with the writing styles that news outlets such as the BBC or CNN follow. I have tried many prompting techniques, examples, text nuances, and pattern-recognition prompts, but the GPTs eventually fall back to their standard watered-down, monolithic writing style after one or two paragraphs. A section-by-section composition technique followed by approvals gives slightly better results (roughly the approach sketched below).
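For context, this is a minimal sketch of the kind of section-by-section chaining I mean, assuming the OpenAI Python SDK; the model name, exemplar texts and function names are placeholders, not my actual GPT configuration:

```python
# Minimal sketch of section-by-section prompt chaining with style exemplars.
# Assumes the official OpenAI Python SDK; model name and exemplar handling
# are illustrative placeholders only.
from openai import OpenAI

client = OpenAI()

STYLE_EXEMPLARS = [
    # Short excerpts demonstrating the target outlet's tone and structure.
    "Exemplar paragraph 1 ...",
    "Exemplar paragraph 2 ...",
]

def draft_section(outline_point: str, previous_sections: list[str]) -> str:
    """Draft one section, re-stating the style constraints on every call."""
    messages = [
        {"role": "system",
         "content": "You are a news writer. Match the tone, sentence length and "
                    "structure of the exemplars exactly:\n\n" + "\n---\n".join(STYLE_EXEMPLARS)},
        {"role": "user",
         "content": "Story so far:\n" + "\n\n".join(previous_sections)
                    + f"\n\nWrite the next section covering: {outline_point}"},
    ]
    response = client.chat.completions.create(model="gpt-4o", messages=messages)
    return response.choices[0].message.content

sections: list[str] = []
for point in ["Lede", "Background", "Key quotes", "Analysis"]:
    sections.append(draft_section(point, sections))
    # A human approval step goes here before moving to the next section.
```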

I am posting this to ask for your opinions: would it be a good idea to load a large volume of example article text with metadata into the knowledge base and ask the GPT, through the prompt, to learn the writing style from it? Have you tried this technique before? If so, please share your experience and the results you achieved.

You can check my GPT, Scriby, here: ChatGPT - Scriby

Thanks…

Here is an alternative way to do this that you could try; it is how I would have approached it.

From what you said, your issue isn’t getting the correct data; it is getting the response structured the way you want.

One way you could achieve this is by using the basic API and fine-tuning how you would like the output to be structured.

I would recommend:

1. Start with few-shot prompting (content from a few PDFs).
2. Run it through a series of structured prompts, one per API call.
3. If the responses start to improve but you need to teach the model from a larger corpus, create a JSONL file with at least 250 items and fine-tune on it (see the sketch below).
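Roughly, that last step looks like this, assuming the OpenAI Python SDK; the file name, the single example record and the base model are placeholders:

```python
# Minimal sketch of the fine-tuning path described above, using the OpenAI
# Python SDK. File name, base model and the example record are placeholders.
import json
from openai import OpenAI

client = OpenAI()

# Each JSONL line is one training example in chat format, e.g. an article
# brief paired with a paragraph written in the target house style.
example = {
    "messages": [
        {"role": "system", "content": "Write in the outlet's house style."},
        {"role": "user", "content": "Brief: city council approves new transit budget."},
        {"role": "assistant", "content": "A paragraph written in the target style..."},
    ]
}
with open("style_training.jsonl", "w") as f:
    f.write(json.dumps(example) + "\n")  # repeat for 250+ curated examples

# Upload the dataset and start a fine-tuning job.
training_file = client.files.create(file=open("style_training.jsonl", "rb"),
                                    purpose="fine-tune")
job = client.fine_tuning.jobs.create(training_file=training_file.id,
                                     model="gpt-4o-mini-2024-07-18")
print(job.id)
```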

It seems like a lot of work, but I don’t think custom GPTs can do this just because you hand them a huge amount of data.

I already created a custom GPT-4 model using the Assistants API and fine-tuned it with a structured article dataset. That gives me better results than the custom GPT does. However, I cannot open it to everyone; I only provide it as a custom solution.

If you are recommending a fine-tuning job on GPT via the API, could you elaborate on that a little?

It is a misunderstanding that these uploaded files are used for training purposes.
Some people also describe these files as part of a context chain (like the custom instructions).
They are neither of these.
In fact, the files are ignored entirely until an end user makes a prompt indicating that they should be read; the Python analysis tool then typically reads the first 1,000 or so characters of the document and returns the script output as the actual context for the prompt response.

If the Python script never reads the file, the file is never part of the context chain at all.
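To make that concrete, here is a conceptual illustration only; this is not OpenAI's actual code, just what "read the first 1,000 or so characters and return the output as context" amounts to (the path is a placeholder):

```python
# Conceptual illustration of the behaviour described above: the built-in
# analysis tool only contributes to context when it actually runs and prints
# something. This is NOT OpenAI's real implementation.
def peek_uploaded_file(path: str, limit: int = 1000) -> str:
    """Read only the first ~1,000 characters and return them."""
    with open(path, "r", encoding="utf-8", errors="ignore") as f:
        return f.read(limit)

snippet = peek_uploaded_file("/mnt/data/style_examples.txt")
print(snippet)  # only this printed output enters the prompt's context
```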

I would love to learn more about how it works. Do you want to team up and do some tests? I already have a RAG setup working on the side for another project.
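For anyone following along, a minimal sketch of the kind of RAG retrieval being discussed might look like this (assuming the OpenAI Python SDK and numpy; the corpus, model name and helper names are placeholders, not my actual setup):

```python
# Minimal sketch of retrieving style exemplars by embedding similarity.
# Corpus, embedding model and helper names are placeholders.
import numpy as np
from openai import OpenAI

client = OpenAI()

corpus = [
    "Exemplar article paragraph A ...",
    "Exemplar article paragraph B ...",
]

def embed(texts: list[str]) -> np.ndarray:
    resp = client.embeddings.create(model="text-embedding-3-small", input=texts)
    return np.array([d.embedding for d in resp.data])

corpus_vectors = embed(corpus)

def retrieve(query: str, k: int = 2) -> list[str]:
    """Return the k exemplars closest to the query by cosine similarity."""
    q = embed([query])[0]
    sims = corpus_vectors @ q / (np.linalg.norm(corpus_vectors, axis=1) * np.linalg.norm(q))
    return [corpus[i] for i in np.argsort(sims)[::-1][:k]]

# The retrieved exemplars would then be prepended to the drafting prompt.
print(retrieve("transit budget story"))
```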

How can I upload docs that “train” the GPTs I have already created?

Hi @giselle.rossi, the experimental process involves:

  1. Create a document with metadata and upload it in text format. This is the critical part.
  2. Explicitly state in the file's header section that the document is NOT for knowledge-retrieval purposes and is meant only for enhancing the quality of a specific type of output.
  3. Update the specific step/part of your prompt that should use the knowledge the GPT acquired from the document, and apply it when generating the output. (A hypothetical example of such a file header follows.)
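As a purely hypothetical illustration (the file name, metadata fields and wording are placeholders, not a fixed template), the header of such a document might look like:

```text
# style_reference_news.txt  (hypothetical example)
# NOTE TO THE ASSISTANT: This file is NOT for knowledge retrieval.
# It exists only to demonstrate the target writing style. Imitate the tone,
# sentence length and paragraph structure of the samples below when drafting.
#
# metadata: outlet=example-broadcaster; register=news report; audience=general
---
[Sample article 1 text ...]
---
[Sample article 2 text ...]
```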

A large section of prompt engineers don't agree with this method, and the output quality improvement from this process is also not remarkable for many use cases. However, it works well for specific ones. If you want to see how it works, I am happy to have a call. You can write to me at banerjee.sebabrata@gmail.com or sebo@thepromptengineers.in. Cheers!


Did you put what you used to train the GPT on how to answer into a document? For example, how it should answer in special circumstances? @sebo001

That, and more. Mostly I use it as a prompt extender that is invoked in special circumstances, or for training the GPT on a specific style guide.

By the way, I am hooked on MindStudio these days. OpenAI needs to realize they should have built a platform like MindStudio, with real automation and multi-step, multi-model step builders.