About fine tuning - Your opinion?

_j · October 25, 2023, 11:15pm

I can give a layman idea of attention layer consumption, from my similar position as a layman basically, with a linguistic analogy that is parallel to what comes before the open-ended generation of a decoder-only transformer.

Abstract: A whole bunch of unexhausted attention layers is all you need

“Here’s the file you said you needed. It has Joe’s calendar, and Becky’s tasks, and her reminders. Take her reminders and add them to his calendar. Then the remaining items also must be added in some way, so take them and look at his stuff and see if he also has similar entries, and if they are still unclear, put them in that error report instead adding those with dates possibly in error to that. Then they all can be added to the new file for the outsourced workers.”

I was going to bold the tokens that require attention to find the internal references and paths back to the meaning, but you can start at pronouns, and then other anaphoric references, and see the whole passage I wrote needs attention and would be more bold than not. Basically a large remapping of where token production should actually be looking for meaning.

One can see there’s hella code even early on, by looking at the ranks of the cl100k tokenizer.

What there also certainly is: millions of instructions that are the quality of upvoted GPT-4 answers fed into ongoing gpt-3.5 internal tunes. Quality hard to beat unless you need to deviate from that professional-setting disclaimed chat with refusals for 8x the price.

Topic		Replies	Views
Do 'MAX tokens' include the follow up prompts and completion in a single chat session API token	22	6007	August 25, 2023
Hypothetical Token-increase Strategy . Community gpt-4 , chatgpt	21	738	March 17, 2025
It looks like GPT-4-32k is rolling out API gpt-4	201	74086	July 16, 2023
Fine-tuning vs Context-Injection (RAG) Prompting gpt-4 , gpt-35-turbo , chatgpt	5	13709	December 11, 2023
Processing Large Documents - 128K limit API gpt-4	40	9611	February 9, 2024

About fine tuning - Your opinion?

Related topics