Fine-tuning a codex model?

bcjordan · August 21, 2021, 5:31am

Is it possible to fine-tune either of the Codex models? I’d love to play with some block-based coding datasets. The stock davinci model seems to know a bit about the structure and internals of Blockly, but doesn’t seem to have seen many samples of blocks and what they do in various contexts.

I could try a really long prompt with them, but I have had such good results with fine-tuning that I would love to try that as well.

vertinski · August 21, 2021, 12:15pm

Greetings @bcjordan! I too am waiting for this powerful option. (Keeping an eye on this thread.)

Datasculptor · August 21, 2021, 3:50pm

“Evaluating Large Language Models Trained on Code” https://arxiv.org/pdf/2107.03374.pdf

Datasculptor · August 21, 2021, 5:32pm

According to the paper, Codex is a GPT language model fine-tuned on publicly available code from GitHub (Python).
I do not know whether further fine-tuning would have the desired effect; perhaps it would on data that is not available on GitHub?

marcin.woz · August 30, 2021, 10:56am

I’m writing an app that’s focused on a specific field of coding, so targeted fine-tuning is something I’m looking for myself.

marcin.woz · August 30, 2021, 1:51pm

I’m writing a tool for coding automated tests in Selenium.

Nephilim · August 30, 2021, 8:04pm

Same here. I’m using it for astrophysics and biological programming. If I could fine-tune some of the models we have, I think it would perform much better than it does now.

monkeydust · September 3, 2021, 8:55am

I don’t think it is yet. I asked a week ago and was told it’s not available. I suspect it’s only a matter of time, and perhaps $.

pkulko · September 28, 2021, 10:32pm

I’m interested in fine-tuning Codex as well, because right now I’m struggling to generate correct code with the correct API calls. I notice that it sometimes produces imaginary API calls. For example, when I ask it to use the Office JavaScript API, it can produce something like this:

      const currentWorksheet = context.workbook.worksheets.getActiveWorksheet();
      const table = currentWorksheet.tables.getItemAt(0);
      //add an AutoSum for each of the 5 columns of the table
      table.columns.getItemAt(0).summary.addAutoSum("Total");

In the above example, there is no “summary” field on TableColumn in the office-js API, and there is certainly no addAutoSum() method. All of that was confabulated. I do provide a reference to the office-js API at the beginning of the prompt. I even tried providing several working examples of code with the correct API calls so that it gets the idea better. Providing the code examples does help a bit, but the input length is limited. If we could fine-tune it on many code snippets from a particular API, that could significantly increase the probability of generating correct code.
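For anyone trying the same workaround, here is a rough sketch of packing working examples into a prompt without going over the input limit. It is only an illustration: `buildFewShotPrompt` and its parameters are hypothetical names, not part of any API, and the budget is counted in characters as a crude stand-in for tokens, so a real limit would need a proper tokenizer.

```javascript
// Hypothetical helper (not part of any library): packs as many worked
// examples as fit into a prompt under a size budget.
// Assumption: the budget is measured in characters, a crude proxy for tokens.
function buildFewShotPrompt(apiNotes, examples, task, maxChars) {
  const header = `// API notes:\n${apiNotes}\n\n`;
  const footer = `// Task: ${task}\n`;
  let body = "";
  for (const ex of examples) {
    const block = `// Task: ${ex.task}\n${ex.code}\n\n`;
    // Stop at the first example that would push the prompt past the budget;
    // the header and the final task line are always kept.
    if (header.length + body.length + block.length + footer.length > maxChars) {
      break;
    }
    body += block;
  }
  return header + body + footer;
}
```

Examples are added in the order given, so putting the ones closest to the current task first makes the most of the limited space.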

keshavchander100 · July 21, 2023, 5:19am

Hi @marcin.woz, could you please explain your use case? I am also working on generating Selenium code: I write a prompt from the manual steps and create the tests as Selenium code.
Note: it’s still in progress, not yet completed.
