How does the generative aspect of GPT impact my models?

branchette · February 13, 2023, 12:45am

Say I fine-tune a model such as davinci. How would the generative aspect of the AI impact the prompts I create? Without going into the neurons, can you say how the data I provide is leveraged to generate new content?

ruby_coder · February 13, 2023, 2:25am

Hi @branchette

The base davinci model is pre-trained, and fine-tuning does not affect or change the pre-trained model.

Fine-tuning (in the GPT generalized architecture) occurs outside the pre-trained deep artificial neural network, in a component referred to in the architecture as the “decoder”.

According to the architecture, the decoder takes the output from the model and performs a number of key tasks preparing the data for output.

Fine-tuning affects that part of the “prepare the data for output” process.

Hope this helps.

branchette · February 13, 2023, 4:37pm

Thanks for your answer, @ruby_coder. Does the same hold true for embeddings?

ruby_coder · February 13, 2023, 4:44pm

Please be specific if you have a question related to embeddings.

Thanks!

branchette · February 13, 2023, 6:14pm

I am a bit confused, and that might show in my question. I sent this prompt to ChatGPT: “Write python code to call a rest service.” As a result, it generated Python code with a hard-coded URL.

Then I told it to modify the code to retrieve the URL from an environment variable, and it did so beautifully.

The question is how I achieve the same behaviour for code I would process using embeddings. Suppose I create embeddings for some additional domain-specific Python functions. How can I do the same refinement that GPT does?

ruby_coder · February 13, 2023, 6:28pm

Yes, you are.

Text embeddings are not generative text.

Text embeddings are vectors that represent text.

branchette · February 13, 2023, 6:33pm

So how, then, do I take advantage of the generative aspect of GPT if neither embeddings nor fine-tuning is the answer?

ruby_coder · February 13, 2023, 6:40pm

Sorry again, @branchette, but I have no idea what you are referring to or what you are trying to accomplish with this line of questioning.

Sorry again.

curt.kennedy · February 13, 2023, 6:50pm

You would use the embedding (vector) to find the most similar stored text, and then feed that actual text (not the vector) into a GPT-3 prompt. That is how you would use the “generative aspect of GPT” when using embeddings.
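As a rough illustration of that flow, here is a minimal sketch, not a definitive implementation. The three-dimensional vectors are made up by hand (real embeddings would come from an embedding model and be far longer), and the names `INDEX`, `top_matches`, and `build_prompt` are hypothetical helpers introduced only for this example. It ranks stored snippets by cosine similarity and then places the matching text into a prompt.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Toy index of (text, embedding) pairs. The vectors are invented for
# illustration; a real index would store model-generated embeddings.
INDEX = [
    ("def get_orders(client): ...", [0.9, 0.1, 0.0]),
    ("def send_email(to, body): ...", [0.0, 0.2, 0.9]),
    ("def get_customers(client): ...", [0.8, 0.3, 0.1]),
]


def top_matches(query_vec, index, k=2):
    """Return the k texts whose embeddings are most similar to query_vec."""
    ranked = sorted(
        index,
        key=lambda item: cosine_similarity(query_vec, item[1]),
        reverse=True,
    )
    return [text for text, _ in ranked[:k]]


def build_prompt(question, snippets):
    """Put the retrieved text (not the vectors) into the prompt."""
    context = "\n\n".join(snippets)
    return f"Use the following code as reference:\n\n{context}\n\nTask: {question}\n"


query_vec = [1.0, 0.2, 0.0]  # stand-in for the embedding of the question
snippets = top_matches(query_vec, INDEX)
prompt = build_prompt("Write a function that lists all orders.", snippets)
```

The resulting `prompt` string is what would be sent to the generative model; the embeddings are only used to decide which text goes into it.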

branchette · February 13, 2023, 11:25pm

OK. Thx all. Let me give a concrete example.

Me: Write python code to say hello:
ChatGPT3: Here’s a simple Python code to print “Hello”:

print("Hello")

Me: Make it a function
ChatGPT3: Sure, here’s the same code as a function:

def say_hello():
    print("Hello")

Me: write code in programming language Tormat to call rest api
ChatGPT: I’m sorry, but Tormat is not a recognized programming language.

Now, I want to train a model so the chat bot responds to this question. Not only that, I want it to also be able to iteratively change the code the way it does for python. Thoughts?

curt.kennedy · February 13, 2023, 11:48pm

For code, have you tried using Codex via the code-davinci-002 model?

Not sure if training regular old GPT-3 on code will lead to good results. But Codex is trained on code specifically.

branchette · February 14, 2023, 2:17am

I see here how embeddings are used for searching code:

https://github.com/openai/openai-cookbook/blob/main/examples/Code_search.ipynb

(From the notebook: “Code search. We index our own openai-python code repository (https://github.com/openai/openai-python), and show how it can be searched. We implement a simple version of file parsing and extracting of functions from python files.”)

I understand the general idea of what needs to be done. But the fundamental question I am not able to answer is how the OpenAI model improves or refines an answer.

Take the Python example I gave previously. First I tell it to write code to say hello, and it does. Then I ask it to make it a function, and it does.

Does this mean that both the simple hello statement and the hello function had to be entered into the model individually, or did the model generate the function based on some knowledge it has?

I did the same test with another case. I asked it to write code to make a REST API call, and it did, with a hard-coded URL. Then I asked it to extract the URL from an environment variable, and it did.

So, I understand how to use embeddings to search models and return the most relevant piece of code. But I am trying to figure out how the model can build on that piece of code to generate even more complex code segments.

curt.kennedy · February 14, 2023, 2:36am

@branchette What that cookbook does is embed code from an already existing repo and then search those embeddings. It isn’t generating code. To generate code, use Codex or another code generation API (if there are any).

Searching with embeddings is usually a precursor to prompt formation in GPT-3. But the embedding doesn’t create new information, it encodes it to a vector so you can do math on it (mainly searching for similar vectors, and then returning the top contents).

So in your case, to get it to work: embed all your code, get the top hits, and feed them into a Codex prompt to get it to refine them. If you do this, it might pop out something that is reasonable.

Oh and try it out in the playground first before going through all the hassle of embedding and ending up with lackluster results.
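One way to picture the refinement step, as a sketch under stated assumptions rather than a description of how any particular product works internally: a completion-style call only sees what is in the prompt, so each follow-up request can carry the retrieved snippets and the code produced so far. `build_refinement_prompt` and `fake_complete` are hypothetical names introduced here; `fake_complete` is a stub that returns canned text and stands in for a real model call.

```python
def build_refinement_prompt(reference_snippets, turns, instruction):
    """Assemble one prompt from retrieved code, earlier turns, and a new request.

    reference_snippets: texts returned by the embedding search (may be empty)
    turns: list of (request, generated_code) pairs from earlier rounds
    instruction: the new change being asked for
    """
    parts = []
    if reference_snippets:
        parts.append("Reference code:")
        parts.extend(reference_snippets)
    for request, code in turns:
        parts.append(f"Request: {request}")
        parts.append(f"Code:\n{code}")
    parts.append(f"Request: {instruction}")
    parts.append("Code:")
    return "\n\n".join(parts)


def fake_complete(prompt):
    """Hypothetical stand-in for a code model call; returns canned text."""
    return 'def say_hello():\n    print("Hello")'


# Round 1 already happened: the request and the code it produced.
turns = [("Write code to say hello", 'print("Hello")')]

# Round 2: the earlier code travels inside the new prompt.
prompt = build_refinement_prompt([], turns, "Make it a function")
turns.append(("Make it a function", fake_complete(prompt)))
```

In this sketch, the earlier output is simply part of the text the model is asked to continue, which is why a follow-up such as "Make it a function" has something to build on.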
