Does anybody know if this method of finetuning GPT-3 without a specific task still works?
If so, what is going on under the hood / what actual training is occurring through this method. Thanks!
Does anybody know if this method of finetuning GPT-3 without a specific task still works?
If so, what is going on under the hood / what actual training is occurring through this method. Thanks!