Per the title, I have quite a number of OpenAI credits expiring this month.
I hope to create some fine-tuning datasets to upload to Hugging Face.
Any ideas would be appreciated!
That’s an interesting offer.
Do you have the option to fine-tune GPT-4, or only GPT-3.5?
Only GPT-3.5.
Any ideas on Q&A datasets?
No need to worry. I updated the tags to properly match your request.
Hopefully this will help you find some nice ideas!
Edit: I thought it was disallowed to use outputs from OpenAI as training data for other models, but I guess that's only if you're going to "compete with OpenAI's models".
How about:
Run embeddings on a chunked database of everything you have that could inform an AI application better.
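A rough sketch of what that could look like, in case it helps. The chunk size, the overlap, and the model name `text-embedding-3-small` are my own assumptions, not something from this thread, and `embed_chunks` assumes the `openai` Python package (v1 or later) is installed with `OPENAI_API_KEY` set.

```python
def chunk_text(text: str, max_chars: int = 1000, overlap: int = 100) -> list[str]:
    """Split text into fixed-size character windows that overlap slightly."""
    if max_chars <= 0 or not 0 <= overlap < max_chars:
        raise ValueError("need max_chars > 0 and 0 <= overlap < max_chars")
    step = max_chars - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunk = text[start:start + max_chars]
        if chunk.strip():  # skip whitespace-only windows
            chunks.append(chunk)
        if start + max_chars >= len(text):
            break  # the last window already reached the end of the text
    return chunks


def embed_chunks(chunks: list[str], model: str = "text-embedding-3-small") -> list[list[float]]:
    """Return one embedding vector per chunk (model name is an assumption)."""
    from openai import OpenAI  # third-party package, imported lazily

    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    response = client.embeddings.create(model=model, input=chunks)
    return [item.embedding for item in response.data]
```

Character-based chunking is the simplest option; splitting on paragraphs or tokens would probably give cleaner chunks, and a large corpus would need to be sent in several batches rather than one call.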
Do one on this Discourse forum.
Many questions are asked here, and while no AI can currently answer them all, a few can answer one every now and then, such as the ask-ai channel on the OpenAI Discord forum. That bot is by Kapa.ai and uses RAG.
It would be interesting to see whether a fine-tuned model can do a better job.
A better way to access the posts on this forum for use with RAG or fine-tuning is to pull them as JSON instead of HTML. Just add .json to the URL.
For example, the URL of this topic is
https://community.openai.com/t/alot-of-api-credits-expiring-this-month-any-ideas-on-finetuning-datasets-i-should-create/639600
To get the JSON response, use
https://community.openai.com/t/alot-of-api-credits-expiring-this-month-any-ideas-on-finetuning-datasets-i-should-create/639600.json
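Here is a minimal sketch of pulling a topic that way using only the Python standard library. It assumes the usual Discourse topic layout, where posts sit under `post_stream.posts` and each post carries its rendered HTML in `cooked`; the helper names are made up for illustration, and the forum may want a User-Agent header or rate limiting that this sketch does not handle.

```python
import json
from html.parser import HTMLParser
from urllib.parse import urlsplit, urlunsplit
from urllib.request import urlopen


def topic_json_url(topic_url: str) -> str:
    """Append .json to a topic URL, leaving any query string in place."""
    parts = urlsplit(topic_url)
    path = parts.path.rstrip("/")
    if not path.endswith(".json"):
        path += ".json"
    return urlunsplit((parts.scheme, parts.netloc, path, parts.query, ""))


class _TextCollector(HTMLParser):
    """Collects the text nodes of an HTML fragment and ignores the tags."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def html_to_text(html: str) -> str:
    """Strip tags from a post body and collapse runs of whitespace."""
    collector = _TextCollector()
    collector.feed(html)
    collector.close()
    return " ".join("".join(collector.parts).split())


def extract_posts(topic: dict) -> list[dict]:
    """Pull post number, author, and plain text out of a parsed topic."""
    posts = topic.get("post_stream", {}).get("posts", [])
    return [
        {
            "post_number": post.get("post_number"),
            "username": post.get("username"),
            "text": html_to_text(post.get("cooked", "")),
        }
        for post in posts
    ]


def fetch_topic(topic_url: str) -> dict:
    """Download and parse the JSON version of a topic (makes a network call)."""
    with urlopen(topic_json_url(topic_url)) as response:
        return json.load(response)
```

Usage would be something like `extract_posts(fetch_topic(url))`. As far as I know, long topics only return a first batch of posts in this response, so the rest would need extra requests.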
I will warn you that this forum is an echo chamber.
Really great idea! Thanks for sharing.