Hi everybody, a couple of newbie questions:

1) Is it possible to use a custom model for domain-specific translation, like translating medical concepts from one language to another, by feeding a GPT-3.5 model (from my information, the 4th version cannot be customized yet) with translation memories in the form of source_language_concept - target_language_concept pairs as JSONL-formatted files (see the sketch below for the kind of format I mean)? From your experience, does this improve the accuracy of the model afterwards? Is there any example of such a project available in the community or on the web?

2) What would be the OpenAI licensing model suitable for this? Is the "Plus" upgrade from the free license sufficient, or does it require a higher-priced plan?

Thanks everybody, and looking forward to your helping hand.
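To make the format concrete: a translation memory serialized as JSONL for OpenAI's legacy fine-tuning endpoint holds one prompt/completion pair per line, with a fixed separator ending the prompt and a leading space starting the completion, as OpenAI's dataset guidelines recommend. The medical terms and the English-to-German direction here are invented placeholders:

```jsonl
{"prompt": "Translate the medical term from English to German: myocardial infarction ->", "completion": " Myokardinfarkt"}
{"prompt": "Translate the medical term from English to German: hypertension ->", "completion": " arterielle Hypertonie"}
```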
Welcome to the OpenAI community @fb4docs
Here’s a sample before fine-tuning
The gpt-3.5 and gpt-4 models will be available for fine-tuning later this year.
ChatGPT Plus is a subscription plan for chat.openai.com.
If you want to use the API (api.openai.com), you have to add a payment method; billing occurs monthly on a pay-as-you-go basis for the tokens consumed, per the pricing page.
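To make pay-as-you-go concrete: every API response includes a usage object with the token counts the call consumed, and those counts are what the monthly bill is based on. A minimal sketch using the openai Python package (0.x-style interface; the key, model choice, and prompt are placeholders):

```python
import openai

openai.api_key = "sk-..."  # your secret key from platform.openai.com

response = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    messages=[
        {"role": "system", "content": "You are a medical translator."},
        {"role": "user", "content": "Translate into German: myocardial infarction"},
    ],
)

print(response.choices[0].message.content)
# Billing is computed from these token counts:
print(response.usage)  # prompt_tokens, completion_tokens, total_tokens
```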
Thanks for the info. Is this a model you've already fine-tuned on a base model below 3.5? You mentioned that fine-tuning isn't yet available for 3.5 and above.
Hello!
Could you tell me how to create fine-tuning data from a thematic dictionary of 40,000 entries?
Hi @saharin
Here’s OpenAI’s guide to preparing a fine-tuning dataset
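As a concrete starting point for a 40,000-entry dictionary, here is a sketch that converts a tab-separated term list into the prompt/completion JSONL the guide describes. The file names, the tab separator, and the prompt template are assumptions to adapt to your data:

```python
import json

# Assumed input: one "english_term<TAB>ukrainian_term" pair per line.
with open("dictionary.tsv", encoding="utf-8") as src, \
        open("train.jsonl", "w", encoding="utf-8") as out:
    for line in src:
        english, ukrainian = line.rstrip("\n").split("\t")
        record = {
            # A fixed separator ending the prompt and a leading space in
            # the completion follow OpenAI's dataset guidelines.
            "prompt": f"Translate the military term from English to Ukrainian: {english} ->",
            "completion": f" {ukrainian}",
        }
        out.write(json.dumps(record, ensure_ascii=False) + "\n")
```

The openai package also ships a validator, `openai tools fine_tunes.prepare_data -f train.jsonl`, which checks the file and suggests fixes before you upload it.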
Can you elaborate more on this?
We want to add translator functionality to the English-Ukrainian military dictionary website: it would work using a GPT model trained on the dictionary data, with a user interface similar to Google Translate.
GPT comes pre-trained.
Before you proceed with fine-tuning, you should check whether the model already achieves what you are looking for using the Playground.
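If scripting is easier than clicking through the Playground, the same sanity check can be run against the API: send a handful of dictionary terms to the base model and inspect the translations before investing in fine-tuning. A sketch, with the term list and model choice as assumptions:

```python
import openai

openai.api_key = "sk-..."

# A few dictionary entries to spot-check the base model with.
terms = ["ceasefire", "reconnaissance", "ammunition depot"]

for term in terms:
    response = openai.ChatCompletion.create(
        model="gpt-3.5-turbo",
        messages=[{
            "role": "user",
            "content": f"Translate the military term into Ukrainian: {term}",
        }],
        temperature=0,  # deterministic output makes spot checks easier
    )
    print(term, "->", response.choices[0].message.content)
```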