Newbie Seeking Advice: Fine-Tuning GPT-3.5 Turbo for Civil Engineering Domain in Korean

hyungjin0706 · May 8, 2024, 3:34pm

Hello OpenAI Community,

I’m a newbie here and currently working on developing a Korean chatbot specifically tailored for the civil engineering domain. My goal is to fine-tune the GPT-3.5 turbo model to effectively recognize and handle specialized terminology in this field.

To achieve this, I have a bilingual glossary with 25,000 entries, containing both Korean and English translations of civil engineering terms. I am considering the best way to utilize this glossary to construct my dataset and enhance the model’s performance in recognizing these domain-specific terms.

Here are a few points I’m particularly seeking advice on:

1.Dataset Construction:
How should I structure my dataset using this glossary for the most effective fine-tuning? Should I include example sentences, or is a list of term translations sufficient?
2.Fine-Tuning Practices:
What are the best practices I should follow when fine-tuning the GPT-3.5 turbo model for this specialized domain? Are there specific parameters or techniques that are particularly effective for domain-specific language models?
3.Handling Bilingual Terms:
Given the bilingual nature of the glossary, how can I ensure the model effectively understands and translates between Korean and English civil engineering terms?
Any advice or suggestions would be greatly appreciated!

Thank you!

Topic		Replies	Views
Seeking Guidance on Fine-Tuning GPT-3.5 Turbo for a Biography-Based AI Chatbot API api	0	349	December 26, 2023
Custom model for domain-specific translation Community chatgpt	6	867	September 5, 2023
Fine Tune on GPT-3.5 Turbo Instruct API api	3	286	March 24, 2024
Building Own Knowledge Base LLM Community embeddings , chatgpt , api , assistants-api	3	783	April 8, 2024
Could Someone Give me Advice on Best Practices for Training Large Language Models? Community large-language-model , training	0	135	April 29, 2024

Newbie Seeking Advice: Fine-Tuning GPT-3.5 Turbo for Civil Engineering Domain in Korean

Related Topics