Transferability of multiple languages

leaf · September 30, 2023, 10:45am

There’s a large dataset of human annotated questions and answers for a database query language - it’s the only one that exists. But, all of it is in Chinese. The actual query language is universal, but the questions and entities are Chinese.

If I fine-tuned on this, and asked questions in English, would it work out? I remember hearing some research about it’s transferability but I’m not certain how this has been seen to work

sps · September 30, 2023, 11:11am

Hi @leaf

You can try translating the dataset to English, while retaining the entities in Chinese.

Can you share a prompt completion pair?

leaf · September 30, 2023, 11:30am

{
		"query": "云艺文华的全称你知道是什么？", 
		"cypher": "match (:ENTITY{name:'云艺文华'})<-[:Relationship{name:'简称'}]-(h) return h.name", 
		"answer": [{"h.name": "云南艺术学院文华学院"}]
	}

I’m not sure if translation is reliable enough - all of the semantic relationships between words would need to transfer perfectly for it to still be reliable.

_j · September 30, 2023, 12:35pm

I always like a ponderous question.

We are barely even shown examples of how to make a sarcastic bot, so deeper levels of fine-tune, one really needs to think logically about how language model AI acts.

Hypothetical: What if, in my examples, I tuned an AI on only responding in Chinese to my English questions. Or only in English to my Chinese questions? If I said “no Chinese”, could it still answer the questions trained in Chinese?

I have a feeling that Chinese → Chinese knowledge examples in fine-tune may be much harder to activate with English inputs.

sps · September 30, 2023, 4:14pm

I’m not familiar with chinese but here’s an experimental translation: OpenAI Platform

Topic		Replies	Views
Fine tuning using a corpus API api	8	2275	July 13, 2023
Understanding GPT-3 and multi-language API	1	4734	December 25, 2022
Can vector base data be stored in chinese? GPT builders	1	110	December 5, 2024
Fine tuning improvement on partial translation tasks API api , fine-tuning-problems	0	563	February 6, 2024
Multi language support like Jarvis.ai, copy.ai and most other websites Prompting	5	1484	December 17, 2023

Transferability of multiple languages

Related topics