ChatGPT Spanish support

I am new to AI and ML.
I have been studying and using ChatGPT regarding world problems such as global warming and similar.
I read (not tried) that ChatGPT can recognize some Spanish text. This is not for translation but to generate answer to the user.
Is there a project to add more support in Spanish for ChatGPT engines?


Hey Pablo, right now, most of our models are purposely trained to use mostly English (see here in our embeddings limitations: OpenAI API). Hopefully this will expand over time but for now, any good Spanish results are a bonus.

See the GPT-3 paper for more details (GPT-3 powers ChatGPT):

For GPT-2 a filter was used on a multilingual collection of documents to produce an English only dataset due to capacity concerns. Even with this filtering GPT-2 showed some evidence of multilingual capability and performed non-trivially when translating between French and English despite only training on 10 megabytes of remaining French text. Since we increase the capacity by over two orders of magnitude from GPT-2 to GPT-3, we also expand the scope of the training
dataset to include more representation of other languages, though this remains an area for further improvement. As discussed in 2.2 the majority of our data is derived from raw Common Crawl with only quality-based filtering. Although GPT-3’s training data is still primarily English (93% by word count), it also includes 7% of text in other languages. These languages are documented in the supplemental material.

Hello Patrick
Thanks for the quick feedback.
I cannot imagine the huge amount of work that Openai has done to produce chatgpt.
I hope projects to add new languages will start in the future because, Spanish is the 2nd most spoken in the world with with 471 million (probably not in internet) after Chinese with 1.3 Billion and English is the 3rd most spoken language with 370 million. But In internet, I guess that English is the most common language.
One of technology barriers (internet and books) to non English speakers, is language. I hope this will change with new projects


1 Like