What do all these models do?

there are a lot of models. what do all these models do?

id
ada
ada:2020-05-03
ada-code-search-code
ada-code-search-text
ada-search-document
ada-search-query
ada-similarity
babbage
babbage:2020-05-03
babbage-code-search-code
babbage-code-search-text
babbage-search-document
babbage-search-query
babbage-similarity
code-cushman-001
code-davinci-001
code-davinci-002
code-davinci-edit-001
code-search-ada-code-001
code-search-ada-text-001
code-search-babbage-code-001
code-search-babbage-text-001
curie
curie:2020-05-03
curie-instruct-beta
curie-search-document
curie-search-query
curie-similarity
curie-similarity-fast
cushman:2020-05-03
davinci
davinci:2020-05-03
davinci-if:3.0.0
davinci-instruct-beta
davinci-instruct-beta:2.0.0
davinci-search-document
davinci-search-query
davinci-similarity
if-curie-v2
if-davinci:3.0.0
if-davinci-v2
text-ada:001
text-ada-001
text-babbage:001
text-babbage-001
text-curie:001
text-curie-001
text-davinci:001
text-davinci-001
text-davinci-002
text-davinci-edit-001
text-davinci-insert-001
text-davinci-insert-002
text-search-ada-doc-001
text-search-ada-query-001
text-search-babbage-doc-001
text-search-babbage-query-001
text-search-curie-doc-001
text-search-curie-query-001
text-search-davinci-doc-001
text-search-davinci-query-001
text-similarity-ada-001
text-similarity-babbage-001
text-similarity-curie-001
text-similarity-davinci-001

1 Like

Ada, Babbage, Curie, Cushman and Davinci are different models doing the same, but having different size. The model with a higher size tends to give better results, but has a higher price and takes more time to give result (because it requires more computation). Davinci is the model with the biggest size.

The models that have “code” in the name are part of “Codex” - they serve the purpose of generating code.

The models that have “text” in the name serve the purpose of generating plain text (that’s their primary purpose, but they can also generate code to a small extent).

The models that have “search” or “similarity” in name are for “embeddings” - they serve the purpose of finding similar texts (as described in the documentation under “embeddings”).

The models that have “search-code” in the name are for searching with “code”, “search-text” is for searching with text. The models that have “search” in the name, but not “code” are for searching text (“document” is for specyifing the documents among which you search, “query” is for the query by which you search). “similarity” is for finding similar documents as well, but there’s some difference between “search” and “similarity”. From what I’ve remember it’s mostly about the length of the searched documents.

The models with “edit” are for editing code or text (as opposed to completing it).

The models with “instruct” are the models trained specifically for being able to deal with the input (prompt) in a form or instructions.

The models with “insert” are for insertions (you pass [insert] in the prompt and it generates text/code in the middle of the prompt, in place where you put “insert”, instead of generating at the end).

“001”, “002” are different versions. I assume “002” are better than “001” becuase 002 is an improved version of 001.

I don’t know what the models with “if” are.

2 Likes

thank you and excellent! we will try them out as such!

Ack. I found this incredibly helpful post after I wrote a page long post asking what the models do. This post should at the top of the Similarity Search / Embeddings doc pages! :scream:

1 Like