We have a vibrant free market in software in general. And there is particularly a vibrant market for new AI software.
Given that, why is there a near absence of out-of-the-box embedding software?
I have seen a few SaaS consultancies offering embedding services, apparently at enterprise prices that are only quoted during a sales call. Why doesn’t someone offer either local or cloud-based embedding software at reasonable pay-as-you-go prices similar to AI model API offerings?
Well, I personally do not think that embeddings are far superior to traditional search methods in the “biggest” search use case, which (in my view only) is full-text DB search; so maybe others do not see the “huge demand” you mention?
Full-text DB searches are faster than using vectorized methods; and full-text DB searches are well established.
It seems to me we are simply at a high point in a hype cycle where many people are enamored with vectorized search because of OpenAI’s current popularity; and of course these embeddings are “not free”, so why would you pay good money to vectorize entries in a DB and perform vectorized searches when you get very good performance with full-text DB searches (and that is free)?
I am a newbie in this regard so by all means correct me if I am wrong. But my understanding is that vectorized searches are accepted as the best way to do a semantic search. Can that be done in a full-text search without lots of additional manual semantic tagging?
The stated rationale/benefit for vector searches is that they allow a search for similarity, not just a search for an exact match.
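For anyone following along, here is a rough sketch of what "search for similarity" means in practice. The three-dimensional vectors below are invented by hand purely for illustration; a real system would get much longer vectors from an embedding model, and the function names are my own, not from any library.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors: close to 1.0 means same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Hypothetical "embeddings", made up for this example (not model output).
docs = {
    "how to fix a flat tire": [0.9, 0.1, 0.0],
    "repairing a punctured wheel": [0.8, 0.2, 0.1],
    "best pasta recipes": [0.0, 0.1, 0.9],
}

# Pretend this is the embedding of the query "mend a bike tyre".
query_vec = [0.85, 0.15, 0.05]

def rank(query_vec, docs):
    """Return document texts ordered from most to least similar to the query."""
    return sorted(
        docs,
        key=lambda text: cosine_similarity(query_vec, docs[text]),
        reverse=True,
    )

ranked = rank(query_vec, docs)
for text in ranked:
    print(text)
```

The point is that the query shares no words with the two tire documents, yet they rank above the pasta one because their vectors point in a similar direction. Whether real embeddings behave this nicely depends on the model.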
While I will not claim to be an expert, the performance of Google vs. GPT-3 certainly is consistent with the theory. Whatever theoretical reason you ascribe to their different performance characteristics, the capability introduced by GPT-3 is profound.
You are mistaken about full-text DB searching. Full-text searches are not exact-match searches, as you suggested above.
Full-text search refers to searching some text inside extensive text data stored electronically and returning results that contain some or all of the words from the query. In contrast, traditional search would return exact matches.
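To make that distinction concrete, here is a toy sketch in plain Python. It is not how any real DB engine implements full-text search (those use inverted indexes, stemming, ranking functions and so on); the function names and sample documents are invented for illustration only.

```python
import re

def tokenize(text):
    """Lowercase the text and split it into a set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def exact_match(query, docs):
    """'Traditional' search as described above: the whole string must be identical."""
    return [d for d in docs if d == query]

def full_text_search(query, docs):
    """Return docs containing some or all query words, most shared words first."""
    query_words = tokenize(query)
    scored = [(len(query_words & tokenize(d)), d) for d in docs]
    # sorted() is stable, so documents with equal scores keep their original order.
    return [d for score, d in sorted(scored, key=lambda s: -s[0]) if score > 0]

docs = [
    "The quick brown fox",
    "A quick guide to SQL indexes",
    "Brown bears in Alaska",
]

print(exact_match("quick brown", docs))       # no document equals the query string
print(full_text_search("quick brown", docs))  # all three share at least one word
```

Note that this toy version only matches on shared words, so a query like "fast" would return nothing here even though one document mentions "quick".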
Anyway, I have provided a perspective on why embeddings have not taken the DB search-world by storm. Of course YMMV!