Vector Similarity Search in Postgres with pgvector, text-embedding-ada-002, and bit.io

dliden · April 10, 2023, 4:07pm

We used the pgvector Postgres extension and embeddings from the text-embedding-ada-002 model to make our documentation semantically searchable in Postgres.

Why does this matter? There are, at this point, quite a few companies that offer specialized vector databases for semantic search. More than one of those explicitly focuses on the use case of searchable docs. In some cases, these specialized solutions might be useful/necessary.

This project showed that there are fast and cost-effective alternatives. The bit.io free tier is more than sufficient for just about any project’s documentation. Generating embeddings for our docs with the text-embedding-ada-002 model cost about $0.02 (and that was after generating embeddings for all of our docs multiple times as we were getting the pipeline set up and tested). The whole thing took us an afternoon to set up, and 80% of that time was spent figuring out how to export our docs.

It’s worth giving this approach a try if you want to use vector similarity search but don’t want to pay for a specialized solution. Best of all—it’s Postgres. It integrates easily with anything that works with Postgres.

PaulBellow · April 10, 2023, 6:05pm

Congrats. Sounds cool. Thanks for sharing with us.

Topic		Replies	Views
Suddenly, [Database] Rows Can Now Have Meaning Community	4	1211	March 15, 2023
Using Redis for embeddings API	21	13597	December 23, 2023
Structured Data & Semantic Search : SQL or text-to-SQL or Vector search? Community vector-db	1	2177	June 28, 2024
Storing embeddings in SQL Server? Latency between Redis & Pinecone? Vector DB recommendations? API	18	7816	December 23, 2023
Am I on the right track with embeddings? API embeddings	6	560	February 7, 2024

Vector Similarity Search in Postgres with pgvector, text-embedding-ada-002, and bit.io

Related topics