Questions about development, embeddings, and handling large datasets

Hi guys.

I’m trying to develop a conversational chatbot using the API, but I’ve hit a dead end now that I’ve started working with large datasets, on the order of 40k–150k rows.
I have a lot of questions about how to work with data at this scale, and whether it’s necessary to use embeddings to relate the user’s question to my dataset, which comes from SQL query results, PDFs, and Excel spreadsheets.

I’d like to know how you handle this. Thanks in advance, because I’ve run out of ideas.

Is your data structured or loose?

It varies, but in about 90% of cases it’s structured.

You can start with function calling, or even embeddings for specific categories, and iteratively narrow down to a very specific sub-section of the loose information.

IMO, function calling should be preferred wherever possible.
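For the "embeddings for specific categories" part, here is a minimal sketch in Python, assuming the `openai` v1 SDK and `numpy`; the category names and embedding model are just placeholders. The idea is to embed a small, static list of categories once, then match the user’s question to the nearest one before narrowing further:

```python
import numpy as np
from openai import OpenAI

client = OpenAI()

# Hypothetical static categories; these rarely change, so their
# embeddings can be computed once and cached.
categories = ["invoices", "inventory", "customer support", "shipping"]

def embed(texts: list[str]) -> np.ndarray:
    resp = client.embeddings.create(model="text-embedding-3-small", input=texts)
    return np.array([d.embedding for d in resp.data])

category_vectors = embed(categories)

def nearest_category(question: str) -> str:
    q = embed([question])[0]
    # Cosine similarity between the question and each category.
    sims = category_vectors @ q / (
        np.linalg.norm(category_vectors, axis=1) * np.linalg.norm(q)
    )
    return categories[int(np.argmax(sims))]

print(nearest_category("How many boxes do we still have in the warehouse?"))
# -> likely "inventory"; from there you narrow into that sub-section only.
```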

I’ve already tried using embeddings to narrow down the results, but I didn’t like what they returned. As you said, loose data works very well, but when I do the same with the results of an Excel query/spreadsheet (structured data with rows and columns), it doesn’t work well.

Do you recommend learning the Assistants API for this, or continuing with Completions + embeddings?

Transform the spreadsheet into an object and then use function calling to narrow it down.
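For example, a rough sketch of that idea with `pandas`, assuming a hypothetical `sales.xlsx` with `region` and `year` columns; the filter function is what you would expose to the model as a tool:

```python
import pandas as pd

# Load the spreadsheet into a DataFrame ("the object").
df = pd.read_excel("sales.xlsx")  # hypothetical file

def filter_rows(region: str | None = None, year: int | None = None) -> str:
    """Narrow the spreadsheet down to the rows the user asked about."""
    result = df
    if region is not None:
        result = result[result["region"] == region]
    if year is not None:
        result = result[result["year"] == year]
    # Return a small, model-friendly slice instead of the whole sheet.
    return result.head(20).to_json(orient="records")

# JSON schema you would register as a tool/function with the model,
# so it can translate the user's question into these parameters.
filter_rows_tool = {
    "name": "filter_rows",
    "description": "Filter the sales spreadsheet by region and/or year.",
    "parameters": {
        "type": "object",
        "properties": {
            "region": {"type": "string"},
            "year": {"type": "integer"},
        },
    },
}
```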

Weaviate is a good option here. It’s a knowledge graph that stores both the content and its embedding equivalent. You can use GraphQL to query by the structure and by the loose semantics.

I have a private GPT I use, and it handles the database fairly well. It fails on inline fragments but usually fixes them on the second attempt.
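As an illustration, a minimal hybrid query with the Weaviate v3 Python client, assuming a local instance and a hypothetical `Product` class whose property names are made up:

```python
import weaviate

# Assumes a local Weaviate instance and a "Product" class whose
# objects were imported along with their embeddings.
client = weaviate.Client("http://localhost:8080")

response = (
    client.query
    .get("Product", ["name", "category", "price"])
    # Structured part of the query: exact filter on a property.
    .with_where({
        "path": ["category"],
        "operator": "Equal",
        "valueText": "electronics",
    })
    # Loose-semantics part: nearest neighbours of the user's phrasing.
    .with_near_text({"concepts": ["affordable wireless headphones"]})
    .with_limit(5)
    .do()
)

print(response["data"]["Get"]["Product"])
```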

Both are suitable. Assistants is essentially a layer on top of Completions. For prototyping, Assistants would probably be better; you can then move to Completions once you’re ready for more control.

One thing to consider is that with Assistants you are locked into their proprietary services. Whatever you build will be tightly intertwined with their service and will become increasingly difficult to pull out if more control is required.


So, I was able to do a lot of this, as I said, while I was dealing with small amounts of data, but once I moved to 40k rows or more, these problems started to appear.

I’m going to research the tools you mentioned, but do you have any other learning material that could help me with this project?

Sorry for asking so many questions and thank you for your attention.

The real problem here is that I need live information, and I don’t know how to do that without it costing too much.


Then figure out a way to reduce the large dataset to a small one. Try to find levels of granularity in your content.

I don’t, sorry. I would recommend tinkering and asking lots of questions :raised_hands:

If you’re dealing with data that changes a lot, you can still use function calling. GPT models are great adapters from natural language to a database query. That way you’re not required to constantly re-embed everything; you can reserve embeddings for static content like categories.
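A minimal sketch of that adapter pattern with the `openai` v1 Python SDK; the `search_orders` tool, its parameters, and the model name are all placeholders for your own schema:

```python
import json
from openai import OpenAI

client = OpenAI()

# Hypothetical tool: the model fills in the query parameters,
# your own code runs the actual (live) SQL query.
tools = [{
    "type": "function",
    "function": {
        "name": "search_orders",
        "description": "Search the orders table for matching rows.",
        "parameters": {
            "type": "object",
            "properties": {
                "customer": {"type": "string"},
                "status": {"type": "string", "enum": ["open", "shipped", "cancelled"]},
                "since": {"type": "string", "description": "ISO date, e.g. 2024-01-01"},
            },
            "required": ["customer"],
        },
    },
}]

response = client.chat.completions.create(
    model="gpt-4o-mini",  # any tool-capable model
    messages=[{"role": "user", "content": "Which of ACME's orders are still open since January?"}],
    tools=tools,
)

tool_call = response.choices[0].message.tool_calls[0]
args = json.loads(tool_call.function.arguments)
# args now holds something like {"customer": "ACME", "status": "open", ...}
# -> plug these into a parameterised SQL query against the live database,
#    so only the small result set ever goes back to the model.
```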


Actually, last I recall (it’s been a while), Pinecone was a leader in documentation, so it’d be worth checking them out. After a quick glance it looks like they’ve stagnated and focused on niche business areas, but there’s still some good older content there.

