Which is the best option to send hundreds of data in openai api?

BrokenSoul · January 7, 2024, 8:47pm

I have a .csv file with hundreds of data, I know that the best option is to make chunks and send to chatgpt, but I need all the context of it to ask about information that is matched.

How can I do it?, by each chunk I need to make an API call?.

Thanks.

sps · January 7, 2024, 9:15pm

Hi @BrokenSoul

If I understand correctly, you want AI to use you data to augment the model’s response to questions.

In that case I’d recommend the Assistants API.

As it does the whole RAG for you and requires minimal setup in case you aren’t familiar with using embeddings on your own for RAG.

BrokenSoul · January 7, 2024, 9:30pm

I though that, but I need to retrieve information about a page and fill it in a .csv file and it can change very often, if I upload a file and replace it in my app each time, problems with sincronization will happen.

if I use embeddings in execution time the resources will be costs and low.

sps · January 7, 2024, 9:43pm

Interesting, as of now assistants doesn’t have web browsing capabilities but it’s supposed to be supported later on.

In the mean-time a code based solution that handles every aspect of question answering along with retrieval can be implemented and the data can be fetched from csv simply using code that’s generated by the model.

If you own the data source or if there’s an event handler dealing with data changes, you can directly update the csv file in real-time.

_j · January 7, 2024, 10:08pm

The application is still quite unspecified in scope.

A handful of varied examples and possible solutions.

“What is the top category that has the most unhappy customers”

data elements should be independently AI sentiment scored, for analysis by function

“What is the predominant theme of the day?”

Chunks can have entity extraction with totals of categories that AI can then handle all summaries.

“what add records also have both change records and delete records?”

Pretty much the AI needs to see all, or you need to put subcategories of records into groups

“what paragraphs of the book are most similar to ‘Tom Sawyer Huck Finn whitewash the fence’”

embeddings

Most of all, there are likely many things where the better answer will be by code and not language processing.

Topic		Replies	Views
Integrating OpenAI API for Comprehensive Knowledge of My Web App API	2	231	January 23, 2025
Best way to achieve this Data Analysis use case? API api , custom-gpt	4	573	November 13, 2024
Is it possible to request a csv file as payload to chatgpt via API? API	4	1501	August 30, 2024
Implementing a file upload in my application using open ai api API gpt-4 , chatgpt , plugin-development , api , chatgpt-plugin	7	7868	January 25, 2024
Analysing Big Data (CSV) via OpenAI API API gpt-4 , plugin-development , api	13	21760	February 2, 2024

Which is the best option to send hundreds of data in openai api?

Related topics