Attach a file to the Assistants or Completions API without chunking/embedding

Hey people,

My task is to parse a large PDF (>70 pages) and query some specific sections of it.
Before GPT-4 was available, I did this by manually parsing the PDF, analysing its structure, and passing only the relevant sections as context to a query made via the Completions API.
This process was not very robust, but GPT-4's large context window allowed me to upload the entire PDF (in the Chat interface) and ask my questions without any loss of quality.
I therefore want to mimic the same behaviour using the API, but the Assistants API with file search didn't help, since the underlying chunking and embedding process resulted in a loss of information.
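For what it's worth, one way to mimic the Chat-interface behaviour is to skip file search entirely: extract the full PDF text yourself and put it directly into the prompt of a single chat completion. A minimal sketch, where the model name and file path are placeholders and the PDF extraction step (e.g. with pypdf) is shown commented out as an assumption:

```python
# Sketch: bypass file-search chunking by passing the entire document text
# as prompt context, mirroring what the Chat interface does with GPT-4.
# Assumptions: "gpt-4-turbo" and "report.pdf" are placeholders; text
# extraction via pypdf is one option among several.

def build_messages(document_text: str, question: str) -> list[dict]:
    """Embed the whole document in the system prompt, then ask the question."""
    return [
        {
            "role": "system",
            "content": "Answer strictly from the document below.\n\n" + document_text,
        },
        {"role": "user", "content": question},
    ]

# from pypdf import PdfReader
# from openai import OpenAI
#
# def extract_full_text(pdf_path: str) -> str:
#     """Concatenate the text of every page, preserving page order."""
#     reader = PdfReader(pdf_path)
#     return "\n".join(page.extract_text() or "" for page in reader.pages)
#
# client = OpenAI()
# resp = client.chat.completions.create(
#     model="gpt-4-turbo",  # any model with a large enough context window
#     messages=build_messages(extract_full_text("report.pdf"), "Summarise section 3."),
# )
# print(resp.choices[0].message.content)
```

The obvious caveat is that the full text must fit in the model's context window, and you pay for the whole document on every request.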
As a workaround, I attached a (supposedly complete) file uploaded via client.files.create to an assistant thread, alongside a vector store containing irrelevant information, and got reasonably good results.
Since this workaround still did not reach the performance I need, I am asking here whether anyone has successfully attached an entire file to query via the API. My ideal solution would be to attach the file once as a "head document" and run around twenty independent requests against it using the Batch API.
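As far as I know, the Batch API has no shared "head document" concept, but the plan above can still be approximated: each line of the batch .jsonl is an independent /v1/chat/completions request that simply repeats the full document text, so no chunking or embedding is involved. A hedged sketch, where the model name, questions, and file names are all placeholders:

```python
# Sketch: ~twenty independent questions over the same document via the
# Batch API. Each .jsonl line is a self-contained chat-completions request
# carrying the full document text as context.
# Assumptions: "gpt-4-turbo" and the question list are placeholders.
import json


def build_batch_lines(document_text: str, questions: list[str],
                      model: str = "gpt-4-turbo") -> list[str]:
    """One JSONL line per question, each repeating the full document."""
    lines = []
    for i, question in enumerate(questions):
        request = {
            "custom_id": f"q-{i}",  # ties each answer back to its question
            "method": "POST",
            "url": "/v1/chat/completions",
            "body": {
                "model": model,
                "messages": [
                    {
                        "role": "system",
                        "content": "Answer from the document below.\n\n" + document_text,
                    },
                    {"role": "user", "content": question},
                ],
            },
        }
        lines.append(json.dumps(request))
    return lines

# from openai import OpenAI
# client = OpenAI()
# with open("batch.jsonl", "w") as f:
#     f.write("\n".join(build_batch_lines(doc_text, my_questions)))
# batch_file = client.files.create(file=open("batch.jsonl", "rb"), purpose="batch")
# job = client.batches.create(
#     input_file_id=batch_file.id,
#     endpoint="/v1/chat/completions",
#     completion_window="24h",
# )
```

The trade-off is token cost: the document is billed once per question rather than once overall, which the Batch API's discounted pricing partly offsets.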

Thank you for any help and ideas!
Best


Were you able to solve this?
