How to feed data for completions, instead of using prompt/answer fine-tuning format?

Very good. Did you find a tutorial covering this whole process?

Upvoting this! Any tutorials?


You can refer to this notebook, which uses embeddings to answer questions based on your own knowledge base.


@i-technology you can use the “langchain” library to ingest the data (PDF, TXT, JSON, etc.). The same library offers text chunking/splitting, embeddings, and a vector store database, so you can then ask questions over your entire data set.
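For anyone wondering what that pipeline boils down to, here is a minimal, library-free sketch of the chunk, embed, store, and retrieve steps. It is not langchain's API: `chunk_text`, `embed`, `build_store`, and `top_k` are hypothetical helper names, the chunk sizes are arbitrary, and the bag-of-words "embedding" is only a toy stand-in for a real embedding model.

```python
import math
import re
from collections import Counter


def chunk_text(text, chunk_size=40, overlap=10):
    """Split text into overlapping word windows (sizes are illustrative only)."""
    words = text.split()
    step = chunk_size - overlap  # assumes overlap < chunk_size
    chunks = []
    for i in range(0, len(words), step):
        chunks.append(" ".join(words[i:i + chunk_size]))
        if i + chunk_size >= len(words):
            break  # the last window already reached the end of the text
    return chunks


def embed(text):
    """Toy embedding: lowercase word counts. A real setup would call an embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def build_store(chunks):
    """In-memory 'vector store': a list of (chunk, vector) pairs."""
    return [(chunk, embed(chunk)) for chunk in chunks]


def top_k(store, question, k=2):
    """Return the k chunks most similar to the question."""
    q = embed(question)
    ranked = sorted(store, key=lambda item: cosine(q, item[1]), reverse=True)
    return [chunk for chunk, _ in ranked[:k]]
```

In use, you would call `chunk_text` on each ingested document, pass the chunks to `build_store`, and call `top_k(store, question)` to pick the snippets that get sent to the model along with the question.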

Thanks, but my post was from over 3 months ago :wink:

So yes, I’ve done this, and it’s actually quite limited, since in the end you are just sending a few retrieved snippets to GPT for summarization.

So outside those snippets, GPT has virtually no clue what you are talking about.
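To make that limitation concrete, here is a rough sketch of how the final prompt usually gets assembled in this kind of setup. `build_prompt` is a hypothetical helper, and the prompt wording and the `max_chars` budget are assumptions rather than any library's template. The point is simply that the model only ever sees whatever snippets fit in the prompt.

```python
def build_prompt(question, snippets, max_chars=1500):
    """Assemble a prompt from retrieved snippets, stopping once the budget is used up."""
    context = []
    used = 0
    for snippet in snippets:
        if used + len(snippet) > max_chars:
            break  # everything from here on never reaches the model
        context.append(snippet)
        used += len(snippet)
    joined = "\n---\n".join(context)
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{joined}\n\n"
        f"Question: {question}\nAnswer:"
    )
```

Anything in your documents that was not retrieved, or that was cut off by the budget, is simply absent from the text the model receives.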