I have 100’s legal documents and their summaries. Each document is long (25000 words) and the summary is json/csv with some fields extracted from the document and other fields abstractive summaries of different sections of the document.
So my input to the fine tune would look like
{“prompt”: “…document text here…”, “completion”: “{case_id: 76dfe, case_date: 2023-01-12, summary: …abstractive summary…, other fields}”
…
…
2 Questions
- Can chat gpt generate output like above (json)?
- Can chat gpt process documents which have 25k - 35k words
Thanks for your time