This is advice to help you win some battles; I still think you will lose the war, but at least it will keep you on what I consider the best next step.
Look very closely at the o1 examples noted earlier. They were created less than a month ago and touch on many of the points raised. However, the examples are high level; for your problem you should have many more LLM agents, with the fact-checking agents grounded in code that does not use AI to validate the facts.
https://platform.openai.com/docs/guides/prompt-engineering
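To make the fact-checking idea concrete, here is a minimal sketch of a non-AI validator that an LLM agent's output could be gated through. The citation format, the `KNOWN_CITATIONS` table, and the `validate_citation` function are all hypothetical illustrations; the point is only that the validator is plain deterministic code, not another LLM.

```python
import re

# Hypothetical table of verified citations; in practice this would be a
# real database, not a hard-coded set.
KNOWN_CITATIONS = {"Smith v. Jones, 530 U.S. 914 (2000)"}

def validate_citation(claim: str) -> bool:
    # Deterministic check: extract a U.S. Reports citation and require it
    # to match the known-citation table exactly. No AI involved.
    match = re.search(
        r"[A-Z][\w.]* v\. [A-Z][\w.]*, \d+ U\.S\. \d+ \(\d{4}\)", claim
    )
    return bool(match) and match.group(0) in KNOWN_CITATIONS

print(validate_citation("See Smith v. Jones, 530 U.S. 914 (2000)."))  # True
print(validate_citation("See Roe v. Doe, 999 U.S. 1 (2030)."))        # False
```

An agent pipeline would reject (or send back for revision) any LLM output whose citations fail this check, rather than asking a second LLM whether the first one was right.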
Also see: a search for papers with just "legal" and "agent" in the title.
Side note:
While this is not a direct comparison or prediction, it points to possible quagmires to avoid.
How IBM’s Watson Went From the Future of Health Care to Sold Off for Parts
IBM Resurrects Watson to Ride the AI Hype Train
One of the more successful uses of LLMs is generating programming source code (often with bugs) and feeding the compilation errors back until the code is correct. The programming language also has to be one on which the LLMs have had much accurate training, e.g. JavaScript or Python, and not a language like Prolog for which there is relatively little training data. I note this because compilers provide the feedback on what is valid or not valid. This critical step is missing from many who apply LLMs and then fail.