Limiting Answer Length with Tokens / Prompting

Without fine-tuning on more concise answer shapes, what are the most effective ways today to limit completions so they are more concise and natural? For example, when asking GPT-4 a broader question, there seems to be a structured template for the response shape: a brief abstract, a detailed list of steps or options, and then a conclusion.

If I just want an options list written more naturally in a paragraph, or a generally more concise response overall, is the most effective approach prompting for those specific types of responses, or are there guardrail parameters or phrases that tend to work better?

Another option I see would be using another model to summarize the output, but that seems like overkill.

Hi - you can achieve a lot with specific prompting. I use GPT-4 for summarization a lot, and over time I have shifted to a prompting strategy that gives the model a general instruction for preparing a summary and then appends a list of specific principles it should follow when creating that summary. In my case that's over 10 specific principles, covering everything from style, structure, and granularity to other very specific points unique to my context.
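As a rough sketch of that strategy (the instruction and principles below are illustrative placeholders, not my actual list), the idea is to assemble one system message from a general instruction plus a numbered list of principles:

```python
# Sketch of the "general instruction + specific principles" prompt strategy.
# Instruction text and principles are illustrative, not the actual prompt.

GENERAL_INSTRUCTION = "Summarize the following text for a busy reader."

PRINCIPLES = [
    "Write a single flowing paragraph; do not use headings or bullet lists.",
    "Keep the summary under 120 words.",
    "Use plain, conversational language; avoid boilerplate openers and closers.",
    "Mention options inline in prose rather than as an enumerated list.",
]

def build_summary_prompt(text: str) -> list[dict]:
    """Assemble chat messages: one system message carrying the general
    instruction plus the numbered principles, then the user's text."""
    numbered = "\n".join(f"{i}. {p}" for i, p in enumerate(PRINCIPLES, start=1))
    system = f"{GENERAL_INSTRUCTION}\n\nFollow these principles:\n{numbered}"
    return [
        {"role": "system", "content": system},
        {"role": "user", "content": text},
    ]
```

The nice part of keeping the principles in a plain list is that the trial-and-error loop described below becomes just editing that list.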

Getting your output in the desired shape will involve some trial and error. Review the outputs for characteristics that you like/dislike, then incorporate these findings into your prompt.

But in a nutshell, if length is your concern, then that can definitely be addressed in your prompt.
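For length specifically, there are two levers you can combine: a prompt-level word limit the model can plan around, and the API's `max_tokens` parameter as a hard cap (which simply truncates mid-sentence, so it should sit well above the prompted length). A minimal sketch, assuming the OpenAI chat-completions request shape, with the model name and word-to-token ratio as rough illustrative choices:

```python
# Two complementary length controls (sketch; model name is illustrative):
# 1) a prompt instruction the model can actually plan a short answer around,
# 2) max_tokens as a hard safety cap -- it truncates mid-sentence, so it is
#    set with generous headroom above the prompted limit.

def concise_request(question: str, word_limit: int = 100) -> dict:
    """Build kwargs for a chat-completion call with both length controls."""
    return {
        "model": "gpt-4",  # illustrative
        "messages": [
            {"role": "system",
             "content": (f"Answer in one natural paragraph of at most "
                         f"{word_limit} words. No headings, no bullet "
                         f"lists, no concluding summary.")},
            {"role": "user", "content": question},
        ],
        # Rough assumption: ~1.5 tokens per English word, plus headroom.
        "max_tokens": word_limit * 3,
    }
```

Relying on the prompt for the target length and on `max_tokens` only as a backstop avoids answers that end mid-sentence.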

Makes sense. How reliable is that kind of prompting at corralling the style and output into the format you are looking for? Or do you just validate after the fact to ensure it meets your guidelines?

Have you also been able to achieve tone / style changes with pure prompting?

I have found it to be very reliable. I do check for style regularly through a manual validation process and then just make changes as necessary.

After the summary is created I have additional controls in place for validation, but they are focused not on style but on other aspects, such as cases where a summary could not be properly generated. Technically, though, you could deploy the strategy of having a second model review the summary against certain criteria.
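A cheap version of that post-generation check doesn't even need a second model for the mechanical criteria. A sketch, with the heuristics purely illustrative (a second-model review would extend or replace them):

```python
# Sketch of a post-generation validation pass. The specific checks are
# illustrative heuristics; a second-model review could cover style/tone.

def validate_summary(summary: str, max_words: int = 150) -> list[str]:
    """Return a list of problems found; an empty list means it passes."""
    problems = []
    words = summary.split()
    if not words:
        problems.append("empty output: summary could not be generated")
    if len(words) > max_words:
        problems.append(f"too long: {len(words)} words (limit {max_words})")
    if any(line.lstrip().startswith(("-", "*", "1."))
           for line in summary.splitlines()):
        problems.append("contains a bullet/numbered list instead of prose")
    return problems
```

Summaries that fail these mechanical checks can be regenerated automatically, leaving only style drift for manual (or second-model) review.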