Can't get a model to follow a specific length / word count

The discussion centers on the challenge of generating summaries of a specified length with OpenAI models. tiagov opens the thread, explaining his struggle to get precise-length summaries for a note-taking app, even after trying techniques such as percentage targets, character counts, and fine-tuned models, all of which yielded poor performance and high error rates.

torronen suggests fine-tuning models for different lengths and selecting the appropriate model based on the input text length. He also advises keeping datasets for future fine-tuning needs.
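torronen's routing idea could be sketched roughly as follows; the length buckets and the `ft:` model IDs below are hypothetical placeholders, not models from the thread:

```python
# Sketch of routing an input to a length-specific fine-tuned model,
# per torronen's suggestion. Bucket limits and model IDs are hypothetical.
LENGTH_MODELS = [
    (100, "ft:gpt-3.5-turbo:short-summaries"),
    (500, "ft:gpt-3.5-turbo:medium-summaries"),
    (float("inf"), "ft:gpt-3.5-turbo:long-summaries"),
]

def pick_model(text: str) -> str:
    """Select the fine-tuned model whose word-count bucket covers the input."""
    words = len(text.split())
    for limit, model in LENGTH_MODELS:
        if words <= limit:
            return model
    return LENGTH_MODELS[-1][1]
```

The selected model ID would then be passed as the `model` parameter of the completion call.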

jr.2509 offers a solution involving dynamic logic to specify summaries by sentence or paragraph counts instead of word counts and emphasizes focusing on content over length. He describes a multi-step approach using dynamic, one-shot prompting.
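One way to sketch jr.2509's idea of specifying sentence counts instead of word counts is a small prompt builder; the function name, the 20-words-per-sentence heuristic, and the prompt wording are illustrative assumptions, not jr.2509's exact approach:

```python
def build_summary_prompt(text: str, target_words: int,
                         avg_words_per_sentence: int = 20) -> str:
    """Translate a word target into a sentence count, which models tend to
    follow more reliably than raw word counts (assumed heuristic: ~20
    words per sentence)."""
    sentences = max(1, round(target_words / avg_words_per_sentence))
    plural = "s" if sentences != 1 else ""
    return (
        f"Summarize the text below in exactly {sentences} sentence{plural}. "
        f"Focus on the key content.\n\n{text}"
    )
```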

tiagov responds, expressing hesitancy to retry fine-tuning given his previous poor results, but says he will consider the suggestion. He later reports trying jr.2509’s sentence-based approach without success and reiterates that he is satisfied with the contextual quality of the summaries; length is the sole issue.

torronen proposes a new idea involving marking sentence ends with indicators for easier word count management, while jr.2509 suggests identifying and addressing issues that cause length discrepancies in summaries. torronen agrees and adds that clarifying the nature of summaries might improve outcomes.
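If the model is asked to emit an end-of-sentence indicator, the output can be audited per sentence; the marker string and function below are illustrative assumptions, not torronen's exact scheme:

```python
def sentence_word_counts(text: str, marker: str = "|END|") -> list:
    """Split model output on an assumed end-of-sentence marker and
    return the word count of each sentence, to audit the running total."""
    return [len(seg.split()) for seg in text.split(marker) if seg.strip()]
```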

mad_cat shares an “Indicator Prompting” method for refining summaries by marking sentences and words for removal to meet word limits, though it involves multiple steps. vb suggests breaking down the summarization task into steps—generating, checking, and modifying text to achieve desired lengths—though acknowledges it may not be scalable for tiagov’s use case.
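vb's generate-check-modify loop might be sketched like this, with a pluggable `generate` callable standing in for an actual chat-completion call; the function, tolerance, and revision wording are assumptions for illustration:

```python
def summarize_to_length(generate, text, target_words,
                        tolerance=0.1, max_rounds=3):
    """Iteratively revise a summary toward a word target: generate,
    check the count, and ask for a shorter or longer revision.
    `generate` is any prompt -> text callable (e.g. an API wrapper)."""
    summary = generate(f"Summarize in about {target_words} words:\n\n{text}")
    for _ in range(max_rounds):
        count = len(summary.split())
        if abs(count - target_words) <= target_words * tolerance:
            return summary  # close enough to the target
        direction = "shorten" if count > target_words else "expand"
        summary = generate(
            f"The summary below has {count} words; {direction} it to about "
            f"{target_words} words without losing key content:\n\n{summary}"
        )
    return summary
```

As vb notes, each round costs an extra model call, which is why this may not scale for tiagov's use case.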

mad_cat and PaulBellow explain the difficulty in achieving specific word counts due to the token-by-token generation process of models, suggesting it’s currently impossible to get precise lengths.

Finally, Foxalabs notes that commercial note-taking apps often use multiple prompts to handle different aspects of note-taking, implying a more modular approach could be beneficial.

Summarized with AI on Aug 10


True, but even with that I don’t think I could get it right here.

I tried a recent experiment with a few new concepts I developed. I basically told it to write about a topic and then make adjustments until it reached a 200-word limit. I need to do more testing to see if I can improve it, but it looks like I was wrong: it is possible to do it. All within a single prompt output, it got to 200 words.


Can you share the exact prompt? How much testing have you done?

I hope I’m wrong, but my guess is that you just got lucky. I don’t have a problem getting GPT to output a specific length; I have a problem getting it to do so consistently. Sometimes it gets it right, but many times it gets it wrong—sometimes by a little, sometimes by a lot.

I’d also make sure to test it with a variety of lengths, as I’m doing: for example, from a 500-word text down to 100 words, from 300 to 150, from 100 to 30, etc.

I’ve only done it once so far, but I managed to get it to work. Now I’m running a post-mortem analysis to understand the operation, and yes, I do plan on running more tests. I’m not ready to release my prompt until I can do more testing on it. And this wasn’t through a GPT, but as a single prompt within ChatGPT. I suspect that to get this to work consistently I’ll need to make it into an advanced AI persona.


I have the same question. Is there any practical solution?