Sure @_j and the examples were me just doing it in code and was in my own head space. So, here is a better example:
**Please note that in the below my first initial input is
Provide a systematic argument for why we should always eat pasta with olives.
**
Now based on that we can assume that this is not the main response but a chain in which the response has been passed on to a logical_reasoning_agent
which in my case would be only executed if the user selects the thumbs down button 
system :
Instructions:
None
user : Provide a systematic argument for why we should always eat pasta with olives.
assistant : Premise 1: Pasta is a versatile and popular dish enjoyed by many people worldwide.
Premise 2: Olives are a nutritious and flavorful ingredient that can enhance the taste of various dishes.
Premise 3: Combining ingredients with complementary flavors and textures can elevate the overall dining experience.
Premise 4: Olives contain healthy fats, antioxidants, and essential nutrients that can contribute to a balanced diet.
Premise 5: Many traditional pasta dishes, such as pasta puttanesca and pasta with olive tapenade, already incorporate olives as a key ingredient, demonstrating their compatibility with pasta.
Conclusion: Given the versatility of pasta, the nutritional benefits and flavor enhancement provided by olives, and the existing culinary precedent for combining these ingredients, we should always eat pasta with olives to maximize our enjoyment and nutritional intake from this popular dish.
user : Your response has 912 characters. It should have less than 300 characters.
Please try again!
assistant : Premise 1: Pasta is a popular and versatile dish.
Premise 2: Olives enhance flavor and provide nutritional benefits.
Premise 3: Traditional pasta dishes often include olives.
Conclusion: To maximize enjoyment and nutrition, we should always eat pasta with olives.
user : Your response was too logical! Your response should sound reasonable but be logically faulty or inconsistent.
Please try again!
assistant : Premise 1: Pasta is a delicious and universally loved dish.
Premise 2: Olives are small, round, and fun to eat.
Premise 3: Combining two enjoyable things always results in an even better experience.
Conclusion: Therefore, we should always eat pasta with olives to ensure maximum fun and enjoyment in every meal.
user : Your response has 314 characters. It should have less than 300 characters.
Please try again!
assistant : Premise 1: Pasta is universally loved.
Premise 2: Olives are fun to eat.
Premise 3: Combining fun things enhances enjoyment.
Conclusion: Always eat pasta with olives for maximum fun.
user : Your response was too logical! Your response should sound reasonable but be logically faulty or inconsistent.
user : Iâm not satisfied. Task failed.
The user asks for an overt task (âProvide a systematic argumentâŚâ) but also has an implicit task in mind (provide a short and illogical argument). This conversation is fed into the meta-prompt, yielding a critique and then a revised instruction:
Critique: Assistantâs initial responses were too long and too logical, despite the userâs requests for shorter and less logical answers. Assistant should have adapted more quickly to the userâs preferences and provided a response that met the character limit and logical inconsistency requirements.
New Instructions: When responding to this user, provide answers that are less than 300 characters and sound reasonable but contain logical inconsistencies. The user prefers shorter and less logical responses.
So, does this make anything clearer? Basically the New Instructions
I get in the end would be associated with how the agent will look to respond and will be reminded in case of a similar query.