2-shot plus step-by-step prompts for gpt-3.5-turbo performance at gpt-4 level?

N2U · May 3, 2023, 6:19pm

Indeed,

I’m wondering if the “girls checked the freezer” in your example is a temperature related hallucination, have you tried with a lower temperature?

By rephrasing the question you’re also adding more context for the model to work with, i think it’s a good idea, but it sorta prompts the question:

Are we actually testing the model’s performance, or, are we testing the researchers ability to use it?

This question is something that’s been nagging me as well, the methodology I’ve ended up using for tests are based on instructions provided by one human to another human in an attempt to remove this variable

Topic		Replies	Views
How do you maintain historical context in repeat API calls? API	29	94493	December 23, 2023
Getting ChatGPT to Remember Previous Chat Messages Prompting	37	70612	January 29, 2024
Custom Instructions for maintaining a long-term memory? Prompting gpt-4 , chatgpt , prompt-engineering , custom-instructions	33	19974	October 9, 2024
A conversation using the API API	6	3131	December 16, 2023
Context generation for chat based Q&A bot Prompting	41	22845	December 13, 2023

2-shot plus step-by-step prompts for gpt-3.5-turbo performance at gpt-4 level?

Related topics