I wonder how much OpenAi would pay to cure GPT lazyness

vb · January 28, 2024, 9:05pm

While I am definitely a proponent of evals I think it’s challenging for somebody who is using mainly if not only ChatGPT to recreate the failure mode and the eval accordingly.

It would be great to have some guidance about how to set temperature and top-p, or even more fancy, to export single examples directly from the UI.
Currently the best option for a ChatGPT users is to hit the thumbs down button.

Imagine with 100 million users a month, what if 1% of users actually found something worth looking into, even if it’s just a potential candidate for the Journal of Negative Results.

Topic		Replies	Views
GPT4-Turbo more "stupid/lazy" - It's not a GPT4 API gpt-4 , chatgpt , gpt-4-turbo	33	11365	March 18, 2024
Do you also relate or did you overcome this challenge? Prompting	5	1620	March 3, 2024
Custom GPTs cannot even retrieve information from its custom knowledge? GPT builders	11	1117	February 27, 2025
GPT Builder Or Programming Language? Community project	22	567	October 13, 2024
Custom chatbot says that it's developed by OpenAI API gpt-4	33	2112	April 2, 2024

I wonder how much OpenAi would pay to cure GPT lazyness

Related topics