Implementing our own RLHF?

dominique.lahaix · June 13, 2023, 4:27pm

Hi, has anyone implemented their own RLFH process? Wondering whether to

or whether I should just build an embedding DB with the annotated data and enrich the prompt.
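
For readers weighing the second option, here is a minimal sketch of what "build an embedding DB with the annotated data and enrich the prompt" might look like. Everything in it is an assumption for illustration: the `ANNOTATED` examples are made up, `embed` is a toy bag-of-words stand-in for a real embedding model, and the in-memory `INDEX` list stands in for a vector database.

```python
import math
import re
from collections import Counter

# Hypothetical annotated data: (question, human-approved answer) pairs.
ANNOTATED = [
    ("How do I reset my password?", "Use the 'Forgot password' link on the login page."),
    ("How do I cancel my subscription?", "Go to Settings > Billing and choose Cancel."),
    ("Where can I download my invoice?", "Invoices are listed under Settings > Billing."),
]

def embed(text):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

# The "embedding DB": each annotated example stored next to its vector.
INDEX = [(embed(q), q, a) for q, a in ANNOTATED]

def retrieve(query, k=2):
    """Return the k annotated pairs most similar to the query."""
    qv = embed(query)
    ranked = sorted(INDEX, key=lambda row: cosine(qv, row[0]), reverse=True)
    return [(q, a) for _, q, a in ranked[:k]]

def build_prompt(query, k=2):
    """Enrich the prompt with the retrieved examples as few-shot context."""
    examples = "\n\n".join(f"Q: {q}\nA: {a}" for q, a in retrieve(query, k))
    return (
        "Answer in the style of these reviewed examples.\n\n"
        f"{examples}\n\nQ: {query}\nA:"
    )
```

Calling `build_prompt("I forgot my password, how can I reset it?")` returns a string that can be sent to any chat or completion model. In a real setup, `embed` would call an actual embedding model and `INDEX` would live in a vector store; no model weights are updated, which is the main difference from RLHF.
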

Has anyone tried this?

Thanks

sps · June 13, 2023, 4:34pm

elmstedt · June 14, 2023, 12:01am

Just FYI, it’s RLHF.

Using the correct acronym will help ensure you get better results.

dominique.lahaix · June 14, 2023, 12:55am

Oops - should use ChatGPT for typos
