The “system” role - How it influences the chat behavior

Yes, it is from LangChain, and yes, it does give me very good results so far.

And, actually, I make 3 different requests for every question. Right now, I’m using GPT-3.5-Turbo for the question concept and the standalone question, and GPT-4 for the chat completion. To cut costs, I might start using GPT-3.5-Turbo for all three. I could actually just make 2 calls (cut out the concept call and just send the question), but I think this process gets me the best results possible.
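For anyone who wants to try the same idea, here is a rough sketch of the three-call flow in Python. It is not my actual code: the prompts, the `answer_question` helper, and the `call_model(model, prompt)` callable are all placeholders I made up for illustration, and how the concept gets folded into the final prompt is an assumption. Plug in whatever client you use to actually reach the models.

```python
from typing import Callable, List, Tuple

# Hypothetical signature: (model_name, prompt) -> completion text.
# Wrap your real API client in a function of this shape.
ModelCall = Callable[[str, str], str]

CHEAP_MODEL = "gpt-3.5-turbo"  # used for the concept and standalone calls
ANSWER_MODEL = "gpt-4"         # used for the final chat completion


def answer_question(
    question: str,
    history: List[Tuple[str, str]],
    call_model: ModelCall,
    use_concept: bool = True,
) -> str:
    """Run the concept -> standalone -> completion sequence (illustrative only)."""
    transcript = "\n".join(f"{role}: {text}" for role, text in history)

    # Call 1 (optional): pull out the concept behind the question.
    concept = ""
    if use_concept:
        concept = call_model(
            CHEAP_MODEL,
            f"State the main concept of this question in a few words:\n{question}",
        )

    # Call 2: rewrite the question so it makes sense without the chat history.
    standalone = call_model(
        CHEAP_MODEL,
        f"Chat history:\n{transcript}\n\n"
        f"Rewrite the follow-up question as a standalone question:\n{question}",
    )

    # Call 3: the chat completion itself, on the stronger model.
    # Assumption: the concept is simply prepended to the final prompt.
    if concept:
        final_prompt = f"Concept: {concept}\nQuestion: {standalone}"
    else:
        final_prompt = f"Question: {standalone}"
    return call_model(ANSWER_MODEL, final_prompt)
```

Passing `use_concept=False` gives the 2-call version, and changing `ANSWER_MODEL` to the cheaper model is the cost-cutting option.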

This chart shows my process in a little more detail. So far, so good.
