It is kind of the best way to reduce the token using distributed prompting structure? If im building a professional customer service using GPT4. I can not educate the gpt all of the information he needs to know every single input bc apparently API doesn’t have a memory. Is there anyone familiar with this kind of structure construction?