ChatGPT data flow architecture

Hi team,
When the LLM is trained on a local corpus, an index.json file gets created and saved locally. When I then write a prompt to get a query answered, I see two different results clubbed together: one from the LLM's training and the other from the local corpus. Could you help me with the data flow architecture here? How does data get pulled from the local system, merged with the LLM result, and shown to the user? I am trying to create an architecture diagram that shows the data flow in and out of the user's system.
Thanks

Welcome to the OpenAI developer forum.

Can you give some more details please, e.g., some example code you are running?

Perhaps I have read this wrong or there has been a slight miscommunication, but I am not aware of JSON files getting created locally with any of OpenAI’s products.

Thanks for reaching out. I was following this tutorial: How to Train an AI Chatbot With Custom Knowledge Base Using ChatGPT API | Beebom

But my question is more about how the client-server interaction takes place. I have not been able to find an architecture diagram that explains the data flow. It would be great if you could share some knowledge.

Looks like that tutorial is using LangChain. Know that you aren’t “training” the LLM, though. Try running LangChain in verbose mode and you can see the prompts it is using and how it stuffs your local data into the prompt. Under the hood it is using embeddings/vector storage; here’s an explainer from another community member:
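
To make the flow concrete, here is a minimal sketch of the retrieval pattern these libraries implement. It is not the tutorial’s exact code; the model names, sample documents, and the `embed` helper are illustrative assumptions, and it assumes the `openai` Python SDK (v1+) and `numpy` are installed:

```python
# Minimal retrieval-augmented generation (RAG) sketch.
# Illustrative only: model names and the sample docs are assumptions,
# not the tutorial's exact code.
import numpy as np
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def embed(texts: list[str]) -> np.ndarray:
    """Turn text into vectors via the embeddings endpoint."""
    resp = client.embeddings.create(model="text-embedding-3-small", input=texts)
    return np.array([d.embedding for d in resp.data])

# 1. "Indexing": embed the local documents once and store the vectors.
#    (LlamaIndex persists this step to disk; that is the index.json.)
docs = ["Acme's return policy is 30 days.", "Support hours are 9 to 5 EST."]
doc_vectors = embed(docs)

# 2. Query time: embed the question, then find the most similar
#    chunk by cosine similarity against the stored vectors.
question = "What is the return policy?"
q_vec = embed([question])[0]
scores = doc_vectors @ q_vec / (
    np.linalg.norm(doc_vectors, axis=1) * np.linalg.norm(q_vec)
)
top_chunk = docs[int(np.argmax(scores))]

# 3. "Merging": the retrieved chunk is stuffed into the prompt, so the
#    model answers from your local data plus its pretrained knowledge.
answer = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[
        {"role": "system", "content": f"Answer using this context:\n{top_chunk}"},
        {"role": "user", "content": question},
    ],
)
print(answer.choices[0].message.content)
```

So the “two results clubbed together” are really a single completion: the local corpus supplies context inside the prompt, and the model’s weights supply the language and general knowledge. The client-server data flow is: local docs → embeddings → index on disk → similarity search → prompt → OpenAI API → answer.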
