gpt model to connect with ai search to process address matching data, my questions are what type of indexing or such would be necessary to process large datasets sizes of 5gb at most - also can the resulting output be uploaded to blob - is there functionality or documentation i can look at to enable this?