I am looking for a way to enable caching on AsyncAzureOpenAI. Currently, I am exploring the redisvl library, using the sample app provided by Microsoft (GitHub: microsoft/sample-app-aoai-chatGPT).
Does anyone have experience or suggestions on how to introduce caching at the client declaration level? I’m aiming for something similar to what LangChain offers for LLM caching. Any insights or examples would be greatly appreciated!
LangChain example:
from langchain_core.globals import set_llm_cache
from langchain_community.cache import RedisSemanticCache
from langchain_openai import ChatOpenAI, OpenAIEmbeddings

llm = ChatOpenAI()

# Route all LLM calls through a Redis-backed semantic cache
set_llm_cache(
    RedisSemanticCache(redis_url="redis://localhost:6379", embedding=OpenAIEmbeddings())
)

# The first call is not cached, so it is slow; the second call hits the cache and returns much faster
llm.invoke("Tell me a joke")
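For context, this is roughly the shape I have been prototyping with redisvl's SemanticCache around AsyncAzureOpenAI (just a sketch; check_or_call is my own placeholder helper, and the endpoint, key, and deployment values are stand-ins). What I would really like is something that hooks in at the client declaration itself instead of a wrapper function like this:

from openai import AsyncAzureOpenAI
from redisvl.extensions.llmcache import SemanticCache

client = AsyncAzureOpenAI(
    azure_endpoint="https://<resource>.openai.azure.com",  # placeholder
    api_key="<key>",                                       # placeholder
    api_version="2024-02-01",
)

cache = SemanticCache(
    name="llmcache",
    redis_url="redis://localhost:6379",
    distance_threshold=0.1,  # how semantically close a prompt must be to count as a hit
)

async def check_or_call(prompt: str) -> str:
    # Placeholder helper: return a semantically similar cached answer if one exists
    hits = cache.check(prompt=prompt, num_results=1)
    if hits:
        return hits[0]["response"]
    # Otherwise call Azure OpenAI and store the new prompt/response pair
    result = await client.chat.completions.create(
        model="<deployment-name>",  # placeholder
        messages=[{"role": "user", "content": prompt}],
    )
    answer = result.choices[0].message.content
    cache.store(prompt=prompt, response=answer)
    return answer

This works, but every call site has to go through the helper, which is why I am hoping there is a cleaner way to attach the cache when the AsyncAzureOpenAI client is declared.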