Embeddings not working as well as I hoped

mikea · August 25, 2023, 6:13am

Hi developer community! I’m just getting started exploring beyond prompting and could use some advice.

I’m running some creative writing experiments with GPT-4 and a localhost chat retrieval plugin connected to a Pinecone index.

In Pinecone I have around 20 transcripts of the same TV show, formatted reasonably consistently in markdown. Since they are in script format, the docs are inherently fairly ‘structured’ as you’d expect:

INT. TELEVISION STUDIO - EVENING

The camera pans across a sleek, modern news studio. The atmosphere is tense, the music dramatic.

CHRIS:
Good evening. Here are tonight’s headlines.

…
My goal is to accurately mimic the style of each speaker in generated text and my idea was to have the retrieval plugin study the scripts as either a few-shot technique or by analysing each instance of a speaker’s line across all the scripts to derive a composite character style.

In fact what happens when run is that we get a handful of results from Pinecone (a different subset each time). The results are accurate but don’t give enough signal to capture the speaker’s style.

Could this be improved with how my documents or retrieval requests are set up, or is this just a limitations of the RAG approach?

PaulBellow · August 25, 2023, 6:21am

Welcome to the forum.

I’d take a look at this new thread…

Good stuff that might help you…

mikea · August 25, 2023, 6:25am

Thanks Paul! I’ll take a look at that project.

Topic		Replies	Views
Standard RAG + Agent Solution Community chatgpt , api	2	5541	October 26, 2023
Linking Embeddings For Large Article?! Community embeddings	2	1244	June 28, 2023
Questions about the embedding-based chatbot API embedding	4	152	December 15, 2024
Training gpt-3.5 to autocomplete for a niche domain and a specific writing style Community chatgpt	13	1840	July 25, 2024
Best approach for adding knowledge to base model API fine-tuning , rag	4	1497	February 7, 2024

Embeddings not working as well as I hoped

Related topics