Filler words prompt recommendations

nico.moschopoulos · May 2, 2023, 4:22pm

Hey!

I’m currently exploring different prompts for text editing and I’m wondering if any of you have recommendations for a prompt that can remove filler words from text without summarizing it.

I’m specifically looking for a prompt that can identify and remove words such as “um”, “uh”, “like”, “you know”, and other similar phrases that don’t add much value to the text but can be distracting or detract from its overall quality. Everything I’ve tried so far (few shot prompting, GPT4 “personas”, and a lot of other prompts) have all yielded a lot of summarisation from the AI. I want to preserve all the context, but just slightly tweak the input to read a little more formally.

The input will be sections of interview transcripts (ie questions asked and the answer provided).

anon10827405 · May 2, 2023, 5:46pm

What you’re describing are stop words.

Does it have to be done through ChatGPT, or can you use something else?
Here’s an example using Python.

Otherwise, your only option (off the top of my head) would be giving it a list, or even making it a plugin.

sps · May 2, 2023, 6:02pm

Here’s a working example

nico.moschopoulos · May 2, 2023, 6:26pm

@sps @anon10827405 – thank you! It’s a little more than just stop words though, it’s input that looks a bit more like this:

“They don’t, for instance, some things, some of their online exercises don’t work when they try to open them.”

I’d like to have this rewritten as “Some of their online exercises don’t work” (optionally “when they try to open them”).

More broadly it’s real dialogue (as it comes from interview transcripts), so what we want to remove isn’t perhaps as simple / straightforward as an array of filler words, that’s why I was thinking to lean on GPT. Is there a good way to do this directly in code?

Additionally, I’ve noticed that issues arise usually / mostly when we try and clean multiple question / responses at the same time. If we do them one-off, it works fairly well, but if we do n > 3, it starts summarizing a lot. It’s, however, unwieldy and inefficient to do the cleaning one off

dror · May 2, 2023, 8:15pm

You can try something like this.

1. Break the text into sentences.
2. For each sentence do this: describe and experiment with what you want done.
Once you're done review your work and check that the above instructions have been followed correctly.

I’m not going too much into the details of what you need done in step 2, because you’ll need to test with your data and see what works best.

PaulBellow · May 2, 2023, 9:10pm

Try a one-shot or two-shot… ie give it a couple examples and it’ll do better, I bet.

Good luck.

sps · May 3, 2023, 12:41am

This is achievable. It required some iteration for the sentence you gave.

If you want to “batch”, you’ll have to append the system message at the end rather that the beginning.

You’ll have to take time to refine it according to your data and desired output. Alternatively you can spend some time in creating a training dataset with prompts and desired completions and fine-tune a base model.

nico.moschopoulos · May 3, 2023, 1:09am

Thank you @sps, I’ll try that out

stevenic · May 3, 2023, 12:24pm

You can give this prompt a try:

I don’t use the “system” role (ever) but you could also try it with the system role.

jhall0947 · March 9, 2024, 4:12pm

Mr Sps can you please help me I do have a few questions to ask

sps · March 9, 2024, 4:23pm

Welcome @jhall0947

I recommend creating a separate topic on the forum.

Topic		Replies	Views
Can a good prompt prevent 'hallucination'? Prompting chatgpt , api	6	3945	November 4, 2023
Alternatives to negative prompting Prompting chatgpt	7	1968	October 2, 2023
Training gpt-3.5 to autocomplete for a niche domain and a specific writing style Community chatgpt	13	1505	July 25, 2024
Changing prompts to remove references to context Prompting	11	8923	September 22, 2023
How to get responses without the added "chat" when converting from davinci-003 to ChatGPT API gpt-3.5-turbo API	10	2800	March 6, 2023

Filler words prompt recommendations

Related topics