I need to filter thousands of pages of transcripts to improve my videos and I cannot use O3 due to its rate limits and comparatively high cost.
However, I have been experimenting with o4-mini, and o3-mini. O3-mini gives 15% longer transcript cleanups and O4-mini gives 10% shorter transcript cleanups than O3. However, o4-mini cuts out too much detail in my opinion. Not sure if I should experiment with o4-mini prompting since it consistently cuts out more detail when I try to force it to format responses like O3. For instance, I prompted "You are given a block of raw source text. Transform it into a concise, well-organized document by analyzing the transcript and applying the following structure to maximize brevity and clarity:
- Title based on lesson content
- Use bulleted lists and subheadings to chunk information into bite sized pieces
- Use numbered steps and consistent âStep 1, 2, 3âŚâ labels made the flow explicit.
- Generic âApproach A vs. Approach Bâ labels stand in for repeated descriptive paragraphs.
- Parallel structure in examples (e.g. âPart â link back â next partâ) highlights the pattern.
- Visual Aids (optional) ⢠Insert simple ASCII diagrams, flowcharts, or arrows if they clarify relationships.
When rephrasing the transcript ensure clarity and brevity by:
- Eliminating filler words, redundancies, speaker labels, timestamps, false starts, stutters, and disfluencies.
- Keeping sentences short and purposeful.
- Ensuring active-voice constructions (âYou start withâŚ,â âYou ask: Is this relevant?â) focus on the readerâs actions.
- Ensuring transitional phrases (âBy bouncing betweenâŚ,â âQuick sketch,â âThat question forcesâŚâ) guide the reader through each shift.
- Unnecessary qualifiers and asides were pruned to keep only the essential logic."
Which gave me an ultra-concise 400 word output when O3 gave a 855 word output. O3-mini gave a 988 output without formatting rules but it didnât have explicit attention to detail that I require. I am new to API programming and I finished coding something for batch processing, but the only thing standing in the way is quality control. Btw these tests were done in the playground, and if I posted this under the wrong tag please tell me where to post next time.