Hello,
This issue annoyed me so much that I built a tool to fix it, and it has made my workflow much smoother.
This is a simple, open-source script that lets you press a single key, talk, and get:
- Your voice transcribed with Whisper (locally),
- The result copied to the clipboard,
- And (optionally) sent directly to your current ChatGPT chat or a new one.
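For anyone curious what the transcribe-and-copy step can look like, here is a rough sketch. It is not the code from the repo: it assumes the `openai-whisper` and `pyperclip` packages are installed, and the function names (`whisper_transcribe`, `clipboard_copy`, `transcribe_to_clipboard`) are made up for illustration. Recording the audio and sending text to ChatGPT are left out.

```python
from typing import Callable


def whisper_transcribe(audio_path: str, model_name: str = "base") -> str:
    """Transcribe an audio file locally with Whisper.

    Assumption: the openai-whisper package is installed.
    """
    import whisper  # imported lazily so the module loads without it

    model = whisper.load_model(model_name)
    result = model.transcribe(audio_path)
    return result["text"].strip()


def clipboard_copy(text: str) -> None:
    """Copy text to the system clipboard.

    Assumption: pyperclip is installed (on Linux it also needs a
    clipboard backend such as xclip, xsel, or wl-clipboard).
    """
    import pyperclip

    pyperclip.copy(text)


def transcribe_to_clipboard(
    audio_path: str,
    transcribe: Callable[[str], str] = whisper_transcribe,
    copy: Callable[[str], None] = clipboard_copy,
) -> str:
    """Transcribe the recording, copy the text, and return it."""
    text = transcribe(audio_path)
    copy(text)
    return text
```

Passing the transcriber and the clipboard function as arguments keeps the glue code easy to test and makes it simple to swap in a different Whisper implementation later.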
Audio and transcripts are saved in a clean folder structure so nothing gets lost, and there’s an optional local LLM integration that fixes punctuation and gives your recordings sensible names.
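To give an idea of what a clean folder structure could mean in practice, here is a minimal sketch that files each recording and its transcript under a per-day folder. The layout (`YYYY-MM-DD/HHMMSS_slug`), the function names, and the `slug` parameter are assumptions for illustration and may not match what the repo actually does.

```python
from datetime import datetime
from pathlib import Path
from typing import Optional, Tuple


def recording_paths(base_dir: str, when: datetime,
                    slug: str = "recording") -> Tuple[Path, Path]:
    """Return (audio_path, transcript_path) for one recording.

    Hypothetical layout: <base_dir>/<YYYY-MM-DD>/<HHMMSS>_<slug>.{wav,txt}
    """
    day_dir = Path(base_dir) / when.strftime("%Y-%m-%d")
    stem = f"{when.strftime('%H%M%S')}_{slug}"
    return day_dir / f"{stem}.wav", day_dir / f"{stem}.txt"


def save_transcript(base_dir: str, text: str,
                    when: Optional[datetime] = None,
                    slug: str = "recording") -> Path:
    """Write the transcript next to where the audio would live."""
    when = when or datetime.now()
    _, text_path = recording_paths(base_dir, when, slug)
    # Create the per-day folder on first use.
    text_path.parent.mkdir(parents=True, exist_ok=True)
    text_path.write_text(text, encoding="utf-8")
    return text_path
```

The `slug` is where a name suggested by the local LLM could go, for example `save_transcript("recordings", text, slug="standup_notes")`.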
It runs locally and is ideal for devs or note-takers who want zero friction when thinking out loud. So far it has only been tested on Linux, but with some effort it should be possible to make it cross-platform (macOS/Windows) or even run it on a phone.
Repo: search for voice2chatgpt on GitHub.
Contributions and stars appreciated.