Best way to produce video transcript while retaining timings

I have a 30 minute presentation video need to be transcribed. I know I can download the current automated srv which is not so good, and then paste it as a text file etc, but how to retain the transcription time codes?