Streaming STT delta events are not really interim and real time

I’m playing around with the transcription api via websockets, and I’m noticing that delta events arrive together with the final event once a pause is detected. This happens when VAD is both enabled and disabled. Is this the correct behaviour? Is it not possible to receive partials as the speaker is still speaking?