Does the new WebRTC (for Realtime voice) require a VAD implementation on our side, or is that handled by the OpenAI backend?