We made a bunch of improvements to audio truncations with the new models… would be curious if you still see problems there?
It’s still possible to get the model stuck in text only mode if you give it a huge amount of text upfront. Known issue that we’ll keep improving in future releases. In the mean time, putting audio in the latest user turns can help the model rediscover its voice