Allow users to add text during advanced voice mode: suggestion

Proposal for Enhanced Continuity in Advanced Voice and Video Interactions

Problem Statement:
Currently, the advanced voice and video functions sever the ability to retain conversational context from previous threads. If a session is interrupted (e.g., by a phone call), users must create a new topic to re-enable these features. However, this new topic does not retain the rich history or context from the prior conversation. Furthermore, there is no way to integrate text-based information into advanced voice or video modes, limiting the ability to restore lost context.

Proposed Solution:

  1. Enable Context Import to Advanced Voice/Video Threads:
    Allow users to copy and paste conversational summaries or key context into an advanced voice/video thread at its start. This feature would ensure that essential information from previous sessions is carried over, maintaining continuity.

  2. Integrated Text and Advanced Features:
    Provide an option for users to simultaneously send text-based messages within an advanced voice/video thread. This functionality would make it easier to clarify, provide details, or reintroduce lost context without losing access to voice/video capabilities.

  3. Session Reconnection Feature:
    Introduce a “Rejoin Session” option that allows users to reconnect advanced voice/video functions in the same thread after an interruption (e.g., a call or app crash). This would eliminate the need to start a new topic altogether.

Benefits:

Seamless transitions between interruptions, ensuring richer and uninterrupted conversations.

Improved user experience by removing the need for compromises between context retention and advanced interaction features.

Greater flexibility for users to clarify or enrich conversations using multiple input methods (text, voice, and video).

Use Case Example:
A user conducting a house tour via video is interrupted by a phone call. Upon resuming, they can either:

  1. Paste a brief summary of the tour into the new advanced voice/video thread.

  2. Reconnect the session without starting a new thread.
    This ensures the assistant retains full understanding and the conversation continues naturally.

Conclusion:
Integrating these features would greatly enhance the fluidity and accessibility of advanced voice and video modes, making them more user-friendly and adaptive to real-world interruptions.