I wonder your definition of ‘Advanced Voice Mode’
Certainly it must follow logical process…
It must surely be able to accept a single input but then potentially return multiple potential AND relevant results.
It must also be able to present that data. The current Chat GPT interface is only one dimensional ie
Question
Answer
Question
Answer
A smart interface must flow into multiple dimensions
But this potentially generates an awful lot of data, especially for things like loops
Every next sentence opens up a range of potential next paths/threads for the User or for Chat GPT
The User in a conversation might want to follow not only one of these paths, or even retrace, change and then regenerate something without loosing the thread of the conversation after
in the real world we have many contextual ques… We can point and look or talk about something in our surroundings.
For Chat GPT the context (or data) it shares is what enables us to interact with it in an ‘Advanced’ way. It’s interface is the world we share with it.
There are side by side views where Chat GPT works out which option is best etc
A conversation must add some dimensionality to be Advanced I think, something you can go back, rewrite entire branches etc as you better understand your conversation.
“What if we change…?”
The potential processing on the back of this especially when you start calling functions is massive and time consuming.
By changing one input 10 levels up, we suddenly ask Chat GPT to rebuild it’s entire conversation, potentially thousands of requests.
To truly have an ‘advanced’ conversation with a machine a User has to understand it’s logic, just as it’s ‘learnt ours’. And certainly needs to be able to SEE enough context.
Most people at least hold conversations (and knowledge) ‘by reference’… i know that I know that I know… A path on a treeview.
Currently AIs are ‘one shot’ by interface design
Or maybe he just needs some better prompting…