I use the voice interface for ChatGPT on my phone almost exclusively. I rarely type into it.
One annoyance is the startup lag before it allows me to start speaking. Not sure why that is needed.
Occasionally I prefer to make my query quietly (i.e., by typing), depending on the environment. I also occasionally prefer to type so I can consider and revise my query before submission.
One final thought: current communication models are more like telegraph than natural dialog (i.e., demarcated ‘send’/‘reply’ turns). We would need a much more interactive conversation flow to reduce the need for thoughtful composition of interactions.
What I mean by a 100% voice interface is when you don’t need to type, not inside one specific app but at the OS level, where you have a voice assistant that knows all the apps and their data, and instead of, say, 10–20 taps to change some setting you just exchange 1–3 voice messages with the assistant.
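Just to make the idea concrete, a toy sketch in Python (everything here, the settings table and the keyword matching, is made up for illustration; no real OS exposes an API like this):

```python
# Toy model of an OS-level assistant: one utterance resolves directly
# to a settings action, instead of 10-20 taps through menus.
# Hypothetical names throughout; a real assistant would use an actual
# NLU model, not substring matching.

SETTINGS = {"wifi": True, "bluetooth": False, "dark mode": False}

def handle_utterance(utterance: str) -> str:
    """Find a known setting named in the utterance and switch it."""
    text = utterance.lower()
    for name in SETTINGS:
        if name in text:
            # crude polarity check: "off"/"disable" means turn it off
            SETTINGS[name] = not ("off" in text or "disable" in text)
            return f"{name} is now {'on' if SETTINGS[name] else 'off'}"
    return "I don't know that setting yet."

print(handle_utterance("turn off wifi"))     # wifi is now off
print(handle_utterance("enable dark mode"))  # dark mode is now on
```

One voice exchange instead of a trip through the Settings menus; the hard part, of course, is the assistant actually knowing all the apps and their data.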
Voice will possibly never be able to accurately nail punctuation, nuance, text formatting (CAPS, italics, bold, etc.), or, for example, things like deliberate misuse of homophones for comedic effect (at least not without needing to explicitly spell it out).
It will certainly get to the point of being good enough for most people, most of the time, but for people who care about having complete control over their expression, typing is, and will be, much faster and more accurate for the foreseeable future.
There’s also the fact that using your voice as a device input monopolizes that “channel”: you can’t easily, in the middle of “typing,” communicate with someone else using your voice, or even talk to yourself.
It’s just not a very practical human-machine interface.
When will voice commands be replaced by brain-to-cloud neuro-chips for knowledge and memory recall at the speed of firing synapses?
When can I connect to AI-Cloud++ at a cellular level, to become part of the hive?
When can we use quantum entanglement for instantaneous communication with connected devices or interfaces, including humans, animals, vegetables and… minerals, making us “one with everything”?
And at what point do humans become the weakness/constraint in the interface? Probably sooner rather than later.
Don’t you think that many of the things you mentioned may not be needed in the future? Expressing emotion and other nuances through formatting is actually a very reduced way to express emotion; voice is much better at this. And the other things simply follow the technology we have now; they will probably evolve along with the technology in the future.
There’s also the fact that using your voice as a device input monopolizes that “channel”: you can’t easily, in the middle of “typing,” communicate with someone else using your voice, or even talk to yourself.
That is an interesting point, but to me it seems like a point in favor of the “clip thinking” many people are not happy about.
Same here. I have a task on my list to start writing, but developing apps and exploring the models is so much more fun that the writing task never gets anywhere close to the top of the list.
To say the VEEEEEEEEEEERY least. For almost two years a buzzing thought hasn’t left my head: how did we suddenly end up inside the sci-fi we were all reading before?
I 100% agree, but… Something gets lost in translation.
As I type this, I’m contemplating how I would voice the difference between stressing a word with CAPS, italics, bold, ITALIC-CAPS, BOLD-CAPS, italic-bold, and ITALIC-BOLD-CAPS.
I’m sure someone knows how, but I don’t.
Or, how to offset a parenthetical with parentheses, commas, or em-dashes?
So, while I will agree that voice is much more expressive than text, I think it only matters if the communication is received aurally. If you are intending the message to be read, I think the best interface with which to create that message is the “native” one: keys.
It may come to pass someday that a sufficiently advanced AI will be able to discern our intention from the tone and cadence of our voices (or, more likely, there will be a convergence wherein we learn to speak in a way that is easier for the AI to infer our intentions from), but I see that as a good distance into the future, and I will note here that I’m usually very bullish about the technological future.
All of your points make sense to me, but I’ll try to reiterate the point from my previous message.
Typing/reading may be gone altogether in the future, paving the way for voice (and potentially some new kinds of communication interface, say emojis or some of their descendants).
Sentiment analysis of written text or transcriptions is available, but I haven’t read much about sentiment extraction from spoken audio itself. I imagine it’s not far behind.
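For the text side it really is off the shelf already; e.g., the Hugging Face transformers pipeline (a real library; by default it downloads an English sentiment classifier) applied to a transcription:

```python
# Sentiment of a transcribed utterance via Hugging Face transformers.
# pipeline("sentiment-analysis") pulls a default English classifier.
from transformers import pipeline

classifier = pipeline("sentiment-analysis")
print(classifier("I can't believe the demo actually worked!"))
# e.g. [{'label': 'POSITIVE', 'score': 0.99...}]
```

The audio side, reading emotion from prosody and tone rather than from the words, is the part I haven’t seen as much tooling for.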
Even as a human, it’s easy to misinterpret another person’s spoken intentions. E.g., someone is passionate, but it comes across as angry, which wasn’t their intention. Or one person recognises a joke and the other takes it seriously.
But your response makes me wonder: at what point do machines stop trying to replicate humans and instead devise strategies that are superior to humans? I.e., what means of communication would computers use to communicate among themselves without catering for humans?
Of course, humans will keep trying to replicate humans, but if we reach AGI, the machines will look beyond that.