It seems like we’re very close to an AI with shared comprehension across text, visuals (video and images), audio, and any number of other senses. And a super AI that integrates multiple senses seems far more powerful than one limited to a single modality.
This will be a critical step towards one type of AGI.
Yes, the more multimodal the better. When text, images, video, and sound are all used to train a single model, it will be something amazing.