Google's DeepMind artificial intelligence has figured out how to talk

Sep 9, 2016, 15:15 IST

Google DeepMind

Google DeepMind claims to have significantly improved computer-generated speech with its AI technology, paving the way for sophisticated talking machines such as those seen in the sci-fi films "Her" and "Ex Machina."

The London-based research lab, acquired by Google in 2014 for a reported £400 million, announced on Thursday that it has developed a talking computer programme called "WaveNet" that halves the quality gap that currently exists between human speech and computer speech.

"Allowing people to converse with machines is a long-standing dream of human-computer interaction," the Google DeepMind researchers wrote in a blog post announcing the breakthrough.

Unlike existing artificial voice generators, WaveNet focuses on the sound waves being produced as opposed to the language itself. It uses a neural network - a technology loosely modelled on the human brain - to analyse the raw waveform of an audio signal and model speech and other types of audio, including music.
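
To give a rough sense of what working on "raw waveforms" means: digital audio is a long list of numbers, thousands per second, and a system that operates at this level has to produce the sound one number at a time, each based on the ones before it. The Python sketch below is purely illustrative and is not DeepMind's code. The 16,000-samples-per-second rate, the 440 Hz tone and the function names are assumptions made for the example, and a fixed formula stands in for the trained neural network.

```python
import math

SAMPLE_RATE = 16_000   # samples per second (assumed for this example)
TONE_HZ = 440          # pitch of the toy signal (assumed)
STEP = 2 * math.pi * TONE_HZ / SAMPLE_RATE  # phase advance per sample


def toy_next_sample(history):
    """Hypothetical stand-in for a trained network.

    Predicts the next sample from the two previous ones using the
    recurrence x[n] = 2*cos(w)*x[n-1] - x[n-2], which traces a sine wave.
    A real model would learn its prediction from recorded audio instead.
    """
    return 2 * math.cos(STEP) * history[-1] - history[-2]


def generate(n_samples):
    """Build a waveform sample by sample, feeding each output back in."""
    samples = [0.0, math.sin(STEP)]  # two seed samples to start the loop
    while len(samples) < n_samples:
        samples.append(toy_next_sample(samples))
    return samples


# One second of audio is SAMPLE_RATE individual numbers.
waveform = generate(SAMPLE_RATE)
print(len(waveform), "samples for one second of sound")
```

Even this toy loop has to run 16,000 times to make a single second of sound, which hints at why generating audio this way is computationally demanding.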

DeepMind published sample audio recordings of WaveNet talking in English and Mandarin, and it's easy to hear that they are an improvement on Google Now, Amazon's Alexa, and Apple's Siri. The company also showed off some of the music that WaveNet has been able to produce after studying solo piano music on YouTube.

Although WaveNet sounds more like a human voice than existing artificial voice generators - known as "text-to-speech" (TTS) systems - it requires too much computing power to be practical, meaning Google won't be integrating it into its products any time soon, according to The Financial Times.

Like other AI systems, WaveNet requires vast quantities of existing data to train itself. DeepMind used Google's existing TTS datasets to do this.

DeepMind, which sits under Alphabet, Google's parent company, is best known for developing artificial intelligence systems that can master games like Space Invaders and Go. However, Google has been slow to integrate the company's technology into its products, with just one data centre efficiency project announced so far, albeit on a global scale.

Google DeepMind did not immediately respond to Business Insider's request for comment.

For more details on WaveNet, take a look at Google DeepMind's academic paper.
