Amazon is working on giving Alexa the ability to mimic anyone's voice, dead or alive, from just one minute of audio — and Twitter doesn't know how to feel about it
This story is available exclusively to Business Insider subscribers. Become an Insider and start reading now. Have an account? .
Amazon is teaching Alexa to mimic anyone's voice, dead or alive, from just a one-minute recording of that voice.
Rohit Prasad, Amazon's head scientist for Alexa, said at a live event on Wednesday that his team has been instructing Alexa to pick up a voice from a short audio clip and convert it into longer audio output. Prasad was presenting at Amazon's re:Mars conference in Las Vegas.
He showed a short video of how people could use Alexa's voice-changing capability in real life. In the clip, a boy asks: "Alexa, can grandma finish reading me the Wizard of Oz?"
The smart speaker confirmed the request in its default chirpy voice, then transitioned to a less robotic voice that narrated an excerpt from the children's novel.
"This required inventions where we had to learn to produce a high-quality voice with less than a minute of recording versus hours of recording in the studio. The way we made it happen is by framing the problem as a voice-conversion task and not a speech generation path," Prasad said.
Prasad said Alexa's ability to impersonate familiar voices is particularly crucial now, as many people lost loved ones to COVID-19.
"While AI can't eliminate that pain of loss it can definitely make their memories last," he said.
Prasad did not say when Amazon would introduce Alexa's voice-imitation capability to the public. An Amazon spokesperson declined Insider's request for comment.
Alexa's ability to mimic voices is a form of artificial intelligence (AI) that Prasad called "generalizable intelligence." The ability helps Alexa adapt to different situations and acquire new knowledge from the experiences with little supervision, he said.
It's different from the "all-knowing, all-capable" artificial general intelligence, or AGI, that aims to understand human tasks and intellect to solve problems, said Prasad. Organizations including Google's DeepMind and Elon Musk's OpenAI are both focused on perfecting AGI.
Amazon is not the only company working on developing tech that can imitate human voices. Last month, Japanese toy maker Takara Tomy debuted a voice-shifting, egg-shaped device called Coemo that copies adults' voices and uses them to read stories to children.
Many are spooked by AI's ability to imitate human functions
On Twitter, people were divided over Amazon's plans to teach Alexa to mimic human voices.
One person, who uses the Twitter handle "Maltese Mama," said Alexa could keep their parents, who have dementia and live far away, mentally active. "We have caregivers going daily but being able to peak (sic) in or even better drop in with a video call is amazing," they tweeted in response to Prasad's presentation.
But many others voiced concerns about the technology.
"Umm, so how soon will criminals be able to use it to call your family members begging them to Venmo cash? Or ask them for social security numbers? Or bank information?" tweeted a user with the handle bitty_in_pink.
Others, including a Twitter user who goes by "Luke," said they were creeped out by the thought of it.
"It's sweet but at the same time incredibly creepy… I lost my mum last year in August and would die to have one last proper conversation with her but I wouldn't do it for a goddamn circular device," he wrote.
Experts have long been concerned about AI's ability to imitate human functions. In 2015, Musk funded several AI projects, including OpenAI, to ensure that researchers were using the technology only for beneficial purposes. Earlier this month, an engineer claimed that a Google chatbot had become sentient, but AI experts said it was far from being self-aware.
Read next
Weilun was formerly Insider's senior Asia tech reporter, based in Singapore. He wrote about Chinese and Asian technology companies and startups, and their impact on societies, economies and politics in the region. Before Insider, he was managing investment content strategies for some of Wall Street's largest firms, based in Hong Kong and Singapore. Prior to that, he was a reporter covering technology, business, economics, politics and defense for Singapore-based news outlets. Weilun is a graduate of Columbia University's Journalism School and Peking University in Beijing, China. He is fluent in Mandarin Chinese and definitely Singlish. He also speaks French and Japanese. Here are some highlights from Weilun:
- How Grab, Asia's Uber, stumbled before and after its $40 billion SPAC, with pandemic layoffs and a teary all-hands with its CEO
- Read the form email Coinbase sent to employment candidates to withdraw their job offers
- China's popular Xiaohongshu app counts Kim Kardashian and Eileen Gu as users. Sources say it's staffing up to launch internationally.
- Some crypto investors say TerraUSD and Luna's $60 billion crash pushed them to the brink and led to thoughts of self-harm
- Russia was invading, so a Ukrainian converted his life savings of $10,500 into the crypto token terra. Then the token crashed.
- An anonymous Twitter account built 60,000 followers translating Chinese propaganda about the Ukraine war and is setting off an international firestorm
- Weibo users baselessly claimed Ukraine set fire to its own nuclear plant, showing how misinformation is spreading about the war in China
- China's Big Tech firms are sending congratulation notes for 'graduating' to employees they're laying off
- Elon Musk, the world's richest person, says he doesn't own a home right now and rotates around friends' spare rooms