Google's Duplex could make Assistant the most lifelike AI yet
Exclusive: Google's Duplex could make Assistant the most lifelike AI yet
Experimental technology, rolling out soon in a limited release, makes you think you’re talking to a real person.
BY RICHARD NIEVA MAY 9, 2018
The conversation seems mundane. At least, at first.
In a building called the Partnerplex on Google's sprawling campus in Mountain View, California, I've been invited to hear a 51-second phone recording of someone making a dinner reservation.
Person 1: Good evening.
Person 2: Hello?
Person 1: Hello.
Person 2: Hi, um, I'd like to reserve a table for Friday the third.
Person 1: OK, hold on one moment.
Person 2: Mm hmm.
Person 1: OK… hold on one second.
Person 2: Mm hmm.
Person 1: So Friday November third. How many people?
Person 2: For... two people.
Person 1: Two people?
Person 2: Yeah.
As I listen to what sounds like a man and a woman talking, Google's top executives for Assistant, the search giant's digital helper, watch closely to gauge my reaction. They're showing off the Assistant's new tricks a few days before Google I/O, the company's annual developer conference that starts Tuesday.
Turns out this particular trick is pretty wild.
That's because Person 2, the one who sounds like a man, isn't a person at all. It's the Google Assistant. And it (or "he"? That's its own debate) doesn't sound at all like the semirobotic, disembodied voice you usually hear out of a Google Home smart speaker when it's giving you updates on the weather or telling you how long it'll take you to commute to work. (You can hear the Assistant's conversations for yourself in the video below.)
This could be the next evolution of the Assistant, Google's rival to Amazon's Alexa, Apple's Siri and Microsoft's Cortana. It sounds remarkably -- maybe even eerily -- human, pausing before responding to questions and using verbal ticks, like "um" and "uh." It says "mm hmm" as if it's nodding in agreement. It elongates certain words as though it's buying time to think of an answer, even though its responses are instantaneously programmed by algorithms.
With this new speaking ability, Google Assistant gets that much closer to hitting a milestone in the evolution of computing: passing the Turing test. Proposed by English computer scientist Alan Turing in 1950, it's a way of potentially evaluating a machine's ability to demonstrate intelligent behavior. To pass the Turing test, a computer's natural language responses would have to sound just like a human's.
Built with technology Google calls "Duplex" -- and developed by engineers and product designers in Tel Aviv, New York and Mountain View -- the AI sounds as though the future of voice assistants has arrived.
Well, almost arrived.
The demo is part of what Google calls an "experiment" it plans to launch this summer. With Duplex, a "small" number of people will be able to book restaurant reservations and hair appointments and to check holiday hours, stuff you'd normally do over the phone. All of that back and forth happens on the back end -- between Google Assistant and, say, the restaurant. You won't even hear the voice chat taking place. It'll come from an unspecified phone number, not your own.
Google is coy about the size of the rollout, but says it'll be limited -- all the company will say is it won't be available to everyone using Assistant today. Nick Fox, vice president of product and design for Google Assistant and Search, and Yossi Matias, Google's vice president of engineering, say the search giant wants to "proceed with caution" because it's such a new technology.
Google also won't say if its goal is to bring this natural-sounding Assistant to more products -- like hearing it come out of your Google Home.
Here's how the reservation feature works: Say you want to book a haircut appointment. Just ask the Assistant from your Google Home, or text it from your Android phone or iPhone app, telling it where and when. If you want something at noon on Sunday, the Assistant will ask you for backup options just in case that time slot isn't available -- it knows that getting a range between, say, noon and 2 p.m. may be the smarter way to go.
If the business uses an automated online booking service like OpenTable, the Assistant will default to using that system for the appointment. But if the business doesn't take automated reservations that way -- and many small businesses don't -- Google's helper will make the call for you. The Assistant will text you back confirming your appointment and add it to your calendar.
In a different recorded conversation, I hear two female voices setting up a hair appointment. One is the Duplex technology. "She" sounds naturally human. She even does that thing with her voice that annoys some people, where her inflection goes up at the end of a sentence, even when not answering a question.
"We think of it as, 'What's a perfect assistant?'" says Fox, a 15-year Google veteran. "When I want an assistant with me, it's there. It's always ready to help. I don't worry about 'Can my assistant do it or not'? My assistant can do anything I want it to do."
Fox, who's spent most of his time at Google working on search ads, is now in charge of product for one of Google's landmark projects. It's been almost exactly two years since Google announced the Assistant at I/O 2016. At the time, CEO Sundar Pichai sat down with CNET and explained his vision: The Assistant is the essence of Google, set free from the search box on the company's iconic, sparse homepage.
The goal for Pichai and his team is to have Google's artificial intelligence work alongside you as you go about your day: With the Assistant today, you can get morning news headlines from your Google Home, point your phone's camera at a flower to discover what species it is or turn up your thermostat so the house is a comfortable temperature when you get home.
But if Google's experiment becomes reality, the Assistant will truly be your digital assistant. "Think of it as building your own individual Google," Pichai told us two years ago.
But as impressive as the Duplex phone recording is, we have to take the company's word for it. Google declined to do a live demo of the call during my briefing.
In the meantime, Google is trying to win more users with an updated Assistant that speaks in six new voices beyond the one female and one male it already has. The company is also making it easier for you to ask follow-up questions, and do things like send someone your ETA when you're driving and using Google Maps.
That personal touch
If the goal is for the Assistant to be Google personified, it still has a lot of evolving to do.
So far, Google's helper -- and honestly, Alexa, Siri and Cortana, too -- haven't lived up to their promise.
"We're still developing this technology, and we're actually working hard to get this right," Pichai said in Tuesday's keynote presentation at Google I/O. "We have many of these examples when the calls don't quite go as expected. But the Assistant knows the context and nuance. It knew to ask for wait times in this case, and it handled the interaction gracefully."
If Google ever decides to expand the Duplex technology to Google Home and other consumer-facing products, that could change. Based on the short conversations I heard, it could feel like you're talking to an administrative assistant at the office or maybe even Samantha, the OS from the Spike Jonze movie Her.
Of course, Google wasn't the first to the assistant market. Amazon, in a coup, surprised the world in 2014 when it released its Echo smart speaker, given voice by its Alexa software. For right now, smart speakers remain the best-known way to interact with a digital helper, and Amazon owns 69 percent of the market in the US. Google Home, released in 2016, has 31 percent, according to CIRP, a research firm.
Both Google and Amazon have been beefing up their product lines. Last year, Google introduced the $40 Home Mini and $400 Home Max, with high-end audio. Amazon has 10 Echo products, ranging in price from $40 to $230. Apple joined the battle in February with the $350 HomePod.
But there's potentially bigger money to made when digital assistants connect with the internet of things -- controlling everything from your car's dashboard to your washing machine to your light switches. People will spend $1 trillion on the internet of things by 2020, according to Gartner.
That's why Google has been working hard to catch up. Today, the company announced that 500 million devices have shipped globally with the Assistant on them, including speakers, phones and TVs. Google also said the Assistant can speak in 30 languages and will be in 80 countries by the end of the year, including seven new ones: Denmark, Korea, Mexico, the Netherlands, Norway, Spain and Sweden. Amazon's Echo is also available in more than 80 countries.
And last week, Google said the Assistant now partners with 5,000 household connected devices, up from 1,500 in January. That's an improvement, but still less than half of the 12,000 Alexa-enabled devices.
You can now ask two questions at a time, and the Assistant will answer both in a row.
Google's Fox tries to downplay to the rivalry with Amazon.
"We think of it less in a competitive way," Fox says. "The bigger opportunity is for people who aren't using anything yet."
Still, both tech giants haven't been afraid to play hardball and undercut each other for smart home sales. Google in January cut off Amazon Echo and Fire TV access to YouTube, which it owns. Amazon won't sell some gadgets made by Google and Nest, the smart home maker Google bought in 2014.
'We don't know all the answers'
Google has been working on its new natural-sounding voice technology "for years," says Matias, who leads the R&D center in Tel Aviv. The company says Duplex isn't broad AI technology for your phone. Instead, it's geared to specific tasks (like booking reservations, for starters). Matias readily admits that the tech isn't perfect and that Google is moving carefully because there are lots of thorny issues that come with a more human-sounding AI.
Like, what happens if the software starts talking nonsense? Matias tells me the tech won't go rogue and leave a trail of confused small-business owners talking to robots. When the feature actually rolls out, he says, the software will likely tell the person on the other end of the line that he or she is talking to a digital assistant -- even though the Assistant didn't do that during the recordings I heard.
"We need to be thoughtful about how we have this interaction while we're experimenting with it," Matias says. "We believe it's right to do this with transparency."
If a conversation does go awry, the software will fall back on what he calls "real-time supervised training." In that scenario, the software would turn over the conversation to a human to finish the call. Because calls aren't normally monitored by people, they'd have to begin from scratch. Google wouldn't say whether that person would have a log of what's already been discussed on the call.
Nick Fox, VP of product and design for Google Assistant and Search: "We don't want to force people into, 'This is what an assistant should sound like'."
The Duplex technology is the stuff of sci-fi. And that raises questions, both practical and philosophical: How should a person treat a lifelike AI? What's the proper human-robot etiquette or does that need to be developed? What's the best way for an AI to reveal it's a robot when talking to a human -- without disrupting the flow of a conversation?
Asked about these issues, Fox acknowledged that the folks at Google are learning about these things as they go.
"We need to set the right expectations with everyone," he says. "These are implementation questions of the technology that I'd say, humbly, we don't quite know all the answers to yet, and need to be figured out as we see this in the real world."
Paul Saffo, a Silicon Valley futurist now with Stanford University, has some thoughts on how this should play out. The creators of any realistic AI have the obligation to disclose to anyone who interacts with it that they're talking to a machine, he says.
But as this sort of technology becomes more widely used, he thinks people will begin to recognize the "tells" in a bot. "As our machines evolve, there's the hope that people will start to figure out if they're talking to one," Saffo says.
The Assistant everywhere
While Duplex remains an experiment for now, the Assistant is getting upgrades in other areas, some as soon as today.
That includes more voice options. There are six new ones, both male and female, that all speak with an American accent. One female voice is soft and soothing. A new male voice is a little deeper with a slight rasp.
"My wife hates that one. She's like, 'Get that out of my room,'" Fox says, as he plays a male voice that booms like an anchorman. The expanded library of voices is made possible by technology called WaveNet, which dramatically reduces the time and audio samples needed by a voice actor recording each track.
We need to be thoughtful about how we have this interaction while we're experimenting with it. We believe it's right to do this with transparency.
The voices aren't labeled with specific names. Instead, each is designated by a color, like red, orange or green. You select the one you want by scrolling through your phone's settings. When you're connected to a Google Home, you just pick your favorite voice setting and the speaker will recognize your voice and respond with the Assistant voice of your choice.
Google also teased that it will bring singer John Legend's voice to the Assistant, too.
The idea, as with most of these updates, is to make the Assistant more personal. "We don't want to force people into, 'This is what an assistant should sound like," Fox says.
Another update, due this summer, is a mode called Pretty Please that will require you to be polite when giving the Assistant commands. The goal is to reinforce good manners in children -- and adults, too -- by prompting them to say please and thank you.
Google VP of Engineering Yossi Matias says the search giant has been working on natural-sounding voice technology "for years."
Amazon announced a similar feature last month for a new line of Echo Dot speakers designed for kids. The child-friendly version of Alexa offers positive reinforcement to a kid who says "please" by responding, "Thanks for asking so nicely."
While Amazon announced its version first, Google says it's been working on Pretty Please for a while. "All of this is really nascent," says Lilian Rincon, director of product management for the Assistant. "It's not surprising we'd both be grappling with these questions."
There's plenty of other new stuff: You can now ask two questions at a time, and the Assistant will answer both in a row. (For example: How old is Kendrick Lamar and what's the name of his latest album?). You can also ask follow-up questions without saying "Hey Google" again or repeating the subject of your question. The mic will stay hot for up to 8 seconds before the window closes. Before this, the mic went inactive after you asked your question.
When I want an assistant with me, it's there.
The Assistant can text people your ETA when you're navigating with Google Maps while driving. It will show you more visual content when you access the Assistant on your phone -- things like reminders, upcoming flight information and to-do items. You'll also be able to set up a "custom routine," which triggers a set of actions right after you shut off your morning alarm. That means you could automatically start your day listening to headlines, a traffic update and a favorite song to pump you up.
It's all useful. But it also means the Google Assistant is getting to know you a lot more intimately.
That's worth talking about because user privacy is a major topic of discussion in tech. The data collection practices of Silicon Valley's biggest companies have been under extra scrutiny in the weeks since Facebook's scandal with Cambridge Analytica began in March. The digital consultancy got access to personal information from up to 87 million Facebook users without their knowledge. The scandal has raised questions about how tech giants secure your data and how they make money. For the most part, Google, Facebook and others rely on troves of personal information they've collected about your likes, dislikes, shopping preferences, lifestyle and more to allow marketers to target specific ads to you.
Google made $95.4 billion in ad sales last year, while Facebook made most of its $40 billion in 2017 sales from ads.
"Our principles here have always been to give users control over that information," Fox says. "The last few weeks have underlined that this is incredibly important. But we've always known it's important. It's healthy for us to get that reinforcement."
Assistant queries, he says, are treated exactly the same way as regular search queries, and have all the same user settings. That means you can clear your search history, but that may affect which ads come your way.
All that segues into the real reason we're talking about the Assistant. As Pichai noted back in 2016, it's the next evolution of Google search, with people spending less time with their desktop computers and more time with their phones and other internet-connected things, such as smart speakers and cars.
"Done correctly, it will save time for people and generate a lot of value for businesses," Pichai said on stage.
As my interview comes to a close, I ask Fox if he thinks the Assistant is Google's most important project right now.
He pauses for 13 seconds before answering.
"The Assistant is a critical effort for Google," Fox says. "It's something really important to get right."