Sundar Pichai at Google I/O 2019 Keynote (Full Transcript)

Following is the full transcript of the entire Google I/O 2019 developer keynote event. Google’s CEO Sundar Pichai and the team announced latest products and services that the company provides. This event occurred on May 7, 2019 at Shoreline Amphitheatre, Mountain View, California, United States.

Speakers at the event:

Sundar Pichai – CEO, Google

Aparna Chennapragada – VP of Product for AR and VR, Google

Scott Huffman – Vice President, Google Assistant

Stephanie Cuthbertson – Senior Director for Android

Rick Osterloh – SVP of Hardware

Sabrina Ellis – VP of Product Management

Jeff Dean – Lead of Google AI

Lily Peng – Product Manager, Google AI Healthcare Team

Sundar Pichai – CEO, Google

Good morning. Good morning. Wonderful to be back here at Shoreline with all of you.

It’s been a really busy few months for us at Google. We just wrapped up Cloud Next in San Francisco with over 30,000 attendees, as well as YouTube Brandcast last week in New York.

Of course, today’s about you all, our developer community. And thank you all for joining us in person, and to the millions around the world watching on livestream.

I would love to say welcome in all our languages our viewers speak, but we are going to keep the keynote under two hours, especially since Barcelona kicks off against Liverpool at noon for you. That should be an amazing game.

Every year at I/O, we learn and try to make things a little bit better. That’s why we have lots of sunscreen — hope the sun comes out — plenty of water and shade. But this year, we want to make it easier for you to get around. So we are using AR to help.

To get started, open your I/O app and choose Explore I/O. And then you can just point your phone where you want to go. We really hope this helps you get around and answers the number one question people have: where the sessions are. Actually, it’s not that. They want to know where the food is. And we have plenty of it around.

We also have a couple of Easter eggs, and we hope you enjoy them as well. This is a pretty compelling use case. And we actually want to generalize this approach so that you can explore and navigate the whole world that way. There’s a lot of hard work ahead. And it’s a hard computer science problem. But it’s the type of challenge we love.

Tackling these kinds of problems is what has kept us going for the past 21 years. And it all begins with our mission to organize the world’s information and make it universally accessible and useful.

And today, our mission feels as relevant as ever. But the way we approach it is constantly evolving. We are moving from a company that helps your find answers to a company that helps you get things done.

This morning, we’ll introduce you to many products built on a foundation of user trust and privacy. And I’ll talk more about that later.

We want our products to work harder for you, in the context of your job, your home, and your life. And they all share a single goal: to be helpful, so we can be there for you in moments big and small over the course of your day. For example, helping you write your emails faster with automatic solutions from Smart Reply, and giving you the chance to take them back if you didn’t get it right the first time, helping you find the fastest route home at the end of a long day, and when you get there, removing distractions so that you can spend time with the people most important to you.

And when you capture those perfect moments, backing them up automatically so you never lose them.

Simply put, our goal is to build a more helpful Google for everyone. And when we say “helpful,” we mean giving you the tools to increase your knowledge, success, health, and happiness. We feel so privileged to be developing products for billions of users. And with that scale comes a deep sense of responsibility to create things that improve people’s lives.

By focusing on these fundamental attributes, we can empower individuals and benefit society as a whole. Of course, building a more helpful Google for us always starts with search and the billions of questions users trust Google with everyday. But there is so much more we can do to help our users.

Last year, we launched a new feature in Google News called Full Coverage. And we have gotten great feedback on it from our users. We’ll be bringing Full Coverage directly to search to better organize results for news-related topics. Let’s take an example.

If you search for “black hole,” we’ll surface the relevant top news. It was in the news recently. We use machine learning to identify different types of stories and give you a complete picture of how a story is being reported from a wide variety of sources. You can click into Full Coverage. It serves as a breadth of content, but allows you to drill down into what interests you.

[read more]

You can check out different aspects of the story, like how the black hole got its name. You can even now see a timeline of events. And we’ll be bringing this to search later this year.

Podcasts are another important source of information. And we’ll be bringing them directly to search as well. By indexing podcasts, we can surface relevant episodes based on their content, not just the title. And you can tap to listen right in search results, or you can save an episode for listening later on your commute or your Google Home.

These are all examples of how we are making search even more helpful for our users, surfacing the right information in the right context.

And sometimes, what’s most helpful in understanding the world is being able to see it visually.

To show you how we are bringing you visual information directly in search, here’s Aparna.

Aparna Chennapragada – VP of Product for AR and VR, Google

Whether you’re learning about the solar system or trying to choose a color scheme for your home, seeing is often understanding.

With computer vision and augmented reality, the camera in our hands is turning into a powerful visual tool to help you understand the world around you.

So today, we are excited to bring the camera to Google search, adding a new dimension to your search results — well, actually three dimensions, to be precise. So let’s take a look.

Say you’re a student studying human anatomy. Now, when you search for something like muscle flexion, you can view a 3D model built by Visible Body right from the search results. Pretty cool.

Not only that, you can also place it in your own space. Look, it’s one thing to read about flexion or extension, but seeing it in action right in front of you while you’re studying the concept, very handy.

OK, let’s take another example. Say, instead of studying, you’re shopping for a new pair of shoes. That happens. With New Balance, you can look at shoes up close from different angles, again, directly from search. That way, you get a much better sense for things like, what does the grip look like on the sole, or how they match with the rest of your clothes.

OK, this last example is a really fun one. So you may have all seen a great white shark in the movies. “Jaws,” anyone? But what does it actually look like up close? Let’s find out, shall we?

I have Archana here with me to help with the demo. So let’s go ahead and search for “great white shark” on Google. As you scroll through, you get information on the knowledge panel facts, but also see the shark in 3D directly from the knowledge panel.

Why don’t we go one step further? Why don’t we invite the shark to the stage? Whoa!

There it is. It’s one thing to read a fact like “a great white can be anywhere between 17 feet to 21 feet long,” but to see it in front of you at scale, filling up the Shoreline stage like a rock star, that is truly understanding its scale.

OK, let’s take a closer look. It’s an AR shark. It won’t bite. Ooh. Look at those layers of teeth. You know, I don’t know about you all, but I’d much rather see these teeth up close in AR than in real life.

Thank you, Archana.

Really excited about bringing the camera and AR capabilities to Google search.

Now, sometimes, though, the things that you’re interested in, they’re difficult to describe in a search box. So that’s why we created Google Lens, to help you search and do more with what you see by simply pointing your camera. The built lens has a capability across products. So you can access it directly from the Google Assistant. But we’ve also built it into Google Photos and the Camera app on many Android devices.

People have already used Lens more than a billion times so far. And they’ve used it to ask questions about what they see, like what kind of flower that is, or where to get a lamp like that, or just who the artist is.

One way we’ve been thinking about it is, with Lens, we’re indexing the physical world, billions of places and products and so on, much like search indexes the billions of pages on the web.

OK, today, let me show you some new ways that we’re making Lens more helpful to you. Say you’re at a restaurant trying to figure out what to order. Instead of going from the menu to different apps on the phone and back to the menu and so on, you can simply point your camera. Lens automatically highlights the popular dishes at this restaurant right on the menu.

And of course, if you want to know more, you can tap on any dish on the menu, and you can see what it looks like, again, at the restaurant — and, of course, check out what other people are saying about it on Google Maps.

By the way, when you’re done eating, Lens can help pay for your meal. Not so fast. It’s not picking up your tab. But it can calculate the tip and even split the total — again, just by pointing your camera at the receipt. And voila.

So you saw how we connected the menu with information from Google Maps. But we’re starting to think of other ways that we can connect helpful digital information with the things in the physical world. So I’m going to give you just one example.

So you’re flipping through a “Bon Appetit” magazine and you see a recipe you like. Soon, you can point your camera at the recipe and see the page come alive, showing you how to make the dish. We’re starting to work with more partners, like museums, magazine publishers, and retailers, to bring unique visual experiences like this.

There’s one final area where we think that the camera can be particularly helpful to people. Around the world, there are more than 800 million adults who are struggling to read the words that they come across in their daily lives — bus schedules, bank forms, et cetera. And many of them are coming online for the first time with a smartphone.

So to help with that, we’ve integrated a new camera capability into Google Go. This is our search app for entry level devices. Take this sign in English next to an ATM. Now, for someone who does not understand the language and cannot read the words, this is important information that they’re not getting access to. And we think that the camera can help here. So let me show you how.

So directly from the Google search bar, you can use Lens, open it, point it at the sign to hear the text read out aloud to you.

[Google Assistant: Information for card holders — all customers using old proprietary magnetic stripe cards should be advised.]

What is nice here is that it is highlighting the words as they’re spoken. That way, even if you can’t read the language well, you can follow along, and you understand the full context of what you see.

You can also translate it into your own language, like this.

Notice that the translated text is overlaid right on top of the original sign. It almost feels like the sign was written in your own language to start with. And again, you can hit Listen and hear the words read out loud, this time in your own language.

[Google Assistant: [Speaking Spanish]

What you’re seeing here is text-to-speech, computer vision, the power of translate, and 20 years of language understanding from search all coming together.

Now, our teams in India have been working with some early testers and getting a lot of feedback to make the product better. And I want to now show you how one of them is using it in her daily life. Take a look.

[Video Clip]

Thank you, Urmila, for testing it and giving us a lot of feedback for the team to make the product better.

The power to read is the power to buy a train ticket, to shop in a store, to follow the news. It’s the power to get things done. So we want to make this feature accessible to as many people as possible. So it already works in more than a dozen languages. And the teams worked incredibly hard to compress all of this tech to just over 100 kilobytes.

That way, it can work on phones that cost as little as $35. So we’re super excited about this and all the other features across Search and Lens to help you throughout the day. You’ll start to see these updates roll out later this month.

Thank you.

Sundar Pichai – CEO, Google

Thanks Aparna. Helpfulness is also about saving time and making your day a little bit easier. That’s why, last year, at I/O, we gave you a first look at our Duplex technology.

Duplex enables Google Assistant to make restaurant reservations on your behalf by actually placing a call. It’s now available in 44 states across the US. And we’ve gotten great feedback not only from our users, but from businesses as well.

For us, Duplex is the approach by which we train AI on simple but familiar tasks to accomplish them and save you time. Duplex was launched with restaurant reservations on the phone. But now, we are moving beyond voice and extending Duplex to tasks on the web.

We again want to focus on narrow use cases to start. So we are looking at rental car bookings as well as movie ticketing. Today, when you make a new reservation online, you have to navigate a number of pages and steps, filling out information and making selections along the way. I’m sure you’re all familiar with this experience.

It’s time consuming. And if users leave during the workflow, businesses lose out as well. We want to make this experience better for both users and businesses. So let me show you how that system can do it better.

Say you get a calendar reminder about an upcoming trip. And you want to book a rental car. You can just ask Google, book a National car rental for my next trip. The Assistant opens the National website and automatically starts filling out your information on your behalf, including the dates of the trip.

You can confirm the details with just a tap. And then the Assistant continues to navigate the site. It even selects which car you like. It’s acting on your behalf and helping you save time, but you’re always in control of the flow.

Let’s go ahead and add a car seat. And once all the details are in, you can check everything one last time and just tap to finalize the reservation. You’ll immediately get a booking confirmation.

It’s amazing to see the Assistant complete a task online on your behalf in a personalized way. It understands the dates of your trip and your car preferences based on trip confirmations in Gmail.

And I also want to point out that this was not a custom integration. This required no action on part of the business to implement. What you just saw is an early preview of what we are calling Duplex on the Web. We’re going to be thoughtful and get feedback from both users and businesses to improve the experience. And we’ll have more details to share later this year.

The Google Assistant helps people around the world with all kinds of tasks, whether they are at home or on the go. But we want to build an even more helpful assistant.

In order to process speech today, we rely on complex algorithms that include multiple machine learning models. One model maps incoming sound bites into phonetic units. Another one takes and assembles these phonetic units into words. And then a third model predicts the likelihood of these words in a sequence. They are so complex that they require 100 gigabytes of storage and a network connection.

Bringing these models to your phone — think of it as putting the power of a Google data center in your pocket — is an incredibly challenging computer science problem. I’m excited to share we have reached a significant milestone.

Further advances in deep learning have allowed us to combine and shrink the 100-gigabyte models down to half a gigabyte, small enough to bring it onto mobile devices. This eliminates network latency and makes the Assistant so much faster — so fast that tapping to use your phone would seem slow.

I think this is going to transform the future of the Assistant. And I’m thrilled to bring Scott to tell you more about our next generation Assistant.