Stuart Russell: 3 principles for creating safer AI
Stuart Russell wrote the standard text on AI; now he thinks deeply on AI's future -- and the future of us humans, too. Full bio
Double-click the English transcript below to play the video.
greatest Go players,
in Silicon Valley call
a lot faster than we expected.
What about the real world?
of the technologies
is not yet happening in machines,
that the human race has ever written.
further ahead than humans can,
to more information,
in the real world than we can.
everything that we value,
to a lot more intelligence,
to what the human race can do.
as some people have described it,
of the human race?
and Stephen Hawking?
has been around for a while.
in a subservient position,
at strategic moments" --
"turning off the power" idea later on --
feel greatly humbled."
This is Alan Turing in 1951.
is the father of computer science
the father of AI as well.
more intelligent than your own species,
a few million years ago,
to discuss whether it was a good idea,
they conclude, no,
sadness in their eyes.
something smarter than your own species
except stop doing AI,
the benefits that I mentioned
to keep doing AI.
the problem a bit more.
that the purpose put into the machine
one of the very early learning systems
better than its creator.
I touch to turn to gold,"
that he put into the machine,
and his relatives turned to gold
"the King Midas problem"
which is not, in fact,
"the value alignment problem."
is not the only part of the problem.
"Fetch the coffee,"
to fetch the coffee?
against interference
that I have been given."
of an objective that is, in fact,
of the human race --
takeaway from this talk.
the coffee if you're dead.
Repeat it to yourself three times a day.
with the objectives of the humans,
is not superintelligent.
but eventually Dave outwits him
pursue objectives.
of altruism, if you like,
of human objectives,
touchy-feely, goody-goody values.
that the human would prefer
its own existence.
its existence whatsoever.
of humility, if you like.
important to make robots safe.
but it doesn't know what they are.
of single-minded pursuit
by observation of human choices,
our lives to be like.
to this question of:
as Turing suggested.
right on the back.
going to let you switch it off?
the coffee, I must fetch the coffee,
has been listening to my talk,
"I must disable my 'off' switch,
people in Starbucks
seems to be inevitable,
a concrete, definite objective.
is uncertain about the objective?
might switch me off,
principles right there.
the incentive that the robot has
the underlying objective.
it should be pursuing,
what it did wasn't right.
of Greek symbols,
is provably beneficial to the human.
with a machine that's designed in this way
but this is the first step
with human-compatible AI.
scratching your head over.
you know, I behave badly.
and take stuff from the fridge.
you don't want the robot doing.
quite work that way.
is going to copy your behavior.
and maybe help you resist them,
for any person and for any possible life
difficulties involved in doing this;
is going to get solved very quickly.
we behave badly.
doesn't have to copy the behavior.
any objective of its own.
the desires of one person, the user,
the preferences of everybody.
amount of nastiness,
that your nastiness, for example,
and send your kids to school.
it doesn't mean it's going to steal.
send your kids to school.
he took an action that lost the game.
through a model of human cognition
limitations -- a very complicated model.
that we can work on understanding.
from my point of view as an AI researcher,
trade off, weigh up the preferences
moral philosophers have understood that,
looking for collaboration.
when you get that wrong.
a conversation, for example,
in a few years' time.
to remind you about dinner tonight."
"What? What dinner?
with the secretary-general at 7:30.
my recommendation."
I can't just tell him I'm too busy."
for his plane to be delayed."
for lunch tomorrow."
there's a slight mistake going on.
after a hard day's work,
Could you make some dinner?"
who are in more urgent need than you."
to working on them.
they're going to read everything
is human beings doing things
of data to learn from.
strong economic incentive
and the robot has to feed the kids,
and there's nothing in the fridge.
the human value function properly,
the nutritional value of the cat.
for family dinner."
of the domestic robot industry.
to get this right
superintelligent machines.
the definition of AI
beneficial machines.
about what those objectives are,
that we really want.
we will learn to be better people.
because I think they're setting up
seems intuitively really powerful.
this idea that knowledge
and rewriting that programming?
it to learn more, as I said,
as it becomes more correct,
to interpret it correctly.
that books are very biased
just boil it down to one law,
a self-driving car
to be able to switch off the car
and sensible the person is.
to be switched off.
random or even malicious,
to be switched off.
figure this out for us.
That was amazing.
ABOUT THE SPEAKER
Stuart Russell - AI expertStuart Russell wrote the standard text on AI; now he thinks deeply on AI's future -- and the future of us humans, too.
Why you should listen
Stuart Russell is a professor (and formerly chair) of Electrical Engineering and Computer Sciences at University of California at Berkeley. His book Artificial Intelligence: A Modern Approach (with Peter Norvig) is the standard text in AI; it has been translated into 13 languages and is used in more than 1,300 universities in 118 countries. His research covers a wide range of topics in artificial intelligence including machine learning, probabilistic reasoning, knowledge representation, planning, real-time decision making, multitarget tracking, computer vision, computational physiology, global seismic monitoring and philosophical foundations.
He also works for the United Nations, developing a new global seismic monitoring system for the nuclear-test-ban treaty. His current concerns include the threat of autonomous weapons and the long-term future of artificial intelligence and its relation to humanity.
Stuart Russell | Speaker | TED.com