Stuart Russell: 3 principles for creating safer AI
斯图尔特·罗素: 人工智能是如何让我们变得更好的
Stuart Russell wrote the standard text on AI; now he thinks deeply on AI's future -- and the future of us humans, too. Full bio
Double-click the English transcript below to play the video.
最顶尖的围棋高手之一,
greatest Go players,
足以让我硅谷的朋友们
in Silicon Valley call
比我们预想的要快得多。
a lot faster than we expected.
What about the real world?
那在现实世界中又如何呢?
比围棋棋盘要大得多,
of the technologies
is not yet happening in machines,
that the human race has ever written.
further ahead than humans can,
to more information,
做出比人类更好的选择。
in the real world than we can.
everything that we value,
我们所珍视的一切,
to a lot more intelligence,
就真的没有极限了。
to what the human race can do.
as some people have described it,
of the human race?
比尔盖茨,和斯提芬霍金吗?
and Stephen Hawking?
has been around for a while.
这个概念已经存在很长时间了。
维持在一个屈服于我们的地位,
in a subservient position,
at strategic moments" --
”关闭电源“这一话题,
"turning off the power" idea later on --
仍然应该自感惭愧。“
feel greatly humbled."
This is Alan Turing in 1951.
是阿兰图灵,他在1951年说的。
is the father of computer science
是计算机科学之父。
他也是人工智能之父。
the father of AI as well.
物种的问题时,
more intelligent than your own species,
a few million years ago,
祖先们几百万年前所经历的。
开会讨论那么做是不是一个好主意,
to discuss whether it was a good idea,
they conclude, no,
sadness in their eyes.
something smarter than your own species
除了停止研究人工智能,
except stop doing AI,
the benefits that I mentioned
我之前所说的诸多益处,
人工智能的研究者之一,
to keep doing AI.
the problem a bit more.
人工智能可能会是灾难呢?
that the purpose put into the machine
给机器输入的目的和价值
一个早期的学习系统,
one of the very early learning systems
西洋棋下得比它的创造者更好。
better than its creator.
I touch to turn to gold,"
我触碰的所有东西都变成金子。“
that he put into the machine,
他的家人都变成了金子,
and his relatives turned to gold
"the King Midas problem"
叫做”迈达斯问题“,
which is not, in fact,
我们把它称为”价值一致性问题“。
"the value alignment problem."
is not the only part of the problem.
仅仅是问题的一部分。
比如说”去把咖啡端来“,
"Fetch the coffee,"
to fetch the coffee?
阻止别人把我关掉。
against interference
不让别人干涉我,
that I have been given."
of an objective that is, in fact,
模式去实现某一目标,
想实现的目标并不一致——
of the human race --
takeaway from this talk.
你就不能去端咖啡了。
the coffee if you're dead.
每天对自己重复三遍。
Repeat it to yourself three times a day.
with the objectives of the humans,
is not superintelligent.
比不过人类主角戴夫,
but eventually Dave outwits him
机器通过智能去达成目标。
pursue objectives.
of altruism, if you like,
of human objectives,
touchy-feely, goody-goody values.
that the human would prefer
its own existence.
对维护自身生存毫无兴趣。
its existence whatsoever.
of humility, if you like.
important to make robots safe.
却不知道这价值究竟是什么。
but it doesn't know what they are.
of single-minded pursuit
关于我们想要什么的信息。
by observation of human choices,
做的选择来获取这样的信息,
是什么样的信息,
our lives to be like.
to this question of:
“将机器关掉”这个问题上来。
as Turing suggested.
right on the back.
going to let you switch it off?
它会想:”我必须去拿咖啡,
the coffee, I must fetch the coffee,
has been listening to my talk,
"I must disable my 'off' switch,
people in Starbucks
seems to be inevitable,
一个十分确定的目标。
a concrete, definite objective.
is uncertain about the objective?
不那么确定会发生什么呢?
might switch me off,
principles right there.
the incentive that the robot has
the underlying objective.
it should be pursuing,
what it did wasn't right.
of Greek symbols,
对人们是绝对有利的。
is provably beneficial to the human.
有如此设计的机器人会变得
with a machine that's designed in this way
but this is the first step
兼容的人工智能的第一步。
with human-compatible AI.
为这一个原则而大伤脑筋。
scratching your head over.
我有时不按规矩办事。
you know, I behave badly.
像我一样行事。
从冰箱里找东西吃,
and take stuff from the fridge.
不希望机器人去做的。
you don't want the robot doing.
quite work that way.
is going to copy your behavior.
而且可能会在合适的情况下制止你去做
and maybe help you resist them,
在他们的任何一种
for any person and for any possible life
difficulties involved in doing this;
is going to get solved very quickly.
we behave badly.
我们做事不守规矩,
doesn't have to copy the behavior.
机器人并不会复制那些行为,
any objective of its own.
the desires of one person, the user,
某一个人、一个用户的欲望,
the preferences of everybody.
amount of nastiness,
that your nastiness, for example,
很可能收取贿赂,
得供你的孩子们上学。
and send your kids to school.
它不会因此去偷,
it doesn't mean it's going to steal.
send your kids to school.
他最终输掉了棋局。
he took an action that lost the game.
through a model of human cognition
是一个很复杂的模型,
limitations -- a very complicated model.
that we can work on understanding.
from my point of view as an AI researcher,
人工智能研究人员来说最大的困难,
trade off, weigh up the preferences
哲学家都理解这一点,
moral philosophers have understood that,
looking for collaboration.
把这一步弄错了会怎么样。
when you get that wrong.
与你的人工智能助理,
a conversation, for example,
in a few years' time.
to remind you about dinner tonight."
提醒你今晚要跟她共进晚餐。”
"What? What dinner?
“什么?什么晚饭?
庆祝结婚20周年纪念日。”
with the secretary-general at 7:30.
我约了晚上7点半见领导。
但你不听我的建议。”
my recommendation."
I can't just tell him I'm too busy."
跟领导说我有事,没空见他。”
for his plane to be delayed."
让他的航班延误。
中午午饭不见不散。”
for lunch tomorrow."
there's a slight mistake going on.
after a hard day's work,
Could you make some dinner?"
who are in more urgent need than you."
你自己做饭去吧。”
to working on them.
they're going to read everything
我们做的什么事情,
is human beings doing things
of data to learn from.
我们也有足够的动机
strong economic incentive
机器人得给孩子们做饭,
and the robot has to feed the kids,
但冰箱里什么都没有。
and there's nothing in the fridge.
the human value function properly,
大于猫的营养价值。
the nutritional value of the cat.
把猫煮了给主人当晚饭!”
for family dinner."
整个居家机器人产业。
of the domestic robot industry.
to get this right
superintelligent machines.
the definition of AI
beneficial machines.
about what those objectives are,
that we really want.
we will learn to be better people.
我们也能学会成为更好的人。
非常有意思,斯图尔特。
为下一位演讲者布置的时候
because I think they're setting up
似乎是一个很重要的理念,
seems intuitively really powerful.
this idea that knowledge
and rewriting that programming?
去重新编写程序呢?
我们想要它去学习,就像我说的,
it to learn more, as I said,
as it becomes more correct,
才会变得更确定,
to interpret it correctly.
that books are very biased
都集中在一条准则里吗?
just boil it down to one law,
a self-driving car
能在汽车运行过程中
to be able to switch off the car
是不是讲道理。
and sensible the person is.
to be switched off.
或者甚至是有恶意的,
random or even malicious,
to be switched off.
能把这一切研究出来,
figure this out for us.
That was amazing.
ABOUT THE SPEAKER
Stuart Russell - AI expertStuart Russell wrote the standard text on AI; now he thinks deeply on AI's future -- and the future of us humans, too.
Why you should listen
Stuart Russell is a professor (and formerly chair) of Electrical Engineering and Computer Sciences at University of California at Berkeley. His book Artificial Intelligence: A Modern Approach (with Peter Norvig) is the standard text in AI; it has been translated into 13 languages and is used in more than 1,300 universities in 118 countries. His research covers a wide range of topics in artificial intelligence including machine learning, probabilistic reasoning, knowledge representation, planning, real-time decision making, multitarget tracking, computer vision, computational physiology, global seismic monitoring and philosophical foundations.
He also works for the United Nations, developing a new global seismic monitoring system for the nuclear-test-ban treaty. His current concerns include the threat of autonomous weapons and the long-term future of artificial intelligence and its relation to humanity.
Stuart Russell | Speaker | TED.com