Joseph Redmon: How computers learn to recognize objects instantly
约瑟夫·雷德蒙: 计算机如何学会快速识别物体
Joseph Redmon works on the YOLO algorithm, which combines the simple face detection of your phone camera with a cloud-based AI -- in real time. Full bio
Double-click the English transcript below to play the video.
要让一台电脑
thought that getting a computer
between a cat and a dog
in the state of artificial intelligence.
重大突破的情况下。
greater than 99 percent accuracy.
让它的正确率在99%以上。
put a label to that image --
thousands of other categories as well.
数千种的分类。
at the University of Washington,
计算机视觉模型的
computer vision models.
a prediction of dog or cat,
图片上是猫是狗,
specific breed predictions.
of granularity we have now.
阿拉斯加雪橇犬。
in image classification,
取得了惊人的进步,
when we run our classifier
with a pretty similar prediction.
非常相似的预测。
there is a malamute in the image,
阿拉斯加雪橇犬,
we don't actually know that much
我们并不能真正的了解
called object detection,
目标检测的问题,
and try to find all of the objects,
将一张图上的所有目标物都找出来,
when we run a detector on this image.
运行检测器时所发生的。
with our computer vision algorithms.
做更多的事情。
这里有一只猫和一只狗。
that there's a cat and a dog.
额外的信息。
on top of computer vision,
基于计算机视觉的系统,
or a robotic system,
或者机器人系统,
of information that you want.
you can interact with the physical world.
on object detection,
to process a single image.
speed is so important in this domain,
在这个领域是如此重要,
to process an image.
检测器快10倍,
在它做出预测的时候,
it makes predictions,
by another factor of 10,
at five frames per second.
(它就反应不过来了),
like this driving my car.
来驾驶我的汽车。
running in real time on my laptop.
实时检测系统。
as I move around the frame,
of changes in size,
那么这就是我们真正需要的。
on top of computer vision.
速度提高了1000倍。
a thousand times faster.
object detection systems
on each of these regions,
detections in the image.
运行数千次识别器,
thousands of times over an image,
才能获得检测结果。
to produce detection.
to do all of detection for us.
所有检测的单一网络。
and class probabilities simultaneously.
at an image thousands of times
不需要为了生成检测结果
目标检测的“YOLO”法。
the YOLO method of object detection.
we're not just limited to images;
不仅限于识别图像了,
that cat and dog,
and interact with each other.
COCO数据库上,
像勺子、叉子、碗
like spoon and fork, bowl,
out into the audience
our threshold for detection a little bit,
out in the audience.
is happening in real time
object detection system,
用于任何领域的图像识别。
already using this technology
正在利用这一技术
like medicine, robotics.
对动物数量进行普查,
of animals in Nairobi National Park
of this detection system.
free for anyone to use.
能被更多人使用、也更好用,
even more accessible and usable,
of model optimization,
在手机上进行目标检测了。
running on a phone.
now we have a pretty powerful solution
因为我们在这个低级的
有了一个强大的解决方案,
使用它来做些什么。
and build something with it.
with access to this software,
能够使用这个软件的人了,
人们会用这一技术造出什么来了。
will build with this technology.
ABOUT THE SPEAKER
Joseph Redmon - Computer scientistJoseph Redmon works on the YOLO algorithm, which combines the simple face detection of your phone camera with a cloud-based AI -- in real time.
Why you should listen
Computer scientist Joseph Redmon is working on the YOLO (You Only Look Once) algorithm, which has a simple goal: to deliver image recognition and object detection at a speed that would seem science-fictional only a few years ago. The algorithm looks like the simple face detection of a camera app but with the level complexity of systems like Google's Deep Mind Cloud Vision, using Convolutional Deep Neural Networks to crunch object detection in realtime. It's the kind of technology that will be embedded on all smartphones in the next few years.
Redmon is also internet-famous for his resume.
Joseph Redmon | Speaker | TED.com