Joseph Redmon: How computers learn to recognize objects instantly
喬瑟夫.瑞德蒙: 電腦是如何學習即時辨識物體的?
Joseph Redmon works on the YOLO algorithm, which combines the simple face detection of your phone camera with a cloud-based AI -- in real time. Full bio
Double-click the English transcript below to play the video.
thought that getting a computer
between a cat and a dog
in the state of artificial intelligence.
人工智慧都很難辦到。
greater than 99 percent accuracy.
提升到 99% 以上。
put a label to that image --
並給圖片貼上標籤——
thousands of other categories as well.
許多其它類別的東西。
at the University of Washington,
computer vision models.
a prediction of dog or cat,
在預測這是狗或貓,
specific breed predictions.
of granularity we have now.
in image classification,
已經有了很大的進步,
when we run our classifier
with a pretty similar prediction.
得到的預測也相當類似。
there is a malamute in the image,
we don't actually know that much
還不是很完整。
called object detection,
叫做「物件偵測」,
and try to find all of the objects,
所有物體都找出來,
when we run a detector on this image.
執行偵測軟體時,會發生甚麼事。
with our computer vision algorithms.
幫我們做更多的事。
that there's a cat and a dog.
電腦知道圖片中有一隻貓和狗。
on top of computer vision,
基於電腦視覺系統的實用系統,
or a robotic system,
of information that you want.
you can interact with the physical world.
與實體世界互動的東西。
on object detection,
to process a single image.
speed is so important in this domain,
為什麼這個領域這麼講究速度,
to process an image.
it makes predictions,
在它識別圖像的過程中,
by another factor of 10,
at five frames per second.
就可以識別 5 張圖片。
like this driving my car.
running in real time on my laptop.
即時偵測系統。
as I move around the frame,
它可以很順暢地追蹤著我,
of changes in size,
基於電腦視覺系統的實用系統,
on top of computer vision.
a thousand times faster.
快了 1000 倍。
object detection systems
on each of these regions,
運行分類器軟體,
detections in the image.
thousands of times over an image,
好幾千次的識別指令、
to produce detection.
才有辦法偵測出來。
to do all of detection for us.
網路模型來幫我們完成所有的偵測。
and class probabilities simultaneously.
並同時對可能的結果進行評估。
at an image thousands of times
你就不用一張圖片看了好幾千遍
the YOLO method of object detection.
物件偵測技術為「YOLO」。
we're not just limited to images;
我們不只可以偵測圖片;
that cat and dog,
貓、狗的靜態圖片,
and interact with each other.
互動的動態影片。
like spoon and fork, bowl,
像是湯匙、叉子、碗
out into the audience
our threshold for detection a little bit,
對偵測結果的精確度的要求,
out in the audience.
找到更多東西。
is happening in real time
object detection system,
辨別任何領域的照片。
放在自動駕駛車裡,
already using this technology
已經開始在使用這項技術
like medicine, robotics.
像是醫藥、機械人領域。
of animals in Nairobi National Park
他們要對動物們進行統計調查,
of this detection system.
偵測系統的一部分。
暗黑網路是開放原始碼,
free for anyone to use.
任何人都可以免費使用。
even more accessible and usable,
可以更親民、更好用,
of model optimization,
running on a phone.
now we have a pretty powerful solution
有了相當強力的解決方式,
and build something with it.
with access to this software,
用這個軟體大展身手了,
will build with this technology.
用這項科技所做出來的產品。
ABOUT THE SPEAKER
Joseph Redmon - Computer scientistJoseph Redmon works on the YOLO algorithm, which combines the simple face detection of your phone camera with a cloud-based AI -- in real time.
Why you should listen
Computer scientist Joseph Redmon is working on the YOLO (You Only Look Once) algorithm, which has a simple goal: to deliver image recognition and object detection at a speed that would seem science-fictional only a few years ago. The algorithm looks like the simple face detection of a camera app but with the level complexity of systems like Google's Deep Mind Cloud Vision, using Convolutional Deep Neural Networks to crunch object detection in realtime. It's the kind of technology that will be embedded on all smartphones in the next few years.
Redmon is also internet-famous for his resume.
Joseph Redmon | Speaker | TED.com