Abe Davis: New video technology that reveals an object's hidden properties
亞伯 戴維斯: 揭示物體隱藏特質的新攝影科技
Computer vision expert Abe Davis pioneers methods to extract audio from silent digital videos, even footage shot on ordinary consumer cameras. Full bio
Double-click the English transcript below to play the video.
as a very visual thing.
「動作」是非常視覺的。
or gesture with my hands while I speak,
或在說話的時候使用手勢,
that's too subtle for the human eye,
是人類的眼睛無法看見的。
even when humans can't.
of a person's wrist,
of a sleeping infant,
that these were videos,
at two regular images,
你只是在看兩張非常普通的圖片。
almost completely still.
of subtle motion going on here,
the wrist on the left,
the infant on the right,
and fall of her chest
a lot of significance,
too subtle for us to see,
以至於我們看不到。
what they call a motion microscope,
研究出了動作顯微鏡。
these subtle motions in video
become large enough for us to see.
on the left video,
使用那個軟體,
this person's heart rate.
on the right video,
that this infant takes,
to monitor her breathing.
不用接觸就能觀察嬰兒呼吸的探測器。
because it takes these phenomena
它記錄的現象
to experience through touch
and non-invasively.
with the folks that created that software,
和這些開發軟體的人一起工作,
that we can use software
as a way to extend our sense of touch.
with our ability to hear?
做出擴展我們聽覺的軟體呢?
to capture the vibrations of sound,
從而獲得聲音的振動,
into a microphone?
轉化為聲音進入麥克風呢?
in perspective for you.
work by converting the motion
into an electrical signal,
to move readily with sound
and interpreted as audio.
too subtle and too fast for us to see.
以至於我們不能看不見。
with a high-speed camera
to extract tiny motions
what sounds created them?
並且搞清楚是什麼聲音創造了振動呢?
into visual microphones from a distance.
將可視物體轉換到可視話筒中。
that you see on the right
played this sound.
of frames per second,
just sitting there doing nothing,
by about a micrometer.
只移動了樹葉大約一微米的距離,
a hundredth and a thousandth
perceptually invisible.
can be perceptually invisible
seemingly still video
out of so little motion?
得到如此多訊息的呢?
move by just a single micrometer,
只是移動了一點點距離,
by just a thousandth of a pixel.
千分之一個像素的距離。
of pixels in it,
of the tiny motions that we see
to something pretty significant.
when we figured this out.
我們研究出來時高興得都要瘋了。
a pretty important piece of the puzzle.
that affect when and how well
and the lens that you use;
and how loud your sound is.
with our early experiments,
any of these factors wrong,
what the problem was.
experiments looked like this.
see our high-speed camera,
by these bright lamps.
very careful in these early experiments,
Little lamb! Little lamb!
小綿羊!小綿羊!
looks completely ridiculous.
we tried this on. (Laughter)
to recover this sound.
Little lamb! Little lamb!
we recovered intelligible human speech
我們從一個物體靜止的影像中,
to modify the experiment,
or moving the object further away,
the limits of our technique,
to a bag of chips,
再一次向一包洋芋片說話,
about 15 feet away,
by only natural sunlight.
from inside, next to the bag of chips.
whose fleece was white as snow,
牠的毛白得像雪一樣。
that lamb was sure to go.
小綿羊就跟到哪裏。
to recover from our silent video
whose fleece was white as snow,
牠的毛白得像雪一樣。
that lamb was sure to go.
小綿羊就跟到哪裏。
that we can push these limits as well.
plugged into a laptop computer,
the music that was playing on that laptop
在手提電腦上所放出的聲音,
by changing the hardware that we use.
I've shown you so far
a high-speed camera,
about a 100 times faster
to use this technique
of what's called a rolling shutter.
像百葉窗一樣記錄東西優點來記錄。
record images one row at a time,
一段時間記錄一排的圖像,
during the recording of a single image,
只在被記錄的圖像中移動,
between each row,
is that by analyzing these artifacts,
using a modified version of our algorithm.
music from before,
store-bought camera,
the sound that we recovered,
distorted this time,
recognize the music.
is that we were able to do this
that you could literally run out
about surveillance.
this technology to spy on someone.
a lot of very mature technology
from a distance for decades.
to picture the vibrations of an object,
through which to look at the world,
that cause an object to vibrate,
the ways that we use video,
我們應用影片的方法。
to look at things,
that we learn about the world:
still won't let us do,
just a few months ago,
I've shown it to a public audience.
to use the vibrations in a video
我們會用影片之中的振動,
that will let us interact with them
在某種程度上這樣可以使我們與物體互動,
in the shape of a human,
with just a regular camera.
about this camera.
with my cell phone before.
on the surface where it's resting
of regular video,
the vibrations in that video
and material properties of our object,
to create something new and interactive.
去創造新穎和互動性的東西。
and it's not a video,
with the object.
that we've never seen before,
five seconds of regular video.
way to look at the world,
how objects will respond
looking at an old bridge
how would that bridge hold up
that you probably want to answer
across that bridge.
limitations to this technique,
with the visual microphone,
in a lot of situations
here's a video that I captured
to create this simulation.
從而創造出這樣的模擬情況。
to a film director,
in a shot after it's been recorded.
at a hanging curtain,
any motion in this video,
imperceptible motions and vibrations
to create this simulation.
this kind of interactivity
and 3D models,
from real objects in the real world
a lot of potential.
who worked with me on these projects.
與我共同研究這些計劃。
is only the beginning.
with this kind of imaging,
with common, accessible technology.
really exciting to explore
可以告訴我們關於這個世界
ABOUT THE SPEAKER
Abe Davis - Computer scientistComputer vision expert Abe Davis pioneers methods to extract audio from silent digital videos, even footage shot on ordinary consumer cameras.
Why you should listen
MIT PhD student, computer vision wizard and rap artist Abe Davis has co-created the world’s most improbable audio instrument. In 2014, Davis and his collaborators debuted the “visual microphone,” an algorithm that samples the sympathetic vibrations of ordinary objects (such as a potato chip bag) from ordinary high-speed video footage and transduces them into intelligible audio tracks.
Davis is also the author of Caperture, a 3D-imaging app designed to create and share 3D images on any compatible smartphone.
Abe Davis | Speaker | TED.com