Ben Wellington: How we found the worst place to park in New York City -- using big data
本▪威靈頓: 如何找到紐約市最差的停車位?——用大數據說話
Ben Wellington blends his love of statistics, the city, and comedy in his entertaining analysis of the story of New York City, told through data. Full bio
Double-click the English transcript below to play the video.
the infrastructure of New York City.
of our infrastructure.
released in reports by city agencies.
of Transportation will probably tell you
他們掌管著多少英里捷運。
of subway track there are.
大約一萬三千五百輛計程車。
13,500 taxis here in New York City.
where these numbers came from?
那肯定是因為在市政機關的某個人
someone at the city agency
that somebody might want want to know.
that our citizens want to know.
will have numbers like this.
我們的問題都是什麼?
all of our questions?
an infinite number of questions
and I think our policymakers realize that,
我們的政策制定者也知道這點,
signed into law what he called
簽署了一個法令,他稱之為
open data legislation in the country.
開放數據立法。
the city has released 1,000 datasets
the number of cabs,
When is rush hour exactly?
these cabs aren't just numbers,
driving around in our city streets
and I looked at that data,
taxis in New York City throughout the day.
一天中紐約市計程車的平均時速。
to around 5:18 in the morning,
從半夜到凌晨五點十八分,
things turn around,
until about 8:35 in the morning,
11 and a half miles per hour.
miles per hour on our city streets,
保持在十一英里半,
there's no rush hour in New York City.
for a couple of reasons.
this might be pretty interesting to know.
知道這個結論會有意義。
4:45 in the morning and you're all set.
just available, it turns out.
a Freedom of Information Law Request,
找到相關申請表。
Taxi and Limousine Commission website.
你要弄到這張申請表,
you need to go get this form,
did exactly that.
down to our office,
過五小時來拿。」
we'll copy the data and you take it back."
who wants to make the data public,
所以我才能做出這張圖。
and that's where this graph came from.
These GPS recorders -- really cool.
walking around with hard drives
然後通過自己的努力公開,這件事——
to make it public --
you could get to it,
普通市民能得到它,
並不是真正的公開。
walking around with hard drives.
is behind a FOIL Request.
dangerous intersections in New York City
the East side of Manhattan,
has more cyclist accidents.
腳踏車事故更多。
coming off the bridges there.
There's Roosevelt Avenue in Queens.
we need for Vision Zero.
behind this data as well.
and paste data out of a PDF
拷貝和黏貼數據,
比認識這個標誌的人更多,真有趣。
than knew the logo. I like that.
that you just saw was actually on a PDF.
and hundreds of pages of PDF
you would either have to copy and paste
你要不就持續
I'm going to write a program.
他寫了一個程式。
「紐約警署交通事故數據OK蹦」,
and it would download PDFs.
if it found a PDF, it would download it
如果找到一個PDF文檔,就下載下來,
some PDF-scraping program,
and then people could make maps like that.
人們就可以製作這些地圖。
the fact that we have access to it --
is a row in this table.
have access to that is great,
write PDF scrapers.
of our citizens' time,
the de Blasio administration
a few months ago,
actually have access to it,
still entombed in PDF.
is still only available in PDF.
our own city budget.
right now in PDF form.
that can't analyze it --
who vote for the budget
the budget that they are voting for.
他們要為之投票的市政預算的。
a little better than that as well.
that's not hidden in PDFs.
in New York City.
of fecal coliform,
in each of our waterways.
the dirtier the water,
the small circles are cleaner.
小圓圈代表乾淨的水。
by the city over the last five years.
in general, dirtier.
And I learned a few things from this.
我從中學到了幾件事情。
that ends in "creek" or "canal."
或「xx運河」的地方游泳。
the dirtiest waterway in New York City,
the Coney Island you swim in, luckily.
of samples taken over the last five years
過去五年的採樣中有94%
to swim in the water.
that you're going to see
the front page on nyc.gov.
to that data is awesome.
on the open data portal.
a year or a few months.
只有一年內或幾個月的數據。
of Environmental Protection's website.
sheet, and each Excel sheet is different.
而每個Excel文件都是不一樣的。
you copy, paste, reorganize.
需要複製、黏貼、還有重新整理。
and that's great, but once again,
但我要再次重申,
我們可以標準化。
as a city, we can normalize things.
索克拉塔公司建立的網站
there's this website that Socrata makes
that don't suffer
CSV、PDF或Excel文件都可以。
and that's great.
be it CSV or PDF or Excel document.
you can download the data that way.
用不同的代碼來表示地址。
codes their addresses differently.
intersection street,
building address.
even when we have this portal,
normalizing our address fields.
of our citizens' time.
we can get more maps like this.
我們就能做出更多這樣的地圖。
in New York City,
hydrants in terms of parking tickets.
消防栓位置圖。
and I really like this map.
我也真的喜歡這張圖。
on the Upper East Side.
you park, you will get a hydrant ticket.
grossing hydrants in all of New York City,
吃到違章停車罰單的兩個消防栓的位置,
55,000 dollars a year in parking tickets.
to me when I noticed it,
what you had is a hydrant
結果發現消防栓
a curb extension,
space to walk on,
and the hydrant --
painted there beautifully for them.
disagreed with this designation
who found a parking ticket.
Street View car driving by
on I Quant NY, and the DOT responded,
以及“I Quant NY”上,
any complaints about this location,
並做出適當的改善措施。”
and make any appropriate alterations."
typical government response,
something incredible happened.
發生了意料之外的事情。
the future of open data,
ticketed, and it was confusing,
一直讓人吃罰單,
they told the city, and within a few weeks
報告市政機關,又過了幾週時間,
see open data as being a watchdog.
公開數據讓市民變成政府的監視者,
to be better partners for government,
being FOILed over and over again,
a sign that it should be made public.
因為反覆申請就是需要公開的一种信號。
releasing a PDF,
to post it with the underlying data,
要求他們發佈隱藏的數據,
is coming from somewhere.
coming from somewhere,
some open data standards.
here in New York City.
normalizing our addresses.
a leader in open data,
and set an open data standard,
建立公開數據的標準,
and maybe the federal government,
但別的國家也未嘗不會追隨。
where you could write one program
而是指日可待的事實。
We're actually quite close.
empowering with this?
and it's not just Chris Whong.
going on in New York City right now,
attending these meetups.
and on weekends,
to look at open data
citygram.nyc
released something called citygram.nyc
to 311 complaints
or around your office.
you get local complaints.
that are after these things.
the students I teach at Pratt.
也在做同樣的事。
set of backgrounds.
and the ability of our citizens
and make our city even better,
or one parking spot at a time.
ABOUT THE SPEAKER
Ben Wellington - Data scientistBen Wellington blends his love of statistics, the city, and comedy in his entertaining analysis of the story of New York City, told through data.
Why you should listen
Ben Wellington runs the I Quant NY blog, in which he crunches city-released data to find out what's really going on in the Big Apple. To date he has tackled topics such as measles outbreaks in New York City schools, analyzed how companies like Airbnb are really doing in NYC, and asked questions such as "does gentrification cause a reduction in laundromats?" (Answer: inconclusive.)
Ben is a visiting assistant professor in the City & Regional Planning program at the Pratt Institute in Brooklyn; his day job involves working as a quantitative analyst at the investment management firm, Two Sigma. A budding comedian and performer, he also teaches team building workshops through Cherub Improv, a non-profit that uses improv comedy for social good.
Ben Wellington | Speaker | TED.com