Ben Wellington: How we found the worst place to park in New York City -- using big data
Ben Wellington blends his love of statistics, the city, and comedy in his entertaining analysis of the story of New York City, told through data.
of New York City in numbers.
the infrastructure of New York City.
of our infrastructure.
in reports by the local government.
released in reports by city agencies.
can tell you, for example,
of Transportation will probably tell you
with kilometers of subway lines.
of subway track there are.
of the Taxi and Limousine Commission,
drive in New York.
13,500 taxis here in New York City.
those numbers come from?
where these numbers came from?
someone at the city agency
which numbers interest us.
that somebody might want to know.
that our citizens want to know.
will have numbers like this.
all of our questions?
an infinite number of questions
ask questions about our city.
and the policymakers realize that too,
and I think our policymakers realize that,
signed a law, which he described
signed into law what he called
open data legislation in the country.
'open data' legislation in the country.
the city has released 1,000 datasets
the city released 1,000 datasets
the number of cabs,
When is rush hour exactly?
isn't just a number,
these cabs aren't just numbers,
driving through our streets
driving around in our city streets
and I looked at that data,
made for taxis in New York.
taxis in New York City throughout the day.
until 5:18 in the morning
to around 5:18 in the morning,
and then declines.
things turn around,
until about 8:35 in the morning,
11 and a half miles per hour.
18.5 km/h on our streets,
miles per hour on our city streets,
there's no rush hour in New York City.
for a couple of reasons.
that is interesting.
this might be pretty interesting to know.
need to set for 4:45.
4:45 in the morning and you're all set.
just available, it turns out.
through the freedom of information law,
a Freedom of Information Law Request,
of the Taxi and Limousine Commission.
Taxi and Limousine Commission website.
you need a form like this.
you need to go get this form,
did exactly that.
to our office.
down to our office,
we'll copy the data and you take it back."
and you can pick it up."
who wants to share that data publicly,
who wants to make the data public,
which made this graph possible.
and that's where this graph came from.
those GPS recorders -- really perfect.
These GPS recorders -- really cool.
walking around with hard drives
have to pick up in order to share it --
to make it public --
you could get to it,
walking around with a hard drive.
walking around with hard drives.
you need a freedom of information request.
is behind a FOIL Request.
dangerous intersections in New York City
of the most dangerous intersections,
the East side of Manhattan,
the east of Manhattan,
has more cyclist accidents.
there are more bicycle accidents.
coming off the bridges.
coming off the bridges there.
other dangerous spots,
and Roosevelt Avenue in Queens.
There's Roosevelt Avenue in Queens.
we are seeking for Vision Zero.
we need for Vision Zero.
behind this data as well.
and paste data out of a PDF
to copy from a PDF
than knew the logo. I like that.
than know the logo. Nice.
that you just saw was actually on a PDF.
and hundreds of pages of PDF
you would either have to copy and paste
spend hundreds of hours copying,
but write a program.
I'm going to write a program.
crash data band-aid'.
and it would download PDFs.
from the NYPD.
if it found a PDF, it would download it
downloads any PDFs it finds,
some PDF-scraping program,
so that you get plain text,
and then people could make maps like that.
so that you can make maps like this.
the fact that we have access to it --
are available to us --
is a row in this table.
a row in this table.
have access to that is great,
write PDF scrapers.
write processing programs.
of our citizens' time,
Mayor de Blasio
the de Blasio administration
a few months ago,
actually have access to it,
still wrapped up in PDF.
still entombed in PDF.
is still only available in PDF.
for example, only in PDF.
our own city budget.
but also the city's budget.
right now in PDF form.
can only be read in a PDF.
that can't analyze it --
who decide on the budget
who vote for the budget
the budget that they are voting for.
so cannot analyze what they are deciding on.
a little better than that as well.
could surely do better as well.
that's not hidden in PDFs.
in New York City.
in the city of New York.
of fecal coliform,
in each of our waterways.
can measure in our waterways.
the dirtier the water,
the dirtier the water.
the small circles are cleaner.
smaller ones point to cleaner water.
by the city over the last five years.
collected over the last five years.
in general, dirtier.
And I learned a few things from this.
ends in 'creek' or 'canal'.
that ends in "creek" or "canal."
the dirtiest waterway in New York City,
the filthiest waterway in the city of New York,
the Coney Island you swim in, luckily.
not where we swim at Coney Island --
of samples taken over the last five years
of the samples from Coney Island Creek
to swim in the water.
to swim there.
that you're going to see
that they don't brag about
the front page on nyc.gov.
that we have that data.
to that data is awesome.
open data portal.
on the open data portal.
a year and a few months.
a year or a few months.
of the Department of Environmental Protection.
of Environmental Protection's website.
(Laughter)
sheet, and each Excel sheet is different.
You have to copy and reorganize.
you copy, paste, reorganize.
you can make a map,
and that's great, but once again,
by smoothing it out.
as a city, we can normalize things.
because Socrata has a website:
there's this website that Socrata makes
that don't suffer
download, such as CSV, PDF, or Excel.
and that's great.
be it CSV or PDF or Excel document.
you can download it however you like.
you can download the data that way.
codes their addresses differently.
uses for addresses.
intersection street,
building address.
even when we have this portal,
even with this portal,
normalizing our address fields.
of our citizens' time.
we can get more maps like this.
can make, like this one.
in New York City,
of fire hydrants in New York,
hydrants in terms of parking tickets.
that generate the most parking tickets.
and I really like this map.
which I really like.
on the Upper East Side.
on the Upper East Side.
you park, you will get a hydrant ticket.
you get a hydrant ticket everywhere.
grossing hydrants in all of New York City,
that bring in the most in New York,
in from parking tickets.
55,000 dollars a year in parking tickets.
to me when I noticed it,
that there was a hydrant here
what you had is a hydrant
a curb extension,
space to walk on,
and the hydrant --
parking spot painted.
painted there beautifully for them.
but the police thought otherwise
disagreed with this designation
who noticed this.
who found a parking ticket.
Google's Street View car
Street View car driving by
and got a reply from the Department of Transportation,
on I Quant NY, and the DOT responded,
have had about this location,
any complaints about this location,
and make any appropriate alterations."
and make any necessary changes."
typical government response,
I thought at the time,
something incredible happened.
something incredible had happened.
painted on that spot,
the future of open data,
the future of open data.
handed out at a confusing spot,
ticketed, and it was confusing,
they told the city, and within a few weeks
which he reported to the city,
the problem was solved.
see open data as being a watchdog.
open data as a threat.
to be better partners for government,
think along with the government.
being FOILed over and over again,
are requested again and again,
people apparently want to know that.
a sign that it should be made public.
releases a PDF,
releasing a PDF,
to provide the underlying data,
to post it with the underlying data,
is coming from somewhere.
coming from somewhere,
some open data standards.
here in New York City.
normalizing our addresses.
New York leads the way in open data,
a leader in open data,
an open data standard,
and set an open data standard,
even the federal government.
and maybe the federal government,
can write one program
where you could write one program
to make it understandable.
We're actually quite close.
empowering with this?
and it's not just Chris Whong.
citizen initiatives in New York,
going on in New York City right now,
attending these meetups.
after work hours and on weekends
and on weekends,
to look at open data
unveiled citygram.nyc
released something called citygram.nyc
to 311 complaints
or around your office.
you get local complaints.
complaints from the area.
that are after these things.
are working on this.
like my students at Pratt.
the students I teach at Pratt.
set of backgrounds.
and the ability of our citizens
and the abilities
and make our city even better,
to improve our city.
or one parking spot at a time.
contributes to that.
ABOUT THE SPEAKER
Ben Wellington - Data scientist
Ben Wellington blends his love of statistics, the city, and comedy in his entertaining analysis of the story of New York City, told through data.
Why you should listen
Ben Wellington runs the I Quant NY blog, in which he crunches city-released data to find out what's really going on in the Big Apple. To date he has tackled topics such as measles outbreaks in New York City schools, analyzed how companies like Airbnb are really doing in NYC, and asked questions such as "does gentrification cause a reduction in laundromats?" (Answer: inconclusive.)
Ben is a visiting assistant professor in the City & Regional Planning program at the Pratt Institute in Brooklyn; his day job involves working as a quantitative analyst at the investment management firm Two Sigma. A budding comedian and performer, he also teaches team-building workshops through Cherub Improv, a non-profit that uses improv comedy for social good.