ABOUT THE SPEAKER
Nicholas Christakis - Physician, social scientist
Nicholas Christakis explores how the large-scale, face-to-face social networks in which we are embedded affect our lives, and what we can do to take advantage of this fact.

Why you should listen

People aren't merely social animals in the usual sense, for we don't just live in groups. We live in networks -- and we have done so ever since we emerged from the African savannah. Via intricately branching paths tracing out cascading family connections, friendship ties, and work relationships, we are interconnected to hundreds or even thousands of specific people, most of whom we do not know. We affect them and they affect us.

Nicholas Christakis' work examines the biological, psychological, sociological, and mathematical rules that govern how we form these social networks, and the rules that govern how they shape our lives. His work shows how phenomena as diverse as obesity, smoking, emotions, ideas, germs, and altruism can spread through our social ties, and how genes can partially underlie our creation of social ties to begin with. His work also sheds light on how we might take advantage of an understanding of social networks to make the world a better place.

At Yale, Christakis is a Professor of Social and Natural Science, and he directs a diverse research group in the field of biosocial science, primarily investigating social networks. His popular undergraduate course "Health of the Public" is available as a podcast. His book, Connected, co-authored with James H. Fowler, appeared in 2009, and has been translated into 20 languages. In 2009, he was named by Time magazine to its annual list of the 100 most influential people in the world, and also, in 2009 and 2010, by Foreign Policy magazine to its list of 100 top global thinkers

More profile about the speaker
Nicholas Christakis | Speaker | TED.com
TED@Cannes

Nicholas Christakis: How social networks predict epidemics

Filmed:
669,862 views

After mapping humans' intricate social networks, Nicholas Christakis and colleague James Fowler began investigating how this information could better our lives. Now, he reveals his hot-off-the-press findings: These networks can be used to detect epidemics earlier than ever, from the spread of innovative ideas to risky behaviors to viruses (like H1N1).
- Physician, social scientist
Nicholas Christakis explores how the large-scale, face-to-face social networks in which we are embedded affect our lives, and what we can do to take advantage of this fact. Full bio

Double-click the English transcript below to play the video.

00:15
For the last 10 years, I've been spending my time trying to figure out
0
0
3000
00:18
how and why human beings
1
3000
2000
00:20
assemble themselves into social networks.
2
5000
3000
00:23
And the kind of social network I'm talking about
3
8000
2000
00:25
is not the recent online variety,
4
10000
2000
00:27
but rather, the kind of social networks
5
12000
2000
00:29
that human beings have been assembling for hundreds of thousands of years,
6
14000
3000
00:32
ever since we emerged from the African savannah.
7
17000
3000
00:35
So, I form friendships and co-worker
8
20000
2000
00:37
and sibling and relative relationships with other people
9
22000
3000
00:40
who in turn have similar relationships with other people.
10
25000
2000
00:42
And this spreads on out endlessly into a distance.
11
27000
3000
00:45
And you get a network that looks like this.
12
30000
2000
00:47
Every dot is a person.
13
32000
2000
00:49
Every line between them is a relationship between two people --
14
34000
2000
00:51
different kinds of relationships.
15
36000
2000
00:53
And you can get this kind of vast fabric of humanity,
16
38000
3000
00:56
in which we're all embedded.
17
41000
2000
00:58
And my colleague, James Fowler and I have been studying for quite sometime
18
43000
3000
01:01
what are the mathematical, social,
19
46000
2000
01:03
biological and psychological rules
20
48000
3000
01:06
that govern how these networks are assembled
21
51000
2000
01:08
and what are the similar rules
22
53000
2000
01:10
that govern how they operate, how they affect our lives.
23
55000
3000
01:13
But recently, we've been wondering
24
58000
2000
01:15
whether it might be possible to take advantage of this insight,
25
60000
3000
01:18
to actually find ways to improve the world,
26
63000
2000
01:20
to do something better,
27
65000
2000
01:22
to actually fix things, not just understand things.
28
67000
3000
01:25
So one of the first things we thought we would tackle
29
70000
3000
01:28
would be how we go about predicting epidemics.
30
73000
3000
01:31
And the current state of the art in predicting an epidemic --
31
76000
2000
01:33
if you're the CDC or some other national body --
32
78000
3000
01:36
is to sit in the middle where you are
33
81000
2000
01:38
and collect data
34
83000
2000
01:40
from physicians and laboratories in the field
35
85000
2000
01:42
that report the prevalence or the incidence of certain conditions.
36
87000
3000
01:45
So, so and so patients have been diagnosed with something,
37
90000
3000
01:48
or other patients have been diagnosed,
38
93000
2000
01:50
and all these data are fed into a central repository, with some delay.
39
95000
3000
01:53
And if everything goes smoothly,
40
98000
2000
01:55
one to two weeks from now
41
100000
2000
01:57
you'll know where the epidemic was today.
42
102000
3000
02:00
And actually, about a year or so ago,
43
105000
2000
02:02
there was this promulgation
44
107000
2000
02:04
of the idea of Google Flu Trends, with respect to the flu,
45
109000
3000
02:07
where by looking at people's searching behavior today,
46
112000
3000
02:10
we could know where the flu --
47
115000
2000
02:12
what the status of the epidemic was today,
48
117000
2000
02:14
what's the prevalence of the epidemic today.
49
119000
3000
02:17
But what I'd like to show you today
50
122000
2000
02:19
is a means by which we might get
51
124000
2000
02:21
not just rapid warning about an epidemic,
52
126000
3000
02:24
but also actually
53
129000
2000
02:26
early detection of an epidemic.
54
131000
2000
02:28
And, in fact, this idea can be used
55
133000
2000
02:30
not just to predict epidemics of germs,
56
135000
3000
02:33
but also to predict epidemics of all sorts of kinds.
57
138000
3000
02:37
For example, anything that spreads by a form of social contagion
58
142000
3000
02:40
could be understood in this way,
59
145000
2000
02:42
from abstract ideas on the left
60
147000
2000
02:44
like patriotism, or altruism, or religion
61
149000
3000
02:47
to practices
62
152000
2000
02:49
like dieting behavior, or book purchasing,
63
154000
2000
02:51
or drinking, or bicycle-helmet [and] other safety practices,
64
156000
3000
02:54
or products that people might buy,
65
159000
2000
02:56
purchases of electronic goods,
66
161000
2000
02:58
anything in which there's kind of an interpersonal spread.
67
163000
3000
03:01
A kind of a diffusion of innovation
68
166000
2000
03:03
could be understood and predicted
69
168000
2000
03:05
by the mechanism I'm going to show you now.
70
170000
3000
03:08
So, as all of you probably know,
71
173000
2000
03:10
the classic way of thinking about this
72
175000
2000
03:12
is the diffusion-of-innovation,
73
177000
2000
03:14
or the adoption curve.
74
179000
2000
03:16
So here on the Y-axis, we have the percent of the people affected,
75
181000
2000
03:18
and on the X-axis, we have time.
76
183000
2000
03:20
And at the very beginning, not too many people are affected,
77
185000
3000
03:23
and you get this classic sigmoidal,
78
188000
2000
03:25
or S-shaped, curve.
79
190000
2000
03:27
And the reason for this shape is that at the very beginning,
80
192000
2000
03:29
let's say one or two people
81
194000
2000
03:31
are infected, or affected by the thing
82
196000
2000
03:33
and then they affect, or infect, two people,
83
198000
2000
03:35
who in turn affect four, eight, 16 and so forth,
84
200000
3000
03:38
and you get the epidemic growth phase of the curve.
85
203000
3000
03:41
And eventually, you saturate the population.
86
206000
2000
03:43
There are fewer and fewer people
87
208000
2000
03:45
who are still available that you might infect,
88
210000
2000
03:47
and then you get the plateau of the curve,
89
212000
2000
03:49
and you get this classic sigmoidal curve.
90
214000
3000
03:52
And this holds for germs, ideas,
91
217000
2000
03:54
product adoption, behaviors,
92
219000
2000
03:56
and the like.
93
221000
2000
03:58
But things don't just diffuse in human populations at random.
94
223000
3000
04:01
They actually diffuse through networks.
95
226000
2000
04:03
Because, as I said, we live our lives in networks,
96
228000
3000
04:06
and these networks have a particular kind of a structure.
97
231000
3000
04:09
Now if you look at a network like this --
98
234000
2000
04:11
this is 105 people.
99
236000
2000
04:13
And the lines represent -- the dots are the people,
100
238000
2000
04:15
and the lines represent friendship relationships.
101
240000
2000
04:17
You might see that people occupy
102
242000
2000
04:19
different locations within the network.
103
244000
2000
04:21
And there are different kinds of relationships between the people.
104
246000
2000
04:23
You could have friendship relationships, sibling relationships,
105
248000
3000
04:26
spousal relationships, co-worker relationships,
106
251000
3000
04:29
neighbor relationships and the like.
107
254000
3000
04:32
And different sorts of things
108
257000
2000
04:34
spread across different sorts of ties.
109
259000
2000
04:36
For instance, sexually transmitted diseases
110
261000
2000
04:38
will spread across sexual ties.
111
263000
2000
04:40
Or, for instance, people's smoking behavior
112
265000
2000
04:42
might be influenced by their friends.
113
267000
2000
04:44
Or their altruistic or their charitable giving behavior
114
269000
2000
04:46
might be influenced by their coworkers,
115
271000
2000
04:48
or by their neighbors.
116
273000
2000
04:50
But not all positions in the network are the same.
117
275000
3000
04:53
So if you look at this, you might immediately grasp
118
278000
2000
04:55
that different people have different numbers of connections.
119
280000
3000
04:58
Some people have one connection, some have two,
120
283000
2000
05:00
some have six, some have 10 connections.
121
285000
3000
05:03
And this is called the "degree" of a node,
122
288000
2000
05:05
or the number of connections that a node has.
123
290000
2000
05:07
But in addition, there's something else.
124
292000
2000
05:09
So, if you look at nodes A and B,
125
294000
2000
05:11
they both have six connections.
126
296000
2000
05:13
But if you can see this image [of the network] from a bird's eye view,
127
298000
3000
05:16
you can appreciate that there's something very different
128
301000
2000
05:18
about nodes A and B.
129
303000
2000
05:20
So, let me ask you this -- I can cultivate this intuition by asking a question --
130
305000
3000
05:23
who would you rather be
131
308000
2000
05:25
if a deadly germ was spreading through the network, A or B?
132
310000
3000
05:28
(Audience: B.) Nicholas Christakis: B, it's obvious.
133
313000
2000
05:30
B is located on the edge of the network.
134
315000
2000
05:32
Now, who would you rather be
135
317000
2000
05:34
if a juicy piece of gossip were spreading through the network?
136
319000
3000
05:37
A. And you have an immediate appreciation
137
322000
3000
05:40
that A is going to be more likely
138
325000
2000
05:42
to get the thing that's spreading and to get it sooner
139
327000
3000
05:45
by virtue of their structural location within the network.
140
330000
3000
05:48
A, in fact, is more central,
141
333000
2000
05:50
and this can be formalized mathematically.
142
335000
3000
05:53
So, if we want to track something
143
338000
2000
05:55
that was spreading through a network,
144
340000
3000
05:58
what we ideally would like to do is to set up sensors
145
343000
2000
06:00
on the central individuals within the network,
146
345000
2000
06:02
including node A,
147
347000
2000
06:04
monitor those people that are right there in the middle of the network,
148
349000
3000
06:07
and somehow get an early detection
149
352000
2000
06:09
of whatever it is that is spreading through the network.
150
354000
3000
06:12
So if you saw them contract a germ or a piece of information,
151
357000
3000
06:15
you would know that, soon enough,
152
360000
2000
06:17
everybody was about to contract this germ
153
362000
2000
06:19
or this piece of information.
154
364000
2000
06:21
And this would be much better
155
366000
2000
06:23
than monitoring six randomly chosen people,
156
368000
2000
06:25
without reference to the structure of the population.
157
370000
3000
06:28
And in fact, if you could do that,
158
373000
2000
06:30
what you would see is something like this.
159
375000
2000
06:32
On the left-hand panel, again, we have the S-shaped curve of adoption.
160
377000
3000
06:35
In the dotted red line, we show
161
380000
2000
06:37
what the adoption would be in the random people,
162
382000
2000
06:39
and in the left-hand line, shifted to the left,
163
384000
3000
06:42
we show what the adoption would be
164
387000
2000
06:44
in the central individuals within the network.
165
389000
2000
06:46
On the Y-axis is the cumulative instances of contagion,
166
391000
2000
06:48
and on the X-axis is the time.
167
393000
2000
06:50
And on the right-hand side, we show the same data,
168
395000
2000
06:52
but here with daily incidence.
169
397000
2000
06:54
And what we show here is -- like, here --
170
399000
2000
06:56
very few people are affected, more and more and more and up to here,
171
401000
2000
06:58
and here's the peak of the epidemic.
172
403000
2000
07:00
But shifted to the left is what's occurring in the central individuals.
173
405000
2000
07:02
And this difference in time between the two
174
407000
3000
07:05
is the early detection, the early warning we can get,
175
410000
3000
07:08
about an impending epidemic
176
413000
2000
07:10
in the human population.
177
415000
2000
07:12
The problem, however,
178
417000
2000
07:14
is that mapping human social networks
179
419000
2000
07:16
is not always possible.
180
421000
2000
07:18
It can be expensive, not feasible,
181
423000
2000
07:20
unethical,
182
425000
2000
07:22
or, frankly, just not possible to do such a thing.
183
427000
3000
07:25
So, how can we figure out
184
430000
2000
07:27
who the central people are in a network
185
432000
2000
07:29
without actually mapping the network?
186
434000
3000
07:32
What we came up with
187
437000
2000
07:34
was an idea to exploit an old fact,
188
439000
2000
07:36
or a known fact, about social networks,
189
441000
2000
07:38
which goes like this:
190
443000
2000
07:40
Do you know that your friends
191
445000
2000
07:42
have more friends than you do?
192
447000
3000
07:45
Your friends have more friends than you do,
193
450000
3000
07:48
and this is known as the friendship paradox.
194
453000
2000
07:50
Imagine a very popular person in the social network --
195
455000
2000
07:52
like a party host who has hundreds of friends --
196
457000
3000
07:55
and a misanthrope who has just one friend,
197
460000
2000
07:57
and you pick someone at random from the population;
198
462000
3000
08:00
they were much more likely to know the party host.
199
465000
2000
08:02
And if they nominate the party host as their friend,
200
467000
2000
08:04
that party host has a hundred friends,
201
469000
2000
08:06
therefore, has more friends than they do.
202
471000
3000
08:09
And this, in essence, is what's known as the friendship paradox.
203
474000
3000
08:12
The friends of randomly chosen people
204
477000
3000
08:15
have higher degree, and are more central
205
480000
2000
08:17
than the random people themselves.
206
482000
2000
08:19
And you can get an intuitive appreciation for this
207
484000
2000
08:21
if you imagine just the people at the perimeter of the network.
208
486000
3000
08:24
If you pick this person,
209
489000
2000
08:26
the only friend they have to nominate is this person,
210
491000
3000
08:29
who, by construction, must have at least two
211
494000
2000
08:31
and typically more friends.
212
496000
2000
08:33
And that happens at every peripheral node.
213
498000
2000
08:35
And in fact, it happens throughout the network as you move in,
214
500000
3000
08:38
everyone you pick, when they nominate a random --
215
503000
2000
08:40
when a random person nominates a friend of theirs,
216
505000
3000
08:43
you move closer to the center of the network.
217
508000
3000
08:46
So, we thought we would exploit this idea
218
511000
3000
08:49
in order to study whether we could predict phenomena within networks.
219
514000
3000
08:52
Because now, with this idea
220
517000
2000
08:54
we can take a random sample of people,
221
519000
2000
08:56
have them nominate their friends,
222
521000
2000
08:58
those friends would be more central,
223
523000
2000
09:00
and we could do this without having to map the network.
224
525000
3000
09:03
And we tested this idea with an outbreak of H1N1 flu
225
528000
3000
09:06
at Harvard College
226
531000
2000
09:08
in the fall and winter of 2009, just a few months ago.
227
533000
3000
09:11
We took 1,300 randomly selected undergraduates,
228
536000
3000
09:14
we had them nominate their friends,
229
539000
2000
09:16
and we followed both the random students and their friends
230
541000
2000
09:18
daily in time
231
543000
2000
09:20
to see whether or not they had the flu epidemic.
232
545000
3000
09:23
And we did this passively by looking at whether or not they'd gone to university health services.
233
548000
3000
09:26
And also, we had them [actively] email us a couple of times a week.
234
551000
3000
09:29
Exactly what we predicted happened.
235
554000
3000
09:32
So the random group is in the red line.
236
557000
3000
09:35
The epidemic in the friends group has shifted to the left, over here.
237
560000
3000
09:38
And the difference in the two is 16 days.
238
563000
3000
09:41
By monitoring the friends group,
239
566000
2000
09:43
we could get 16 days advance warning
240
568000
2000
09:45
of an impending epidemic in this human population.
241
570000
3000
09:48
Now, in addition to that,
242
573000
2000
09:50
if you were an analyst who was trying to study an epidemic
243
575000
3000
09:53
or to predict the adoption of a product, for example,
244
578000
3000
09:56
what you could do is you could pick a random sample of the population,
245
581000
3000
09:59
also have them nominate their friends and follow the friends
246
584000
3000
10:02
and follow both the randoms and the friends.
247
587000
3000
10:05
Among the friends, the first evidence you saw of a blip above zero
248
590000
3000
10:08
in adoption of the innovation, for example,
249
593000
3000
10:11
would be evidence of an impending epidemic.
250
596000
2000
10:13
Or you could see the first time the two curves diverged,
251
598000
3000
10:16
as shown on the left.
252
601000
2000
10:18
When did the randoms -- when did the friends take off
253
603000
3000
10:21
and leave the randoms,
254
606000
2000
10:23
and [when did] their curve start shifting?
255
608000
2000
10:25
And that, as indicated by the white line,
256
610000
2000
10:27
occurred 46 days
257
612000
2000
10:29
before the peak of the epidemic.
258
614000
2000
10:31
So this would be a technique
259
616000
2000
10:33
whereby we could get more than a month-and-a-half warning
260
618000
2000
10:35
about a flu epidemic in a particular population.
261
620000
3000
10:38
I should say that
262
623000
2000
10:40
how far advanced a notice one might get about something
263
625000
2000
10:42
depends on a host of factors.
264
627000
2000
10:44
It could depend on the nature of the pathogen --
265
629000
2000
10:46
different pathogens,
266
631000
2000
10:48
using this technique, you'd get different warning --
267
633000
2000
10:50
or other phenomena that are spreading,
268
635000
2000
10:52
or frankly, on the structure of the human network.
269
637000
3000
10:55
Now in our case, although it wasn't necessary,
270
640000
3000
10:58
we could also actually map the network of the students.
271
643000
2000
11:00
So, this is a map of 714 students
272
645000
2000
11:02
and their friendship ties.
273
647000
2000
11:04
And in a minute now, I'm going to put this map into motion.
274
649000
2000
11:06
We're going to take daily cuts through the network
275
651000
2000
11:08
for 120 days.
276
653000
2000
11:10
The red dots are going to be cases of the flu,
277
655000
3000
11:13
and the yellow dots are going to be friends of the people with the flu.
278
658000
3000
11:16
And the size of the dots is going to be proportional
279
661000
2000
11:18
to how many of their friends have the flu.
280
663000
2000
11:20
So bigger dots mean more of your friends have the flu.
281
665000
3000
11:23
And if you look at this image -- here we are now in September the 13th --
282
668000
3000
11:26
you're going to see a few cases light up.
283
671000
2000
11:28
You're going to see kind of blooming of the flu in the middle.
284
673000
2000
11:30
Here we are on October the 19th.
285
675000
3000
11:33
The slope of the epidemic curve is approaching now, in November.
286
678000
2000
11:35
Bang, bang, bang, bang, bang -- you're going to see lots of blooming in the middle,
287
680000
3000
11:38
and then you're going to see a sort of leveling off,
288
683000
2000
11:40
fewer and fewer cases towards the end of December.
289
685000
3000
11:43
And this type of a visualization
290
688000
2000
11:45
can show that epidemics like this take root
291
690000
2000
11:47
and affect central individuals first,
292
692000
2000
11:49
before they affect others.
293
694000
2000
11:51
Now, as I've been suggesting,
294
696000
2000
11:53
this method is not restricted to germs,
295
698000
3000
11:56
but actually to anything that spreads in populations.
296
701000
2000
11:58
Information spreads in populations,
297
703000
2000
12:00
norms can spread in populations,
298
705000
2000
12:02
behaviors can spread in populations.
299
707000
2000
12:04
And by behaviors, I can mean things like criminal behavior,
300
709000
3000
12:07
or voting behavior, or health care behavior,
301
712000
3000
12:10
like smoking, or vaccination,
302
715000
2000
12:12
or product adoption, or other kinds of behaviors
303
717000
2000
12:14
that relate to interpersonal influence.
304
719000
2000
12:16
If I'm likely to do something that affects others around me,
305
721000
3000
12:19
this technique can get early warning or early detection
306
724000
3000
12:22
about the adoption within the population.
307
727000
3000
12:25
The key thing is that for it to work,
308
730000
2000
12:27
there has to be interpersonal influence.
309
732000
2000
12:29
It cannot be because of some broadcast mechanism
310
734000
2000
12:31
affecting everyone uniformly.
311
736000
3000
12:35
Now the same insights
312
740000
2000
12:37
can also be exploited -- with respect to networks --
313
742000
3000
12:40
can also be exploited in other ways,
314
745000
3000
12:43
for example, in the use of targeting
315
748000
2000
12:45
specific people for interventions.
316
750000
2000
12:47
So, for example, most of you are probably familiar
317
752000
2000
12:49
with the notion of herd immunity.
318
754000
2000
12:51
So, if we have a population of a thousand people,
319
756000
3000
12:54
and we want to make the population immune to a pathogen,
320
759000
3000
12:57
we don't have to immunize every single person.
321
762000
2000
12:59
If we immunize 960 of them,
322
764000
2000
13:01
it's as if we had immunized a hundred [percent] of them.
323
766000
3000
13:04
Because even if one or two of the non-immune people gets infected,
324
769000
3000
13:07
there's no one for them to infect.
325
772000
2000
13:09
They are surrounded by immunized people.
326
774000
2000
13:11
So 96 percent is as good as 100 percent.
327
776000
3000
13:14
Well, some other scientists have estimated
328
779000
2000
13:16
what would happen if you took a 30 percent random sample
329
781000
2000
13:18
of these 1000 people, 300 people and immunized them.
330
783000
3000
13:21
Would you get any population-level immunity?
331
786000
2000
13:23
And the answer is no.
332
788000
3000
13:26
But if you took this 30 percent, these 300 people
333
791000
2000
13:28
and had them nominate their friends
334
793000
2000
13:30
and took the same number of vaccine doses
335
795000
3000
13:33
and vaccinated the friends of the 300 --
336
798000
2000
13:35
the 300 friends --
337
800000
2000
13:37
you can get the same level of herd immunity
338
802000
2000
13:39
as if you had vaccinated 96 percent of the population
339
804000
3000
13:42
at a much greater efficiency, with a strict budget constraint.
340
807000
3000
13:45
And similar ideas can be used, for instance,
341
810000
2000
13:47
to target distribution of things like bed nets
342
812000
2000
13:49
in the developing world.
343
814000
2000
13:51
If we could understand the structure of networks in villages,
344
816000
3000
13:54
we could target to whom to give the interventions
345
819000
2000
13:56
to foster these kinds of spreads.
346
821000
2000
13:58
Or, frankly, for advertising with all kinds of products.
347
823000
3000
14:01
If we could understand how to target,
348
826000
2000
14:03
it could affect the efficiency
349
828000
2000
14:05
of what we're trying to achieve.
350
830000
2000
14:07
And in fact, we can use data
351
832000
2000
14:09
from all kinds of sources nowadays [to do this].
352
834000
2000
14:11
This is a map of eight million phone users
353
836000
2000
14:13
in a European country.
354
838000
2000
14:15
Every dot is a person, and every line represents
355
840000
2000
14:17
a volume of calls between the people.
356
842000
2000
14:19
And we can use such data, that's being passively obtained,
357
844000
3000
14:22
to map these whole countries
358
847000
2000
14:24
and understand who is located where within the network.
359
849000
3000
14:27
Without actually having to query them at all,
360
852000
2000
14:29
we can get this kind of a structural insight.
361
854000
2000
14:31
And other sources of information, as you're no doubt aware
362
856000
3000
14:34
are available about such features, from email interactions,
363
859000
3000
14:37
online interactions,
364
862000
2000
14:39
online social networks and so forth.
365
864000
3000
14:42
And in fact, we are in the era of what I would call
366
867000
2000
14:44
"massive-passive" data collection efforts.
367
869000
3000
14:47
They're all kinds of ways we can use massively collected data
368
872000
3000
14:50
to create sensor networks
369
875000
3000
14:53
to follow the population,
370
878000
2000
14:55
understand what's happening in the population,
371
880000
2000
14:57
and intervene in the population for the better.
372
882000
3000
15:00
Because these new technologies tell us
373
885000
2000
15:02
not just who is talking to whom,
374
887000
2000
15:04
but where everyone is,
375
889000
2000
15:06
and what they're thinking based on what they're uploading on the Internet,
376
891000
3000
15:09
and what they're buying based on their purchases.
377
894000
2000
15:11
And all this administrative data can be pulled together
378
896000
3000
15:14
and processed to understand human behavior
379
899000
2000
15:16
in a way we never could before.
380
901000
3000
15:19
So, for example, we could use truckers' purchases of fuel.
381
904000
3000
15:22
So the truckers are just going about their business,
382
907000
2000
15:24
and they're buying fuel.
383
909000
2000
15:26
And we see a blip up in the truckers' purchases of fuel,
384
911000
3000
15:29
and we know that a recession is about to end.
385
914000
2000
15:31
Or we can monitor the velocity
386
916000
2000
15:33
with which people are moving with their phones on a highway,
387
918000
3000
15:36
and the phone company can see,
388
921000
2000
15:38
as the velocity is slowing down,
389
923000
2000
15:40
that there's a traffic jam.
390
925000
2000
15:42
And they can feed that information back to their subscribers,
391
927000
3000
15:45
but only to their subscribers on the same highway
392
930000
2000
15:47
located behind the traffic jam!
393
932000
2000
15:49
Or we can monitor doctors prescribing behaviors, passively,
394
934000
3000
15:52
and see how the diffusion of innovation with pharmaceuticals
395
937000
3000
15:55
occurs within [networks of] doctors.
396
940000
2000
15:57
Or again, we can monitor purchasing behavior in people
397
942000
2000
15:59
and watch how these types of phenomena
398
944000
2000
16:01
can diffuse within human populations.
399
946000
3000
16:04
And there are three ways, I think,
400
949000
2000
16:06
that these massive-passive data can be used.
401
951000
2000
16:08
One is fully passive,
402
953000
2000
16:10
like I just described --
403
955000
2000
16:12
as in, for instance, the trucker example,
404
957000
2000
16:14
where we don't actually intervene in the population in any way.
405
959000
2000
16:16
One is quasi-active,
406
961000
2000
16:18
like the flu example I gave,
407
963000
2000
16:20
where we get some people to nominate their friends
408
965000
3000
16:23
and then passively monitor their friends --
409
968000
2000
16:25
do they have the flu, or not? -- and then get warning.
410
970000
2000
16:27
Or another example would be,
411
972000
2000
16:29
if you're a phone company, you figure out who's central in the network
412
974000
3000
16:32
and you ask those people, "Look, will you just text us your fever every day?
413
977000
3000
16:35
Just text us your temperature."
414
980000
2000
16:37
And collect vast amounts of information about people's temperature,
415
982000
3000
16:40
but from centrally located individuals.
416
985000
2000
16:42
And be able, on a large scale,
417
987000
2000
16:44
to monitor an impending epidemic
418
989000
2000
16:46
with very minimal input from people.
419
991000
2000
16:48
Or, finally, it can be more fully active --
420
993000
2000
16:50
as I know subsequent speakers will also talk about today --
421
995000
2000
16:52
where people might globally participate in wikis,
422
997000
2000
16:54
or photographing, or monitoring elections,
423
999000
3000
16:57
and upload information in a way that allows us to pool
424
1002000
2000
16:59
information in order to understand social processes
425
1004000
2000
17:01
and social phenomena.
426
1006000
2000
17:03
In fact, the availability of these data, I think,
427
1008000
2000
17:05
heralds a kind of new era
428
1010000
2000
17:07
of what I and others would like to call
429
1012000
2000
17:09
"computational social science."
430
1014000
2000
17:11
It's sort of like when Galileo invented -- or, didn't invent --
431
1016000
3000
17:14
came to use a telescope
432
1019000
2000
17:16
and could see the heavens in a new way,
433
1021000
2000
17:18
or Leeuwenhoek became aware of the microscope --
434
1023000
2000
17:20
or actually invented --
435
1025000
2000
17:22
and could see biology in a new way.
436
1027000
2000
17:24
But now we have access to these kinds of data
437
1029000
2000
17:26
that allow us to understand social processes
438
1031000
2000
17:28
and social phenomena
439
1033000
2000
17:30
in an entirely new way that was never before possible.
440
1035000
3000
17:33
And with this science, we can
441
1038000
2000
17:35
understand how exactly
442
1040000
2000
17:37
the whole comes to be greater
443
1042000
2000
17:39
than the sum of its parts.
444
1044000
2000
17:41
And actually, we can use these insights
445
1046000
2000
17:43
to improve society and improve human well-being.
446
1048000
3000
17:46
Thank you.
447
1051000
2000

▲Back to top

ABOUT THE SPEAKER
Nicholas Christakis - Physician, social scientist
Nicholas Christakis explores how the large-scale, face-to-face social networks in which we are embedded affect our lives, and what we can do to take advantage of this fact.

Why you should listen

People aren't merely social animals in the usual sense, for we don't just live in groups. We live in networks -- and we have done so ever since we emerged from the African savannah. Via intricately branching paths tracing out cascading family connections, friendship ties, and work relationships, we are interconnected to hundreds or even thousands of specific people, most of whom we do not know. We affect them and they affect us.

Nicholas Christakis' work examines the biological, psychological, sociological, and mathematical rules that govern how we form these social networks, and the rules that govern how they shape our lives. His work shows how phenomena as diverse as obesity, smoking, emotions, ideas, germs, and altruism can spread through our social ties, and how genes can partially underlie our creation of social ties to begin with. His work also sheds light on how we might take advantage of an understanding of social networks to make the world a better place.

At Yale, Christakis is a Professor of Social and Natural Science, and he directs a diverse research group in the field of biosocial science, primarily investigating social networks. His popular undergraduate course "Health of the Public" is available as a podcast. His book, Connected, co-authored with James H. Fowler, appeared in 2009, and has been translated into 20 languages. In 2009, he was named by Time magazine to its annual list of the 100 most influential people in the world, and also, in 2009 and 2010, by Foreign Policy magazine to its list of 100 top global thinkers

More profile about the speaker
Nicholas Christakis | Speaker | TED.com