ABOUT THE SPEAKER
John Wilbanks - Data Commons Advocate
Imagine the discoveries that could result from a giant pool of freely available health and genomic data. John Wilbanks is working to build it.

Why you should listen

Performing a medical or genomic experiment on a human requires informed consent and careful boundaries around privacy. But what if the data that results, once scrubbed of identifying marks, was released into the wild? At WeConsent.us, John Wilbanks thinks through the ethical and procedural steps to create an open, massive, mine-able database of data about health and genomics from many sources. One step: the Portable Legal Consent for Common Genomics Research (PLC-CGR), an experimental bioethics protocol that would allow any test subject to say, "Yes, once this experiment is over, you can use my data, anonymously, to answer any other questions you can think of." Compiling piles of test results in one place, Wilbanks suggests, would turn genetic info into big data--giving researchers the potential to spot patterns that simply aren't viewable up close. 

A campaigner for the wide adoption of data sharing in science, Wilbanks is also a Senior Fellow with the Kauffman Foundation, a Research Fellow at Lybba and supported by Sage Bionetworks

In February 2013, the US government responded to a We the People petition spearheaded by Wilbanks and signed by 65,000 people, and announced a plan to open up taxpayer-funded research data and make it available for free.

More profile about the speaker
John Wilbanks | Speaker | TED.com
TEDGlobal 2012

John Wilbanks: Let's pool our medical data

John Wilbanks: 把醫療數據匯集起來吧

Filmed:
581,818 views

當你進行診療、或參加醫學測試時,隱私是很重要的;嚴謹的法規限制了研究員對你的資料的審視範圍。但假若你的醫療資料-匿名的-可以被用來進行醫學實驗呢?John Wilbanks 認為保護我們隱私的想法有可能延緩了研究速度,開放醫療資料或許能引領保健創新的潮流。
- Data Commons Advocate
Imagine the discoveries that could result from a giant pool of freely available health and genomic data. John Wilbanks is working to build it. Full bio

Double-click the English transcript below to play the video.

00:15
So I have bad news新聞, I have good news新聞,
0
98
3061
今天我有好消息和壞消息,
00:19
and I have a task任務.
1
3159
1865
我還有個任務。
00:20
So the bad news新聞 is that we all get sick生病.
2
5024
2943
壞消息是我們都會生病。
00:23
I get sick生病. You get sick生病.
3
7967
2272
我會生病。你會生病。
00:26
And every一切 one of us gets得到 sick生病, and the question really is,
4
10239
2542
當有人生病時,真正的問題在於,
00:28
how sick生病 do we get? Is it something that kills殺死 us?
5
12781
2877
我們病的多嚴重?有可能致命嗎?
00:31
Is it something that we survive生存?
6
15658
1345
還是我們可以倖存?
00:32
Is it something that we can treat對待?
7
17003
1928
我們可以被醫治嗎?
00:34
And we've我們已經 gotten得到 sick生病 as long as we've我們已經 been people.
8
18931
3256
只要我們是人類,就會生病。
00:38
And so we've我們已經 always looked看著 for reasons原因 to explain說明 why we get sick生病.
9
22187
3486
於是我們總是在找為什麼會生病的原因。
00:41
And for a long time, it was the gods, right?
10
25673
1957
好一段時間以來,都是因為眾神,對吧?
00:43
The gods are angry憤怒 with me, or the gods are testing測試 me,
11
27630
3154
眾神對我感到惱怒,或者是眾神在測試我,對嗎?
00:46
right? Or God, singular單數, more recently最近,
12
30784
2416
又或者是到較近期,唯一的,上帝,
00:49
is punishing懲罰 me or judging判斷 me.
13
33200
2664
在懲罰我或審判我。
00:51
And as long as we've我們已經 looked看著 for explanations說明,
14
35864
2680
只要我們一开始尋找解釋,
00:54
we've我們已經 wound傷口 up with something that gets得到 closer接近 and closer接近 to science科學,
15
38544
3711
就會愈來越接近科學,
00:58
which哪一個 is hypotheses假設 as to why we get sick生病,
16
42255
2489
找到我們為什麼會生病的假設原因,
01:00
and as long as we've我們已經 had hypotheses假設 about why we get sick生病, we've我們已經 tried試著 to treat對待 it as well.
17
44744
4740
只要我們有了為什麼會生病的假設原因,就會試圖治療。
01:05
So this is Avicenna阿維森納. He wrote a book over a thousand years年份 ago called "The Canon教規 of Medicine醫學,"
18
49484
4033
這位是 Avicenna。一千多年前他寫了一本書
叫《醫典》("The Canon of Medicine"),
01:09
and the rules規則 he laid鋪設 out for testing測試 medicines藥品
19
53517
2406
列出了測試藥物的規則,
01:11
are actually其實 really similar類似 to the rules規則 we have today今天,
20
55923
1789
其實非常類似我們今日的規則,
01:13
that the disease疾病 and the medicine醫學 must必須 be the same相同 strength強度,
21
57712
2945
像是疾病和藥物具有同等強度、
01:16
the medicine醫學 needs需求 to be pure, and in the end結束 we need
22
60657
2397
藥物必須質純、最後必須進行人體測試。
01:18
to test測試 it in people. And so if you put together一起 these themes主題
23
63054
3141
若能將這些理想的假設狀況
01:22
of a narrative敘述 or a hypothesis假設 in human人的 testing測試,
24
66195
4465
全部集合到人體試驗上,
01:26
right, you get some beautiful美麗 results結果,
25
70660
2656
你會得到絕佳結果,
01:29
even when we didn't have very good technologies技術.
26
73316
1442
就算當下並沒有非常先進的技術。
01:30
This is a guy named命名 Carlos卡洛斯 Finlay芬利. He had a hypothesis假設
27
74758
3062
有個名叫 Carlos Finlay 的男人,提出了一個假設,
01:33
that was way outside the box for his time, in the late晚了 1800s.
28
77820
2905
對他所處的 19 世紀末來說,是相當先進的。
01:36
He thought yellow黃色 fever發熱 was not transmitted發送 by dirty clothing服裝.
29
80725
2848
他認為黃熱病不是藉由髒衣物傳染。
01:39
He thought it was transmitted發送 by mosquitos蚊子.
30
83573
2426
他認為是透過蚊子傳染。
01:41
And they laughed笑了 at him. For 20 years年份, they called this guy
31
85999
2362
眾人都取笑他。20 年來,人們稱他為
01:44
"the mosquito蚊子 man." But he ran an experiment實驗 in people,
32
88361
3489
「蚊子男」。但他進行了人體試驗,
01:47
right? He had this hypothesis假設, and he tested測試 it in people.
33
91850
3103
對吧?他提出一個假設,並對它進行人體測試。
01:50
So he got volunteers志願者 to go move移動 to Cuba古巴 and live生活 in tents帳篷
34
94953
4642
所以他找了一些自願者搬到古巴,住在帳篷裡,
01:55
and be voluntarily自行 infected感染 with yellow黃色 fever發熱.
35
99595
3035
自願感染黃熱病。
01:58
So some of the people in some of the tents帳篷 had dirty clothes衣服
36
102630
3022
其中一些人的帳篷裡有髒衣服、
02:01
and some of the people were in tents帳篷 that were full充分
37
105652
1219
而其他一些人的帳篷裡充滿著
02:02
of mosquitos蚊子 that had been exposed裸露 to yellow黃色 fever發熱.
38
106871
2256
帶有黃熱病的蚊子。
02:05
And it definitively明確 proved證實 that it wasn't this magic魔法 dust灰塵
39
109127
3401
結果無疑地證明了,不是你衣物中
02:08
called fomites傳染體 in your clothes衣服 that caused造成 yellow黃色 fever發熱.
40
112528
3422
被稱做汙染物的神秘灰塵,引發了黃熱病的傳染。
02:11
But it wasn't until直到 we tested測試 it in people that we actually其實 knew知道.
41
115950
3376
但直到進行了人體試驗
我們才真正確認這件事。
02:15
And this is what those people signed up for.
42
119326
1959
這是那些人自願做的事。
02:17
This is what it looked看著 like to have yellow黃色 fever發熱 in Cuba古巴
43
121285
3090
這是當時在古巴罹患黃熱病的樣子。
02:20
at that time. You suffered遭遇 in a tent帳篷, in the heat, alone單獨,
44
124375
4534
在帳篷中受苦,發高燒,孤單一人,
02:24
and you probably大概 died死亡.
45
128909
2696
而且有可能會死。
02:27
But people volunteered自告奮勇 for this.
46
131605
3217
但這些人對此自告奮勇。
02:30
And it's not just a cool example of a scientific科學 design設計
47
134822
3311
這並不僅是件根據理論操作的
02:34
of experiment實驗 in theory理論. They also did this beautiful美麗 thing.
48
138133
2913
科學設計實驗範例,
他們還做了一件了不起的事。
02:36
They signed this document文件, and it's called an informed通知 consent同意 document文件.
49
141046
3919
他們簽署了這個文件,叫做《受試者同意書》。
02:40
And informed通知 consent同意 is an idea理念 that we should be
50
144965
2513
受試者同意書是我們社會應該
02:43
very proud驕傲 of as a society社會, right? It's something that
51
147478
2226
感到自豪的概念,對不對?這是能將我們和
02:45
separates中隔離 us from the Nazis納粹 at Nuremberg紐倫堡,
52
149704
2766
在紐倫堡實行強制醫療實驗的納粹主義
02:48
enforced強制執行 medical experimentation實驗. It's the idea理念
53
152470
2875
區隔開來的東西。這想法是
02:51
that agreement協議 to join加入 a study研究 without understanding理解 isn't agreement協議.
54
155345
3788
同意參與研究,卻不瞭解研究內容的話,此協定就不成立。
02:55
It's something that protects保護 us from harm危害, from hucksters小販,
55
159133
4109
這可保護我們不受到傷害、不遭商人欺騙、
02:59
from people that would try to hoodwink蒙蔽 us into a clinical臨床
56
163242
2853
或任何人哄騙而進行一件
03:01
study研究 that we don't understand理解, or that we don't agree同意 to.
57
166095
3752
我們不了解或不同意的臨床研究。
03:05
And so you put together一起 the thread of narrative敘述 hypothesis假設,
58
169847
4329
因此,如果你將各種假設條件集合起來,
03:10
experimentation實驗 in humans人類, and informed通知 consent同意,
59
174176
2597
進行人體實驗、簽署受試者同意書,
03:12
and you get what we call clinical臨床 study研究, and it's how we do
60
176773
2665
你就達到我們所說的臨床研究標準,這也是我們
03:15
the vast廣大 majority多數 of medical work. It doesn't really matter
61
179438
3015
我們進行大量醫療研究的方式。
03:18
if you're in the north, the south, the east, the west西.
62
182453
2342
不管你人在哪裡都無關。
03:20
Clinical臨床 studies學習 form形成 the basis基礎 of how we investigate調查,
63
184795
4113
臨床研究構成我們研究的基礎,
03:24
so if we're going to look at a new drug藥物, right,
64
188908
1859
如果我們要看一種新藥物的效果,
03:26
we test測試 it in people, we draw blood血液, we do experiments實驗,
65
190767
2998
我們進行人體測試、抽血、進行實驗、
03:29
and we gain獲得 consent同意 for that study研究, to make sure
66
193765
2329
取得此研究的受試者同意書,以確保
03:31
that we're not screwing擰緊 people over as part部分 of it.
67
196094
2649
過程中我們不會傷害任何人。
03:34
But the world世界 is changing改變 around the clinical臨床 study研究,
68
198743
3664
但是臨床研究的领域有了很大改變,
03:38
which哪一個 has been fairly相當 well established既定 for tens of years年份
69
202407
3366
就算它在過去數十年來,就算不到五十或者一百年,
03:41
if not 50 to 100 years年份.
70
205773
1900
已經建立了完善的架構。
03:43
So now we're able能夠 to gather收集 data數據 about our genomes基因組,
71
207673
3051
我們現在可以蒐集基因組的資料,
03:46
but, as we saw earlier, our genomes基因組 aren't dispositive處分.
72
210724
2860
但是,如同我們剛才所見,我們的基因組是固定的。
03:49
We're able能夠 to gather收集 information信息 about our environment環境.
73
213584
2766
我們可蒐集周遭環境的資訊。
03:52
And more importantly重要的, we're able能夠 to gather收集 information信息
74
216350
1910
更重要地,我們可以蒐集
03:54
about our choices選擇, because it turns out that what we think of
75
218260
2840
我們選擇的資訊,因為事實上
03:57
as our health健康 is more like the interaction相互作用 of our bodies身體,
76
221100
2720
我們的健康更像是我們的身體、基因組、
03:59
our genomes基因組, our choices選擇 and our environment環境.
77
223820
3649
我們的選擇、和環境間互動而來的結果。
04:03
And the clinical臨床 methods方法 that we've我們已經 got aren't very good
78
227469
2744
我們現有的臨床方法不是很適合研究,
04:06
at studying研究 that because they are based基於 on the idea理念
79
230213
2632
因為它們是以人和人的互動
04:08
of person-to-person人對人 interaction相互作用. You interact相互作用
80
232845
1914
作為基礎。你和你的
04:10
with your doctor醫生 and you get enrolled就讀 in the study研究.
81
234759
2095
醫生互動、然後被登記到研究中。
04:12
So this is my grandfather祖父. I actually其實 never met會見 him,
82
236854
2615
這是我的外公,事實上我從未見過他,
04:15
but he's holding保持 my mom媽媽, and his genes基因 are in me, right?
83
239469
3795
但他抱著我媽,而且我身上流著他的基因,對吧?
04:19
His choices選擇 ran through通過 to me. He was a smoker抽煙者,
84
243264
2891
他的選擇流到我身上來。他吸菸,
04:22
like most people were. This is my son兒子.
85
246155
2584
跟很多人一樣。這是我的兒子。
04:24
So my grandfather's爺爺的 genes基因 go all the way through通過 to him,
86
248739
3442
所以我外公的基因流到他身上去,
04:28
and my choices選擇 are going to affect影響 his health健康.
87
252181
2552
而我的選擇也將影響他的健康。
04:30
The technology技術 between之間 these two pictures圖片
88
254733
2694
這兩張照片所使用的科技
04:33
cannot不能 be more different不同, but the methodology方法
89
257427
3673
相差甚大。但是在這段期間,
04:37
for clinical臨床 studies學習 has not radically根本 changed over that time period.
90
261100
4124
臨床研究的方法並沒有什麼劇烈改變。
04:41
We just have better statistics統計.
91
265224
2668
我們只是有較好的統計資料。
04:43
The way we gain獲得 informed通知 consent同意 was formed形成 in large part部分
92
267892
3452
我們取得受試者同意書的方法
大致是在二次世界大戰後形成,
04:47
after World世界 War戰爭 IIII, around the time that picture圖片 was taken採取.
93
271344
2591
大約是那照片拍攝的時間。
04:49
That was 70 years年份 ago, and the way we gain獲得 informed通知 consent同意,
94
273935
3934
那是 70 年前,而我們取得受試者同意書的方法,
04:53
this tool工具 that was created創建 to protect保護 us from harm危害,
95
277869
2877
這個保護我們不受到傷害的同意書,
04:56
now creates創建 silos筒倉. So the data數據 that we collect蒐集
96
280746
3666
現在變成了資料貯存窖。我們為前列腺癌
05:00
for prostate前列腺 cancer癌症 or for Alzheimer's老年癡呆症 trials試驗
97
284412
2726
或阿茲海默症試驗所蒐集來的資料,
05:03
goes into silos筒倉 where it can only be used
98
287138
2615
被存放到只被用來研究
05:05
for prostate前列腺 cancer癌症 or for Alzheimer's老年癡呆症 research研究.
99
289753
3224
前列腺癌症或阿茲海默症的資料貯存窖中。
05:08
Right? It can't be networked聯網. It can't be integrated集成.
100
292977
2894
對吧?資料不能被分享。不能被連結。
05:11
It cannot不能 be used by people who aren't credentialed特命.
101
295871
3533
不能被無憑證的人使用。
05:15
So a physicist物理學家 can't get access訪問 to it without filing備案 paperwork證件.
102
299404
2953
所以沒有申請許可的話,物理學家不能取得資訊。
05:18
A computer電腦 scientist科學家 can't get access訪問 to it without filing備案 paperwork證件.
103
302357
3068
沒有申請許可的話,電腦科學家不能取得資訊。
05:21
Computer電腦 scientists科學家們 aren't patient患者. They don't file文件 paperwork證件.
104
305425
4143
電腦科學家很沒耐心,他們不申請許可的。
05:25
And this is an accident事故. These are tools工具 that we created創建
105
309568
3986
這真是個意外。我們創造了這些工具
05:29
to protect保護 us from harm危害, but what they're doing
106
313554
3267
來保護我們不受到傷害,但它們現在卻
05:32
is protecting保護 us from innovation革新 now.
107
316821
2530
阻礙我們創新。
05:35
And that wasn't the goal目標. It wasn't the point. Right?
108
319351
3265
這有違目的。因為那不是本來的意思。對吧?
05:38
It's a side effect影響, if you will, of a power功率 we created創建
109
322616
2699
你可以想說,這是我們立意為善下
05:41
to take us for good.
110
325315
2359
所產生的副作用。
05:43
And so if you think about it, the depressing壓抑 thing is that
111
327674
3144
當你這麼想時,令人沮喪的是
05:46
FacebookFacebook的 would never make a change更改 to something
112
330818
2133
臉書永遠不會為了第三階段臨床試驗裡
05:48
as important重要 as an advertising廣告 algorithm算法
113
332951
2571
這麼小的樣品數,來改變其廣告規則
05:51
with a sample樣品 size尺寸 as small as a Phase IIIIII clinical臨床 trial審訊.
114
335522
4411
這樣重要的事。
05:55
We cannot不能 take the information信息 from past過去 trials試驗
115
339933
3662
我們不能使用過去試驗的資訊
05:59
and put them together一起 to form形成 statistically統計學 significant重大 samples樣本.
116
343595
4154
建立具有顯著統計性的研究樣品數。
06:03
And that sucks, right? So 45 percent百分 of men男人 develop發展
117
347749
3484
這感覺超差的,對不對?於是 45% 的男人會罹患癌症。
06:07
cancer癌症. Thirty-eight三八 percent百分 of women婦女 develop發展 cancer癌症.
118
351233
3097
38% 的女人會罹患癌症。
06:10
One in four men男人 dies of cancer癌症.
119
354330
2344
四分之一的男人會因癌症而死。
06:12
One in five women婦女 dies of cancer癌症, at least最小 in the United聯合的 States狀態.
120
356674
3556
五分之一的女人會因癌症而死,至少就美國而言。
06:16
And three out of the four drugs毒品 we give you
121
360230
2228
在癌症試驗中,我們給你的四顆藥中,
06:18
if you get cancer癌症 fail失敗. And this is personal個人 to me.
122
362458
3513
有三顆是無效的。我對這些都身有所感。
06:21
My sister妹妹 is a cancer癌症 survivor倖存者.
123
365971
1963
我妹妹是癌症的倖存者。
06:23
My mother-in-law岳母 is a cancer癌症 survivor倖存者. Cancer癌症 sucks.
124
367934
3589
我的岳母是癌症的倖存者。癌症爛透了。
06:27
And when you have it, you don't have a lot of privacy隱私
125
371523
2190
當你罹患癌症時,你在醫院沒有什麼隱私權可言。
06:29
in the hospital醫院. You're naked the vast廣大 majority多數 of the time.
126
373713
3487
大部分時間你赤身露體的。
06:33
People you don't know come in and look at you and poke you and prod you,
127
377200
3695
不認識的人會進來、看一下你、戳你、刺你,
06:36
and when I tell cancer癌症 survivors倖存者 that this tool工具 we created創建
128
380895
3441
當我告訴癌症倖存者,我們設計來
06:40
to protect保護 them is actually其實 preventing防止 their data數據 from being存在 used,
129
384336
3098
保護他們的工具,實際上是避免他們的個資被使用,
06:43
especially特別 when only three to four percent百分 of people
130
387434
2050
尤其是在只有百分之三或四的癌症患者
06:45
who have cancer癌症 ever even sign標誌 up for a clinical臨床 study研究,
131
389484
2798
曾經參與過臨床研究的情況下,
06:48
their reaction反應 is not, "Thank you, God, for protecting保護 my privacy隱私."
132
392282
3558
他們的反應不是,「天啊,謝謝你保護我的隱私。」
06:51
It's outrage暴行
133
395840
2697
而是憤怒,
06:54
that we have this information信息 and we can't use it.
134
398537
2125
我們有這些資訊,但我們不能使用它。
06:56
And it's an accident事故.
135
400662
2476
這真是令人意外。
06:59
So the cost成本 in blood血液 and treasure寶藏 of this is enormous巨大.
136
403138
3055
人們在這方面付出的生命和財產代價非常龐大。
07:02
Two hundred and twenty-six26 billion十億 a year is spent花費 on cancer癌症 in the United聯合的 States狀態.
137
406193
3655
在美國,每年花費 2260 億在癌症上。
07:05
Fifteen十五 hundred people a day die in the United聯合的 States狀態.
138
409848
3219
在美國,每天有 1500 人死於癌症。
07:08
And it's getting得到 worse更差.
139
413067
2573
而且每況愈下。
07:11
So the good news新聞 is that some things have changed,
140
415640
2982
所以好消息是,有些事已經改變,
07:14
and the most important重要 thing that's changed
141
418622
1553
而最重要的改變是
07:16
is that we can now measure測量 ourselves我們自己 in ways方法
142
420175
2338
現在我們可以自我測量
07:18
that used to be the dominion主權 of the health健康 system系統.
143
422513
3058
用以前由醫療系統掌控的方式。
07:21
So a lot of people talk about it as digital數字 exhaust排氣.
144
425571
2158
很多人將此視為數位侵略。
07:23
I like to think of it as the dust灰塵 that runs運行 along沿 behind背後 my kid孩子.
145
427729
3042
我則樂於視為跟在我孩子後面的塵埃。
07:26
We can reach達到 back and grab that dust灰塵,
146
430771
2376
我們可以向後伸出手抓住一把,
07:29
and we can learn學習 a lot about health健康 from it, so if our choices選擇
147
433147
2414
然後從中學習到很多健康訊息,
所以如果我們的選擇
07:31
are part部分 of our health健康, what we eat is a really important重要
148
435561
2680
會影響到我們的健康,
我們所吃的食物就真的
07:34
aspect方面 of our health健康. So you can do something very simple簡單
149
438241
2689
是我們健康的重要一環。
你可以做一些非常簡單
07:36
and basic基本 and take a picture圖片 of your food餐飲,
150
440930
1957
和基本的事情,像是
拍一張你吃的食物的照片,
07:38
and if enough足夠 people do that, we can learn學習 a lot about
151
442887
2884
若有足夠的人這麼做,
我們就可以學習到很多有關
07:41
how our food餐飲 affects影響 our health健康.
152
445771
1425
食物如何影響我們健康的事情。
07:43
One interesting有趣 thing that came來了 out of this — this is an app應用 for iPhonesiPhone手機 called The Eatery簡便飲食店
153
447196
4516
從中可得出一件有趣的事
─ 這是一個 iPhone 應用程式叫 The Eatery ─
07:47
is that we think our pizza比薩 is significantly顯著 healthier健康
154
451712
2490
我們都以為我們吃的披薩會比其他人的
07:50
than other people's人們 pizza比薩 is. Okay? (Laughter笑聲)
155
454202
3438
更為健康。對嗎?(笑聲)
07:53
And it seems似乎 like a trivial不重要的 result結果, but this is the sort分類 of research研究
156
457640
3608
這看起來像是個極微小的結果,但是以前醫療系統
07:57
that used to take the health健康 system系統 years年份
157
461248
2314
得花上好幾年和好幾十萬元
07:59
and hundreds數以百計 of thousands數千 of dollars美元 to accomplish完成.
158
463562
2293
才能完成的研究。
08:01
It was doneDONE in five months個月 by a startup啟動 company公司 of a couple一對 of people.
159
465855
3724
這是由一家兩個人成立的新創公司在五個月內完成的。
08:05
I don't have any financial金融 interest利益 in it.
160
469579
2624
我對其中的財務狀況沒有興趣。
08:08
But more nontrivially非平凡, we can get our genotypes基因型 doneDONE,
161
472203
2696
更重要的是,我們的基因型完成了,
08:10
and although雖然 our genotypes基因型 aren't dispositive處分, they give us clues線索.
162
474899
2818
雖然我們的基因型不是不可處置的,它們給了我們線索。
08:13
So I could show顯示 you mine. It's just A's, T'sT的, C'sC'S and G'sG公司.
163
477717
2806
所以我可以給你們看我的。只有一些 A,T,C 和 G。
08:16
This is the interpretation解釋 of it. As you can see,
164
480523
2232
這是翻譯。你可以看到,
08:18
I carry攜帶 a 32 percent百分 risk風險 of prostate前列腺 cancer癌症,
165
482755
2600
我有 32% 的機率罹患前列腺癌。
08:21
22 percent百分 risk風險 of psoriasis銀屑病 and a 14 percent百分 risk風險 of Alzheimer's老年癡呆症 disease疾病.
166
485355
4223
我有 22% 的機率罹患牛皮癬
和 14% 的機率罹患阿茲海默症。
08:25
So that means手段, if you're a geneticist遺傳學家, you're freaking再用 out,
167
489578
2607
這代表的是,如果你是個基因學家,你會被嚇死,
08:28
going, "Oh my God, you told everyone大家 you carry攜帶 the ApoEApoE基因 E4 allele等位基因. What's wrong錯誤 with you?"
168
492185
4034
「天阿,你告訴每個人你有
載脂蛋白E類等位基因。你是有什麼毛病啊?」
08:32
Right? When I got these results結果, I started開始 talking to doctors醫生,
169
496219
3688
是吧?當我拿到結果時,我開始詢問醫生,
08:35
and they told me not to tell anyone任何人, and my reaction反應 is,
170
499907
2409
他們告訴我不要告訴任何人,我的反應是,
08:38
"Is that going to help anyone任何人 cure治愈 me when I get the disease疾病?"
171
502316
3288
「當我生病時,這有助於其他人治療我嗎?」
08:41
And no one could tell me yes.
172
505604
3064
沒有人可以跟我說對。
08:44
And I live生活 in a web捲筒紙 world世界 where, when you share分享 things,
173
508668
2806
而我住在一個網路的世界,當你分享事情時,
08:47
beautiful美麗 stuff東東 happens發生, not bad stuff東東.
174
511474
2710
美妙的事情發生了,不是壞事。
08:50
So I started開始 putting this in my slide滑動 decks甲板,
175
514184
1900
所以我開始將這放入我的投影片檔中,
08:51
and I got even more obnoxious厭惡, and I went to my doctor醫生,
176
516084
2461
而且我變得更令人討厭,我去找我的醫生,
08:54
and I said, "I'd like to actually其實 get my bloodwork血汗工作.
177
518545
1982
我說:「我想要確實拿到我的血液檢查結果,
08:56
Please give me back my data數據." So this is my most recent最近 bloodwork血汗工作.
178
520527
2790
請還給我我的資料。」
於是這是我最近的血液檢查報告。
08:59
As you can see, I have high cholesterol膽固醇.
179
523317
2369
你可以告到,我有高膽固醇。
09:01
I have particularly尤其 high bad cholesterol膽固醇, and I have some
180
525686
2751
我有特別高且不好的膽固醇,我有一些
09:04
bad liver numbers數字, but those are because we had a dinner晚餐 party派對 with a lot of good wine紅酒
181
528437
3003
不好的肝臟指數,但這些是
一位我們進行檢查的前一天晚上
09:07
the night before we ran the test測試. (Laughter笑聲)
182
531440
2709
參加了晚宴且喝了很多好酒。(笑聲)
09:10
Right. But look at how non-computable非可計算 this information信息 is.
183
534149
4413
好。再看看這個資訊是多麼難計算。
09:14
This is like the photograph照片 of my granddad公公 holding保持 my mom媽媽
184
538562
2974
這就像是我祖父抱著我媽的那張照片的
09:17
from a data數據 perspective透視, and I had to go into the system系統
185
541536
3599
數據呈現,所以我必須進入系統
09:21
and get it out.
186
545135
2162
將它找出來。
09:23
So the thing that I'm proposing建議 we do here
187
547297
3282
於是我要在這建議我們做的事是
09:26
is that we reach達到 behind背後 us and we grab the dust灰塵,
188
550579
2416
我們向後方伸出手抓住塵埃
09:28
that we reach達到 into our bodies身體 and we grab the genotype基因型,
189
552995
2978
我們進入我們的身體內找到基因型,
09:31
and we reach達到 into the medical system系統 and we grab our records記錄,
190
555973
2701
我們進入醫療系統內找到我們的紀錄,
09:34
and we use it to build建立 something together一起, which哪一個 is a commons公地.
191
558674
3440
我們一起將這些資料用來組合成一個公有物。
09:38
And there's been a lot of talk about commonsescommonses, right,
192
562114
3144
已經有很多有關公有物的談論,
09:41
here, there, everywhere到處, right. A commons公地 is nothing more
193
565258
2948
這裡,那裡,到處都是。公有物就是
09:44
than a public上市 good that we build建立 out of private私人的 goods產品.
194
568206
2928
我們用私人物打造出來的公有物。
09:47
We do it voluntarily自行, and we do it through通過 standardized標準化
195
571134
2769
我們自願做這件事,
而且透過標準化的合法工具。
09:49
legal法律 tools工具. We do it through通過 standardized標準化 technologies技術.
196
573903
2800
透過標準化的科技。
09:52
Right. That's all a commons公地 is. It's something that we build建立
197
576703
3271
是的。這就是一個公有物。
是我們一起建立的東西,
09:55
together一起 because we think it's important重要.
198
579974
2520
因為我們認為它很重要。
09:58
And a commons公地 of data數據 is something that's really unique獨特,
199
582494
2632
而且一個公有物的資料非常特別,
10:01
because we make it from our own擁有 data數據. And although雖然
200
585126
2868
因為我們用我們自己的資料來建立它。
10:03
a lot of people like privacy隱私 as their methodology方法 of control控制
201
587994
2287
雖然很多人喜歡將保護隱私
視為他們控制資料的方法,
10:06
around data數據, and obsess纏住 around privacy隱私, at least最小
202
590281
2255
且非常著迷於隱私,至少
10:08
some of us really like to share分享 as a form形成 of control控制,
203
592536
3048
我們當中有些人真的喜歡
將分享視作一種控制的形式,
10:11
and what's remarkable卓越 about digital數字 commonsescommonses
204
595584
2353
而數位化公有物最值得注意的就是
10:13
is you don't need a big percentage百分比 if your sample樣品 size尺寸 is big enough足夠
205
597937
3532
如果你的樣品數夠大,
你不需要大數值的比率
10:17
to generate生成 something massive大規模的 and beautiful美麗.
206
601469
2511
就可以產生大量且美好的結果。
10:19
So not that many許多 programmers程序員 write free自由 software軟件,
207
603980
2558
於是即使沒有很多程式設計師寫免費軟體,
10:22
but we have the Apache阿帕奇 web捲筒紙 server服務器.
208
606538
2335
但我們有阿帕契 (Apache) 網頁伺服器。
10:24
Not that many許多 people who read Wikipedia維基百科 edit編輯,
209
608873
2697
即使沒有很多讀維基百科的人進行編輯,
10:27
but it works作品. So as long as some people like to share分享
210
611570
4009
但它仍在運作。所以只要有一些人喜歡將分享
10:31
as their form形成 of control控制, we can build建立 a commons公地, as long as we can get the information信息 out.
211
615579
3744
視作他們控制的形式,我們可以
建造一個公有物,只要我們可以取得資訊。
10:35
And in biology生物學, the numbers數字 are even better.
212
619323
2376
而在生物學中,數量代表得更好。
10:37
So Vanderbilt范德比爾特 ran a study研究 asking people, we'd星期三 like to take
213
621699
2552
於是 Vanderbilt 進行一項研究,問人們說,我們想要
10:40
your biosamples生物樣品, your blood血液, and share分享 them in a biobank生物樣本庫,
214
624251
3322
取你的生物樣品,你的血液,在生物銀行中進行分享,
10:43
and only five percent百分 of the people opted選擇 out.
215
627573
2372
只有百分之五的人們選擇不要。
10:45
I'm from Tennessee田納西. It's not the most science-positive科學陽性 state
216
629945
3092
我來自田納西州。並不是在美國最嚮往
10:48
in the United聯合的 States狀態 of America美國. (Laughter笑聲)
217
633037
3039
科學的一州。(笑聲)
10:51
But only five percent百分 of the people wanted out.
218
636076
2378
但只有百分之五的人選擇不要參加。
10:54
So people like to share分享, if you give them the opportunity機會 and the choice選擇.
219
638454
4023
所以如果你給人們機會和選擇的話,他們喜歡分享。
10:58
And the reason原因 that I got obsessed痴迷 with this, besides除了 the obvious明顯 family家庭 aspects方面,
220
642477
4483
我會如此著迷的原因是,
除了明顯的家庭因素之外,
11:02
is that I spend a lot of time around mathematicians數學家,
221
646960
3273
我花很多時間和數學家相處,
11:06
and mathematicians數學家 are drawn to places地方 where there's a lot of data數據
222
650233
2914
數學家被吸引到很多數據的地方,
11:09
because they can use it to tease signals信號 out of noise噪聲.
223
653147
2943
因為他們使用數據在雜亂中歸類出信號。
11:11
And those correlations相關 that they can tease out, they're not
224
656090
2968
而他們可以整理出來的相互關係,
11:14
necessarily一定 causal因果 agents代理, but math數學, in this day and age年齡,
225
659058
3872
並非必然的因果關係媒介,但在今日
11:18
is like a giant巨人 set of power功率 tools工具
226
662930
2360
數學就像是我們丟棄在地板上
11:21
that we're leaving離開 on the floor地板, not plugged in in health健康,
227
665290
3875
的一大組有力的工具,我們仍然使用手鋸
11:25
while we use hand saws.
228
669165
2312
不將它跟健康結合。
11:27
If we have a lot of shared共享 genotypes基因型, and a lot of shared共享
229
671477
4438
如果我們有很多被分享的基因型,很多被分享的
11:31
outcomes結果, and a lot of shared共享 lifestyle生活方式 choices選擇,
230
675915
2748
結果,很多被分享的生活型式選擇,
11:34
and a lot of shared共享 environmental環境的 information信息, we can start開始
231
678663
2776
很多被分享的環境資訊,我們可以開始
11:37
to tease out the correlations相關 between之間 subtle微妙 variations變化
232
681439
2896
在細微的差異中整理出相互關係,
11:40
in people, the choices選擇 they make and the health健康 that they create創建 as a result結果 of those choices選擇,
233
684335
5311
在人跟人之間,他們做的選擇間,
還有這些選擇造就的健康間,
11:45
and there's open-source開源 infrastructure基礎設施 to do all of this.
234
689646
2486
有一個開放資源的基礎建設在做這所有的事。
11:48
Sage智者 Bionetworks生物網絡 is a nonprofit非營利性 that's built內置 a giant巨人 math數學 system系統
235
692132
3094
賽智生物網絡 (Sage Bionetworks)
是一個巨大數學系統的非營利組織,
11:51
that's waiting等候 for data數據, but there isn't any.
236
695226
4572
正在等待資料,但不多。
11:55
So that's what I do. I've actually其實 started開始 what we think is
237
699798
3888
於是我這麼做。我開始進行我們認為
11:59
the world's世界 first fully充分 digital數字, fully充分 self-contributed自促成,
238
703686
3938
世界上第一個完全數位化,完全自發性,
12:03
unlimited無限 in scope範圍, global全球 in participation參與, ethically道德 approved批准
239
707624
5035
沒有範圍限制,全球參與,合乎道德的
12:08
clinical臨床 research研究 study研究 where you contribute有助於 the data數據.
240
712659
3655
臨床研究,由你們貢獻數據。
12:12
So if you reach達到 behind背後 yourself你自己 and you grab the dust灰塵,
241
716314
2206
所以如果你往身後抓到塵埃,
12:14
if you reach達到 into your body身體 and grab your genome基因組,
242
718520
2626
如果你進入你的身體內找到基因組,
12:17
if you reach達到 into the medical system系統 and somehow不知何故 extract提取 your medical record記錄,
243
721146
3047
如果你進入醫療系統內
截取出你的醫療紀錄,
12:20
you can actually其實 go through通過 an online線上 informed通知 consent同意 process處理 --
244
724193
3323
你確實可通過線上的受試者同意書過程 --
12:23
because the donation捐款 to the commons公地 must必須 be voluntary自主性
245
727516
2646
因為捐贈給公有物一定要出於自願,
12:26
and it must必須 be informed通知 -- and you can actually其實 upload上載
246
730162
2793
且一定要被告知 -- 而且你可以上傳
12:28
your information信息 and have it syndicated辛迪加 to the
247
732955
2592
你的資訊,讓它被打包後傳輸到專門進行
12:31
mathematicians數學家 who will do this sort分類 of big data數據 research研究,
248
735547
3096
這種大規模數據研究的數學家那裡,
12:34
and the goal目標 is to get 100,000 in the first year
249
738643
2856
我們的目標是第一年可以達到10萬人,
12:37
and a million百萬 in the first five years年份 so that we have
250
741499
2358
第五年可以達到100萬人,這樣我們就會有
12:39
a statistically統計學 significant重大 cohort隊列 that you can use to take
251
743857
3834
一個具有統計學上顯著性差異的一个樣本。我们可以
12:43
smaller sample樣品 sizes大小 from traditional傳統 research研究
252
747691
2422
將傳統研究中的較小樣品數
12:46
and map地圖 it against反對,
253
750113
1599
拿來跟它比較,
12:47
so that you can use it to tease out those subtle微妙 correlations相關
254
751712
2922
找出使我們每個個體與眾不同的變量
12:50
between之間 the variations變化 that make us unique獨特
255
754634
2529
之間的細微關係
12:53
and the kinds of health健康 that we need to move移動 forward前鋒 as a society社會.
256
757163
4024
以及我們整體社會朝著的健康方向之間。
12:57
And I've spent花費 a lot of time around other commons公地.
257
761187
3024
我已經花了很多時間在其它公有物上。
13:00
I've been around the early web捲筒紙. I've been around
258
764211
2680
我在網絡初期就開始參與。
13:02
the early creative創作的 commons公地 world世界, and there's four things
259
766891
2608
我也經過早期有創意的公有物世界,
13:05
that all of these share分享, which哪一個 is, they're all really simple簡單.
260
769499
3354
當中有四個共同點,都非常簡單。
13:08
And so if you were to go to the website網站 and enroll註冊 in this study研究,
261
772853
2727
如果你上這個網站也參加這項研究,
13:11
you're not going to see something complicated複雜.
262
775580
2255
你將不會看到很複雜的東西。
13:13
But it's not simplistic簡單化. These things are weak intentionally故意地,
263
777835
5049
但它也不是過分簡單的。
這些是有意被做得較不充足的,
13:18
right, because you can always add power功率 and control控制 to a system系統,
264
782884
3023
因為你總是可以在一個系統內加上權力和控制,
13:21
but it's very difficult to remove去掉 those things if you put them in at the beginning開始,
265
785907
3964
但是一旦你一開始就加入它們,
之後要移除是非常困難的,
13:25
and so being存在 simple簡單 doesn't mean being存在 simplistic簡單化,
266
789871
2545
於是簡單並不代表過分單純化,
13:28
and being存在 weak doesn't mean weakness弱點.
267
792416
2184
不充足並不代表缺點。
13:30
Those are strengths優勢 in the system系統.
268
794600
2351
這些是系統內的強項。
13:32
And open打開 doesn't mean that there's no money.
269
796951
2665
資源開放也不代表沒有收益。
13:35
Closed關閉 systems系統, corporations公司, make a lot of money
270
799616
3020
封閉的系統和企業在資源開放的網路上
13:38
on the open打開 web捲筒紙, and they're one of the reasons原因 why the open打開 web捲筒紙 lives生活
271
802636
3539
賺很多錢,它們是很多
開放資源網路得以生存的原因之一,
13:42
is that corporations公司 have a vested既得利益 interest利益 in the openness透明度
272
806175
2827
企業在系統開放性中取得
13:44
of the system系統.
273
809002
2334
既得利益。
13:47
And so all of these things are part部分 of the clinical臨床 study研究 that we've我們已經 created創建,
274
811336
3794
所以全部這些都是我們
已經創造的臨床研究中的一部分,
13:51
so you can actually其實 come in, all you have to be is 14 years年份 old,
275
815130
3429
於是你真的可以加入,只要你已經滿 14 歲,
13:54
willing願意 to sign標誌 a contract合同 that says I'm not going to be a jerk混蛋,
276
818559
2027
願意簽署一份我將不會變成混蛋的合約,
13:56
basically基本上, and you're in.
277
820586
2665
基本上你就成功了。
13:59
You can start開始 analyzing分析 the data數據.
278
823251
1573
你可以開始分析數據。
14:00
You do have to solve解決 a CAPTCHACAPTCHA as well. (Laughter笑聲)
279
824824
4159
你也必須要輸入驗證碼。 (笑聲)
14:04
And if you'd like to build建立 corporate企業 structures結構 on top最佳 of it,
280
828983
3581
如果你想要在這之上建立企業架構,
14:08
that's okay too. That's all in the consent同意,
281
832564
3146
那也是可以的。這些都在同意書內,
14:11
so if you don't like those terms條款, you don't come in.
282
835710
2564
所以如果你不喜歡這些條款,你不會參與。
14:14
It's very much the design設計 principles原則 of a commons公地
283
838274
3092
我們用試著要帶入健康數據的公有物
14:17
that we're trying to bring帶來 to health健康 data數據.
284
841366
2594
的設計規則大致如此。
14:19
And the other thing about these systems系統 is that it only takes
285
843960
2979
跟這些系統相關的另一件事是
14:22
a small number of really unreasonable不合理 people working加工 together一起
286
846939
3179
只需要一小群真的很不理性的人合作
14:26
to create創建 them. It didn't take that many許多 people
287
850118
3182
來創造它們。維基百科並不需要
14:29
to make Wikipedia維基百科 Wikipedia維基百科, or to keep it Wikipedia維基百科.
288
853300
3472
很多人來創造和維持。
14:32
And we're not supposed應該 to be unreasonable不合理 in health健康,
289
856772
2068
我們不應該在健康方面不理性,
14:34
and so I hate討厭 this word "patient患者."
290
858840
2276
而且我討厭「忍受/病人」這個字 (patient)。
14:37
I don't like being存在 patient患者 when systems系統 are broken破碎,
291
861116
3167
當系統和醫療制度崩壞的時候,
14:40
and health健康 care關心 is broken破碎.
292
864283
2627
我不喜歡忍受/當病人。
14:42
I'm not talking about the politics政治 of health健康 care關心, I'm talking about the way we scientifically科學 approach途徑 health健康 care關心.
293
866910
4164
我不是在說醫療制度的政治,
我在說我們科學上處理醫療制度的方法。
14:46
So I don't want to be patient患者. And the task任務 I'm giving to you
294
871074
3270
所以我不要繼續忍受下去。我要給你們的任務是
14:50
is to not be patient患者. So I'd like you to actually其實 try,
295
874344
3046
不要束手旁觀。我要你們真的去試試看,
14:53
when you go home, to get your data數據.
296
877390
2717
回到家後,蒐集你的資料。
14:56
You'll你會 be shocked吃驚 and offended生氣 and, I would bet賭注, outraged憤怒,
297
880107
2717
我保證你會因為資料有多難取得而感到
14:58
at how hard it is to get it.
298
882824
2876
驚訝且被冒犯。
15:01
But it's a challenge挑戰 that I hope希望 you'll你會 take,
299
885700
2619
但我希望這是一個你會接受的挑戰,
15:04
and maybe you'll你會 share分享 it. Maybe you won't慣於.
300
888319
2461
或許你還會分享。或許你不會。
15:06
If you don't have anyone任何人 in your family家庭 who's誰是 sick生病,
301
890780
1444
如果你家族中沒有任何人生病,
15:08
maybe you wouldn't不會 be unreasonable不合理. But if you do,
302
892224
2993
或許你不會不理性。但如果有的話,
15:11
or if you've been sick生病, then maybe you would.
303
895217
2207
或者你生病了,那你可能會。
15:13
And we're going to be able能夠 to do an experiment實驗 in the next下一個 several一些 months個月
304
897424
3088
我們將能在接下來幾個月做一個實驗,
15:16
that lets讓我們 us know exactly究竟 how many許多 unreasonable不合理 people are out there.
305
900512
3157
讓我們知道世上究竟有多少不理性的人。
15:19
So this is the Athena雅典娜 Breast乳房 Health健康 Network網絡. It's a study研究
306
903669
2122
這是雅典娜胸部健康網絡,
15:21
of 150,000 women婦女 in California加州, and they're going to
307
905791
3818
這是在加州15萬名女人的研究,他們將
15:25
return返回 all the data數據 to the participants參與者 of the study研究
308
909609
2718
歸還所有參與者在研究中的數據,
15:28
in a computable可計算 form形成, with one-clickability一個可點擊 to load加載 it into
309
912327
3146
用可計算得格式,只要按下一個按鍵就可以將它傳到
15:31
the study研究 that I've put together一起. So we'll know exactly究竟
310
915473
2616
我匯集而成的研究中。於是我們將發現
15:33
how many許多 people are willing願意 to be unreasonable不合理.
311
918089
2304
究竟有多少人願意變得不理性。
15:36
So what I'd end結束 [with] is,
312
920393
2384
我的結語是,
15:38
the most beautiful美麗 thing I've learned學到了 since以來 I quit放棄 my job工作
313
922777
3320
自從我一年前辭職開始做這件事以來,
15:41
almost幾乎 a year ago to do this, is that it really doesn't take
314
926097
3383
當中最美妙的事是這真的不需要
15:45
very many許多 of us to achieve實現 spectacular壯觀 results結果.
315
929480
3808
我們當中很多人才能達到引人注目的結果。
15:49
You just have to be willing願意 to be unreasonable不合理,
316
933288
2712
你只需要願意不理性些,
15:51
and the risk風險 we're running賽跑 is not the risk風險 those 14 men男人
317
936000
2331
而我們會有的風險和以前這 14 個男人中
15:54
who got yellow黃色 fever發熱 ran. Right?
318
938331
1868
誰會得到黃熱病的風險不同。對吧?
15:56
It's to be naked, digitally數字, in public上市. So you know more
319
940199
2861
這必須要在公開場合數位化地裸身。所以你知道
15:58
about me and my health健康 than I know about you. It's asymmetric非對稱 now.
320
943060
3433
我和我的健康比我知道你得還要多。現在是不對稱的。
16:02
And being存在 naked and alone單獨 can be terrifying可怕的.
321
946493
3630
一個人裸身可以是非常嚇人的。
16:06
But to be naked in a group, voluntarily自行, can be quite相當 beautiful美麗.
322
950123
4467
但在一個團體內自願地裸身可以是非常美妙的。
16:10
And so it doesn't take all of us.
323
954590
1888
而且這不需要我們全部的人。
16:12
It just takes all of some of us. Thank you.
324
956478
3006
只需要我們當中部分的人。謝謝。
16:15
(Applause掌聲)
325
959484
5590
(掌聲)
Translated by Meilun Shih
Reviewed by Yingxue Sun

▲Back to top

ABOUT THE SPEAKER
John Wilbanks - Data Commons Advocate
Imagine the discoveries that could result from a giant pool of freely available health and genomic data. John Wilbanks is working to build it.

Why you should listen

Performing a medical or genomic experiment on a human requires informed consent and careful boundaries around privacy. But what if the data that results, once scrubbed of identifying marks, was released into the wild? At WeConsent.us, John Wilbanks thinks through the ethical and procedural steps to create an open, massive, mine-able database of data about health and genomics from many sources. One step: the Portable Legal Consent for Common Genomics Research (PLC-CGR), an experimental bioethics protocol that would allow any test subject to say, "Yes, once this experiment is over, you can use my data, anonymously, to answer any other questions you can think of." Compiling piles of test results in one place, Wilbanks suggests, would turn genetic info into big data--giving researchers the potential to spot patterns that simply aren't viewable up close. 

A campaigner for the wide adoption of data sharing in science, Wilbanks is also a Senior Fellow with the Kauffman Foundation, a Research Fellow at Lybba and supported by Sage Bionetworks

In February 2013, the US government responded to a We the People petition spearheaded by Wilbanks and signed by 65,000 people, and announced a plan to open up taxpayer-funded research data and make it available for free.

More profile about the speaker
John Wilbanks | Speaker | TED.com