TEDGlobal 2012

John Wilbanks: Let's pool our medical data

When you're getting medical treatment, or taking part in medical testing, privacy is important; strict laws limit what researchers can see and know about you. But what if your medical data could be used -- anonymously -- by anyone seeking to test a hypothesis? John Wilbanks wonders if the desire to protect our privacy is slowing research, and if opening up medical data could lead to a wave of health care innovation.


So I have bad news, I have good news,
00:15
and I have a task.
00:19
So the bad news is that we all get sick.
00:20
I get sick. You get sick.
00:23
And every one of us gets sick, and the question really is,
00:26
how sick do we get? Is it something that kills us?
00:28
Is it something that we survive?
00:31
Is it something that we can treat?
00:32
And we've gotten sick as long as we've been people.
00:34
And so we've always looked for reasons to explain why we get sick.
00:38
And for a long time, it was the gods, right?
00:41
The gods are angry with me, or the gods are testing me,
00:43
right? Or God, singular, more recently,
00:46
is punishing me or judging me.
00:49
And as long as we've looked for explanations,
00:51
we've wound up with something that gets closer and closer to science,
00:54
which is hypotheses as to why we get sick,
00:58
and as long as we've had hypotheses about why we get sick, we've tried to treat it as well.
01:00
So this is Avicenna. He wrote a book over a thousand years ago called "The Canon of Medicine,"
01:05
and the rules he laid out for testing medicines
01:09
are actually really similar to the rules we have today,
01:11
that the disease and the medicine must be the same strength,
01:13
the medicine needs to be pure, and in the end we need
01:16
to test it in people. And so if you put together these themes
01:18
of a narrative or a hypothesis in human testing,
01:22
right, you get some beautiful results,
01:26
even when we didn't have very good technologies.
01:29
This is a guy named Carlos Finlay. He had a hypothesis
01:30
that was way outside the box for his time, in the late 1800s.
01:33
He thought yellow fever was not transmitted by dirty clothing.
01:36
He thought it was transmitted by mosquitoes.
01:39
And they laughed at him. For 20 years, they called this guy
01:41
"the mosquito man." But he ran an experiment in people,
01:44
right? He had this hypothesis, and he tested it in people.
01:47
So he got volunteers to go move to Cuba and live in tents
01:50
and be voluntarily infected with yellow fever.
01:55
So some of the people in some of the tents had dirty clothes
01:58
and some of the people were in tents that were full
02:01
of mosquitoes that had been exposed to yellow fever.
02:02
And it definitively proved that it wasn't this magic dust
02:05
called fomites in your clothes that caused yellow fever.
02:08
But it wasn't until we tested it in people that we actually knew.
02:11
And this is what those people signed up for.
02:15
This is what it looked like to have yellow fever in Cuba
02:17
at that time. You suffered in a tent, in the heat, alone,
02:20
and you probably died.
02:24
But people volunteered for this.
02:27
And it's not just a cool example of a scientific design
02:30
of experiment in theory. They also did this beautiful thing.
02:34
They signed this document, and it's called an informed consent document.
02:36
And informed consent is an idea that we should be
02:40
very proud of as a society, right? It's something that
02:43
separates us from the Nazis tried at Nuremberg for
00:45
enforced medical experimentation. It's the idea
02:48
that agreement to join a study without understanding isn't agreement.
02:51
It's something that protects us from harm, from hucksters,
02:55
from people that would try to hoodwink us into a clinical
02:59
study that we don't understand, or that we don't agree to.
03:01
And so you put together the thread of narrative hypothesis,
03:05
experimentation in humans, and informed consent,
03:10
and you get what we call clinical study, and it's how we do
03:12
the vast majority of medical work. It doesn't really matter
03:15
if you're in the north, the south, the east, the west.
03:18
Clinical studies form the basis of how we investigate,
03:20
so if we're going to look at a new drug, right,
03:24
we test it in people, we draw blood, we do experiments,
03:26
and we gain consent for that study, to make sure
03:29
that we're not screwing people over as part of it.
03:31
But the world is changing around the clinical study,
03:34
which has been fairly well established for tens of years
03:38
if not 50 to 100 years.
03:41
So now we're able to gather data about our genomes,
03:43
but, as we saw earlier, our genomes aren't dispositive.
03:46
We're able to gather information about our environment.
03:49
And more importantly, we're able to gather information
03:52
about our choices, because it turns out that what we think of
03:54
as our health is more like the interaction of our bodies,
03:57
our genomes, our choices and our environment.
03:59
And the clinical methods that we've got aren't very good
04:03
at studying that because they are based on the idea
04:06
of person-to-person interaction. You interact
04:08
with your doctor and you get enrolled in the study.
04:10
So this is my grandfather. I actually never met him,
04:12
but he's holding my mom, and his genes are in me, right?
04:15
His choices ran through to me. He was a smoker,
04:19
like most people were. This is my son.
04:22
So my grandfather's genes go all the way through to him,
04:24
and my choices are going to affect his health.
04:28
The technology between these two pictures
04:30
cannot be more different, but the methodology
04:33
for clinical studies has not radically changed over that time period.
04:37
We just have better statistics.
04:41
The way we gain informed consent was formed in large part
04:43
after World War II, around the time that picture was taken.
04:47
That was 70 years ago, and the way we gain informed consent,
04:49
this tool that was created to protect us from harm,
04:53
now creates silos. So the data that we collect
04:56
for prostate cancer or for Alzheimer's trials
05:00
goes into silos where it can only be used
05:03
for prostate cancer or for Alzheimer's research.
05:05
Right? It can't be networked. It can't be integrated.
05:08
It cannot be used by people who aren't credentialed.
05:11
So a physicist can't get access to it without filing paperwork.
05:15
A computer scientist can't get access to it without filing paperwork.
05:18
Computer scientists aren't patient. They don't file paperwork.
05:21
And this is an accident. These are tools that we created
05:25
to protect us from harm, but what they're doing
05:29
is protecting us from innovation now.
05:32
And that wasn't the goal. It wasn't the point. Right?
05:35
It's a side effect, if you will, of a power we created
05:38
to take us for good.
05:41
And so if you think about it, the depressing thing is that
05:43
Facebook would never make a change to something
05:46
as important as an advertising algorithm
05:48
with a sample size as small as a Phase III clinical trial.
05:51
We cannot take the information from past trials
05:55
and put them together to form statistically significant samples.
05:59
And that sucks, right? So 45 percent of men develop
06:03
cancer. Thirty-eight percent of women develop cancer.
06:07
One in four men dies of cancer.
06:10
One in five women dies of cancer, at least in the United States.
06:12
And three out of the four drugs we give you
06:16
if you get cancer fail. And this is personal to me.
06:18
My sister is a cancer survivor.
06:21
My mother-in-law is a cancer survivor. Cancer sucks.
06:23
And when you have it, you don't have a lot of privacy
06:27
in the hospital. You're naked the vast majority of the time.
06:29
People you don't know come in and look at you and poke you and prod you,
06:33
and when I tell cancer survivors that this tool we created
06:36
to protect them is actually preventing their data from being used,
06:40
especially when only three to four percent of people
06:43
who have cancer ever even sign up for a clinical study,
06:45
their reaction is not, "Thank you, God, for protecting my privacy."
06:48
It's outrage
06:51
that we have this information and we can't use it.
06:54
And it's an accident.
06:56
So the cost in blood and treasure of this is enormous.
06:59
Two hundred and twenty-six billion a year is spent on cancer in the United States.
07:02
Fifteen hundred people a day die in the United States.
07:05
And it's getting worse.
07:08
So the good news is that some things have changed,
07:11
and the most important thing that's changed
07:14
is that we can now measure ourselves in ways
07:16
that used to be the dominion of the health system.
07:18
So a lot of people talk about it as digital exhaust.
07:21
I like to think of it as the dust that runs along behind my kid.
07:23
We can reach back and grab that dust,
07:26
and we can learn a lot about health from it, so if our choices
07:29
are part of our health, what we eat is a really important
07:31
aspect of our health. So you can do something very simple
07:34
and basic and take a picture of your food,
07:36
and if enough people do that, we can learn a lot about
07:38
how our food affects our health.
07:41
One interesting thing that came out of this — this is an app for iPhones called The Eatery —
07:43
is that we think our pizza is significantly healthier
07:47
than other people's pizza is. Okay? (Laughter)
07:50
And it seems like a trivial result, but this is the sort of research
07:53
that used to take the health system years
07:57
and hundreds of thousands of dollars to accomplish.
07:59
It was done in five months by a startup company of a couple of people.
08:01
I don't have any financial interest in it.
08:05
But more nontrivially, we can get our genotypes done,
08:08
and although our genotypes aren't dispositive, they give us clues.
08:10
So I could show you mine. It's just A's, T's, C's and G's.
08:13
This is the interpretation of it. As you can see,
08:16
I carry a 32 percent risk of prostate cancer,
08:18
22 percent risk of psoriasis and a 14 percent risk of Alzheimer's disease.
08:21
So that means, if you're a geneticist, you're freaking out,
08:25
going, "Oh my God, you told everyone you carry the ApoE E4 allele. What's wrong with you?"
08:28
Right? When I got these results, I started talking to doctors,
08:32
and they told me not to tell anyone, and my reaction is,
08:35
"Is that going to help anyone cure me when I get the disease?"
08:38
And no one could tell me yes.
08:41
And I live in a web world where, when you share things,
08:44
beautiful stuff happens, not bad stuff.
08:47
So I started putting this in my slide decks,
08:50
and I got even more obnoxious, and I went to my doctor,
08:51
and I said, "I'd like to actually get my bloodwork.
08:54
Please give me back my data." So this is my most recent bloodwork.
08:56
As you can see, I have high cholesterol.
08:59
I have particularly high bad cholesterol, and I have some
09:01
bad liver numbers, but those are because we had a dinner party with a lot of good wine
09:04
the night before we ran the test. (Laughter)
09:07
Right. But look at how non-computable this information is.
09:10
This is like the photograph of my granddad holding my mom
09:14
from a data perspective, and I had to go into the system
09:17
and get it out.
09:21
So the thing that I'm proposing we do here
09:23
is that we reach behind us and we grab the dust,
09:26
that we reach into our bodies and we grab the genotype,
09:28
and we reach into the medical system and we grab our records,
09:31
and we use it to build something together, which is a commons.
09:34
And there's been a lot of talk about commonses, right,
09:38
here, there, everywhere, right. A commons is nothing more
09:41
than a public good that we build out of private goods.
09:44
We do it voluntarily, and we do it through standardized
09:47
legal tools. We do it through standardized technologies.
09:49
Right. That's all a commons is. It's something that we build
09:52
together because we think it's important.
09:55
And a commons of data is something that's really unique,
09:58
because we make it from our own data. And although
10:01
a lot of people like privacy as their methodology of control
10:03
around data, and obsess around privacy, at least
10:06
some of us really like to share as a form of control,
10:08
and what's remarkable about digital commonses
10:11
is you don't need a big percentage if your sample size is big enough
10:13
to generate something massive and beautiful.
10:17
So not that many programmers write free software,
10:19
but we have the Apache web server.
10:22
Not that many people who read Wikipedia edit,
10:24
but it works. So as long as some people like to share
10:27
as their form of control, we can build a commons, as long as we can get the information out.
10:31
And in biology, the numbers are even better.
10:35
So Vanderbilt ran a study asking people, we'd like to take
10:37
your biosamples, your blood, and share them in a biobank,
10:40
and only five percent of the people opted out.
10:43
I'm from Tennessee. It's not the most science-positive state
10:45
in the United States of America. (Laughter)
10:48
But only five percent of the people wanted out.
10:51
So people like to share, if you give them the opportunity and the choice.
10:54
And the reason that I got obsessed with this, besides the obvious family aspects,
10:58
is that I spend a lot of time around mathematicians,
11:02
and mathematicians are drawn to places where there's a lot of data
11:06
because they can use it to tease signals out of noise.
11:09
And those correlations that they can tease out, they're not
11:11
necessarily causal agents, but math, in this day and age,
11:14
is like a giant set of power tools
11:18
that we're leaving on the floor, not plugged in in health,
11:21
while we use hand saws.
11:25
If we have a lot of shared genotypes, and a lot of shared
11:27
outcomes, and a lot of shared lifestyle choices,
11:31
and a lot of shared environmental information, we can start
11:34
to tease out the correlations between subtle variations
11:37
in people, the choices they make and the health that they create as a result of those choices,
11:40
and there's open-source infrastructure to do all of this.
11:45
Sage Bionetworks is a nonprofit that's built a giant math system
11:48
that's waiting for data, but there isn't any.
11:51
So that's what I do. I've actually started what we think is
11:55
the world's first fully digital, fully self-contributed,
11:59
unlimited in scope, global in participation, ethically approved
12:03
clinical research study where you contribute the data.
12:08
So if you reach behind yourself and you grab the dust,
12:12
if you reach into your body and grab your genome,
12:14
if you reach into the medical system and somehow extract your medical record,
12:17
you can actually go through an online informed consent process --
12:20
because the donation to the commons must be voluntary
12:23
and it must be informed -- and you can actually upload
12:26
your information and have it syndicated to the
12:28
mathematicians who will do this sort of big data research,
12:31
and the goal is to get 100,000 in the first year
12:34
and a million in the first five years so that we have
12:37
a statistically significant cohort that you can use to take
12:39
smaller sample sizes from traditional research
12:43
and map it against,
12:46
so that you can use it to tease out those subtle correlations
12:47
between the variations that make us unique
12:50
and the kinds of health that we need to move forward as a society.
12:53
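The arithmetic behind those enrollment targets can be sketched with a standard power calculation. This is an illustrative assumption, not a figure from the talk: it uses Fisher's z-transform to estimate how many participants are needed to detect a weak correlation between, say, a lifestyle variable and a health outcome, at the usual 5% significance and 80% power.

```python
import math

# Critical z-values for a conventional study design
# (these thresholds, and the effect sizes tried below,
# are illustrative assumptions, not numbers from the talk).
Z_ALPHA = 1.959964  # two-sided 5% significance
Z_BETA = 0.841621   # 80% power

def required_n(r: float) -> int:
    """Approximate participants needed to detect a correlation r,
    via Fisher's z-transform of the correlation coefficient."""
    c = 0.5 * math.log((1 + r) / (1 - r))  # Fisher z-transform
    return math.ceil(((Z_ALPHA + Z_BETA) / c) ** 2 + 3)

# A "subtle" correlation of r = 0.05 already needs ~3,000 people...
print(required_n(0.05))

# ...and r = 0.01 needs roughly 78,000 -- which is why a cohort of
# 100,000 self-contributed records can ask questions that a
# few-hundred-person traditional study cannot.
print(required_n(0.01))
```

The point of the sketch: effect sizes shrink quadratically into sample-size requirements, so the subtle variations the talk describes are only visible at commons scale.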
And I've spent a lot of time around other commons.
12:57
I've been around the early web. I've been around
13:00
the early creative commons world, and there's four things
13:02
that all of these share, which is, they're all really simple.
13:05
And so if you were to go to the website and enroll in this study,
13:08
you're not going to see something complicated.
13:11
But it's not simplistic. These things are weak intentionally,
13:13
right, because you can always add power and control to a system,
13:18
but it's very difficult to remove those things if you put them in at the beginning,
13:21
and so being simple doesn't mean being simplistic,
13:25
and being weak doesn't mean weakness.
13:28
Those are strengths in the system.
13:30
And open doesn't mean that there's no money.
13:32
Closed systems, corporations, make a lot of money
13:35
on the open web, and one of the reasons why the open web lives
13:38
is that corporations have a vested interest in the openness
13:42
of the system.
13:44
And so all of these things are part of the clinical study that we've created,
13:47
so you can actually come in, all you have to be is 14 years old,
13:51
willing to sign a contract that says I'm not going to be a jerk,
13:54
basically, and you're in.
13:56
You can start analyzing the data.
13:59
You do have to solve a CAPTCHA as well. (Laughter)
14:00
And if you'd like to build corporate structures on top of it,
14:04
that's okay too. That's all in the consent,
14:08
so if you don't like those terms, you don't come in.
14:11
It's very much the design principles of a commons
14:14
that we're trying to bring to health data.
14:17
And the other thing about these systems is that it only takes
14:19
a small number of really unreasonable people working together
14:22
to create them. It didn't take that many people
14:26
to make Wikipedia Wikipedia, or to keep it Wikipedia.
14:29
And we're not supposed to be unreasonable in health,
14:32
and so I hate this word "patient."
14:34
I don't like being patient when systems are broken,
14:37
and health care is broken.
14:40
I'm not talking about the politics of health care, I'm talking about the way we scientifically approach health care.
14:42
So I don't want to be patient. And the task I'm giving to you
14:46
is to not be patient. So I'd like you to actually try,
14:50
when you go home, to get your data.
14:53
You'll be shocked and offended and, I would bet, outraged,
14:56
at how hard it is to get it.
14:58
But it's a challenge that I hope you'll take,
15:01
and maybe you'll share it. Maybe you won't.
15:04
If you don't have anyone in your family who's sick,
15:06
maybe you wouldn't be unreasonable. But if you do,
15:08
or if you've been sick, then maybe you would.
15:11
And we're going to be able to do an experiment in the next several months
15:13
that lets us know exactly how many unreasonable people are out there.
15:16
So this is the Athena Breast Health Network. It's a study
15:19
of 150,000 women in California, and they're going to
15:21
return all the data to the participants of the study
15:25
in a computable form, with one-clickability to load it into
15:28
the study that I've put together. So we'll know exactly
15:31
how many people are willing to be unreasonable.
15:33
So what I'd end [with] is,
15:36
the most beautiful thing I've learned since I quit my job
15:38
almost a year ago to do this, is that it really doesn't take
15:41
very many of us to achieve spectacular results.
15:45
You just have to be willing to be unreasonable,
15:49
and the risk we're running is not the risk those 14 men
15:51
who got yellow fever ran. Right?
15:54
It's to be naked, digitally, in public. So you know more
15:56
about me and my health than I know about you. It's asymmetric now.
15:58
And being naked and alone can be terrifying.
16:02
But to be naked in a group, voluntarily, can be quite beautiful.
16:06
And so it doesn't take all of us.
16:10
It just takes all of some of us. Thank you.
16:12
(Applause)
16:15
Translated by Joseph Geni
Reviewed by Morton Bast


About the Speaker:

John Wilbanks - Data Commons Advocate
Imagine the discoveries that could result from a giant pool of freely available health and genomic data. John Wilbanks is working to build it.

Why you should listen

Performing a medical or genomic experiment on a human requires informed consent and careful boundaries around privacy. But what if the data that results, once scrubbed of identifying marks, was released into the wild? At WeConsent.us, John Wilbanks thinks through the ethical and procedural steps to create an open, massive, mine-able database of data about health and genomics from many sources. One step: the Portable Legal Consent for Common Genomics Research (PLC-CGR), an experimental bioethics protocol that would allow any test subject to say, "Yes, once this experiment is over, you can use my data, anonymously, to answer any other questions you can think of." Compiling piles of test results in one place, Wilbanks suggests, would turn genetic info into big data--giving researchers the potential to spot patterns that simply aren't viewable up close. 

A campaigner for the wide adoption of data sharing in science, Wilbanks is also a Senior Fellow with the Kauffman Foundation and a Research Fellow at Lybba, and is supported by Sage Bionetworks.

In February 2013, the US government responded to a We the People petition spearheaded by Wilbanks and signed by 65,000 people, and announced a plan to open up taxpayer-funded research data and make it available for free.
