Riccardo Sabatini: How to read the genome and build a human being
里卡多.薩巴提尼: 如何讀懂基因組並建造人類
Riccardo Sabatini applies his expertise in numerical modeling and data to projects ranging from material science to computational genomics and food market predictions. Full bio
Double-click the English transcript below to play the video.
I'm going to take you on a journey
我要帶各位進行一段冒險之旅,
the biggest dream of humanity:
many, many years ago
要拉回到好幾好幾年前,
raw material, some energy,
再加上點能量,
that was not there before.
以前從未存在過的任何東西。
I was coming back home
有天回到家裡時,
always knew a 3D printer.
就有一台 3D 印表機。
my father and my mom in this case,
指的是我媽跟我爸之間的投入,
in the same media, that is food,
discovering that she was a 3D printer,
就是一台3D列印機之外,
by that piece,
這個吸引注了,
at the beginning
as a gigantic Lego piece.
一個大型的樂高玩具,
blocks are little atoms
樂高積木就是一個原子,
a carbon here, a nitrogen here.
氮原子在這。
that compose a human being,
人類的原子清單的數量,
quite an astonishing number.
drive to assemble a little baby,
裡面的原子數數量,
of thumb drives --
組裝起一個人類,
2000台鐵達尼號這麼大。
a pregnant lady,
看到懷孕的婦女,
amount of information
anything you heard of.
忘了你曾聽過的。
of information that exists.
最大數據資料。
年輕的物理學家還聰明,
than a young physicist,
to pack this information
負責管理包裹這個資訊的
when Rosalind Franklin,
--羅莎琳.富蘭克林--
to finally poke inside a human cell,
最後才戳進人類細胞裡
a fairly simple alphabet,
you need three billion of them.
你需要30億個字母。
any sense as a number, right?
真的很沒有概念,對吧?
I could explain myself better
才讓人比較容易了解。
I'm going to have some help,
我最好找個人來幫忙,
introduce the code
的最佳人選,
to sequence it, Dr. Craig Venter.
克萊格.凡特博士。
克萊格.凡特博士上台。
接著一個字地列印出來:
from the United States to Canada
Lulu.com, a start-up, did everything.
他們幫我做的這一切。
of what is the code of life.
的視覺感受。
I can do something fun.
我要做件有趣的事。
挑一段來讀一讀。
book ... like this one.
書兒,比如這本。
it's a fairly big book.
what is the code of life.
the color of the eyes to Craig.
克萊格的眼睛顏色。
more complicated.
two letters in this position --
剛好漏掉兩個字母--
to a terrible disease:
得了一個恐佈的疾病:
we don't know how to solve it,
我們不知道如何解決,
of difference from what we are.
的差異而已。
看到一些嘆為觀止的事情。
me, me and you, you --
成就你我不同的地方
字母的差異,
is the miracle of life that you are.
行塑了你是甚麼樣的人,
when we think that we are different.
讓我們再反思一下,
真的有這麼多。
at assembling Swedish furniture,
看說明書組裝瑞典的家具,
教你如何破解你的人生。
is nothing you can crack in your life.
克雷格.文特爾本人,
we can learn from these books,
學習每樣東西,
of personalized medicine,
should be done to have better health
才能更健康,
裡面的秘密。
and many, many more people,
還有其他很多、很多的人,
called machine learning.
「機械自主學習」的概念。
thousands of them.
成千上萬的基因組——
the biggest database of human beings:
人類最大的資料庫:
everything you can think of.
你能想到的每樣東西。
被自主翻譯出來後
我們建立了一台機器。
and we train a machine --
many, many machines --
是很多很多台機器——
the genome in a phenotype.
基因組的生物特徵表象。
and what do they do?
它們有甚麼作用?
be used for everything,
is particularly complicated.
它就特別複雜。
to build different challenges.
我們想建立不一樣的挑戰。
from common traits.
because they are common,
因為它們都很普遍。
and predict your height?
就可以知道你的身高嗎?
你的生活形式有關,
eight kilograms of precision.
將預測誤差控制在 8 公斤以內。
the code changes during your life.
你的基因碼也會更著改變。
it gets insertions.
並把它模擬出來。
among millions of these letters.
上百萬個這種字母。
a very well-defined object.
完整的堆疊系統,
a machine what a face is,
人臉是甚麼,
with machine learning,
we read the first sequence --
我們讀取到第一個序列--
to see some signals.
coming in our lab.
進來我們實驗室的實驗對象。
we reduce the complexity,
我們減少了複雜度,
你的臉上原貌呈現出來--
and asymmetries come from your life.
來自於你後天的生活方式。
and we run our algorithm.
拿去跑我們的演算法。
from the blood.
left and right, left and right,
左看看、右看看做比較,
those pictures to be identical.
這些照片是一致的。
another exercise, to be honest.
這次要誠實。
comes from gender,
the ethnicity component of a human.
人類種族族群。
is much more complicated.
even in the differences,
that we are in the right ballpark,
我們預測還算不錯,
that comes in place,
the complete cranial structure,
跑完整個頭蓋骨結構,
in the training of the machine.
訓練的機器裡面出現過。
外面隨機取樣的。
probably never believe.
in a scientific publication,
發表這一切了,
Chris challenged me.
克里斯就挑戰我說,
and tried to predict
and believe me, you have no idea
相信我,你們絕對不知道
this blood now, here --
of biological information
全部的生物資訊,
and I'm going to do it with you.
下次再做給大家看。
all the understanding we have.
所有我們知道的東西,
we predicted he's a male.
我們預測出他是位男士。
the subject is 82.
實際上是82公斤。
and peculiar ethnicity.
they never fit in models.
他們從來不會跟我們的預測相符。
is a complex corner case for our model.
就是一個很複雜的特殊案例。
a lot to recognize people
來辨認人的特徵,
寫到基因組裡面。
我就是長這樣。
but my beard cut.
而是我的鬍鬚。
in this case, transfer it --
我會把它轉變一下--
than Photoshop, no modeling --
much, much better in the feeling.
for predicting height
out of your blood.
and the same approach,
這些科技、方法、
我們要如何進行工作、
研究人員共同的挑戰。
researchers around the world.
from a statistical approach
傳統的統計方法,
of exactly how you are.
complicated challenge,
in the world on this topic.
就在這個主題裡面。
be confronted with decisions
的決策的能力就越強,
inner detail on how life works.
生命如何運作的內部細節。
that cannot be confined
we're building as a humanity.
我們要建構的人類未來。
with artists, with philosophers,
藝術家、哲學家
that we make in the next year
ABOUT THE SPEAKER
Riccardo Sabatini - Scientist, entrepreneurRiccardo Sabatini applies his expertise in numerical modeling and data to projects ranging from material science to computational genomics and food market predictions.
Why you should listen
Data scientist Riccardo Sabatini harnesses numerical methods for a surprising variety of fields, from material science research to the study of food commodities (as a past director of the EU research project FoodCAST). His most recent research centers on computational genomics and how to crack the code of life.
In addition to his data research, Sabatini is deeply involved in education for entrepreneurs. He is the founder and co-director of the Quantum ESPRESSO Foundation, an advisor in several data-driven startups, and funder of The HUB Trieste, a social impact accelerator.
Riccardo Sabatini | Speaker | TED.com