life and health; big data; current status; prospect
The life and health big data is an important resource of Chinese population health and biosafety. Currently, China's data are suffering from a severe drain and sovereignty loss, the data security cannot be guaranteed, and the efficiency of data reuse is extremely low. Thus, the construction of a national data sharing platform is urgent and should be accelerated. By developing new methods for multiple sources and proactive data collection, new mechanisms of mutual benefit and win-win data sharing and new technologies of highly efficient and intelligent data parsing, we need to establish a system for life and health big data collection, management, sharing, and application. The system will serve scientific research institutes, universities, hospitals, enterprises, and the broad masses of the people, and greatly contribute to China's economic and social development and the improvement of people's wellbeing.
Bulletin of Chinese Academy of Sciences
Stephens Z D, Lee S Y, Faghri F, et al. Big Data:Astronomical or Genomical? PLoS Biology, 2015, 13(7):e1002195.
Chen R, Mias G I, Li-Pook-Than J, et al. Personal omics profiling reveals dynamic molecular and medical phenotypes. Cell, 2012, 148(6):1293-1307.
Gao W, Emaminejad S, Nyein H Y Y, et al. Fully integrated wearable sensor arrays for multiplexed in situ perspiration analysis. Nature, 2016, 529(7587):509-514.
Esteva A, Kuprel B, Novoa R A, et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature, 2017, 542(7639):115-118.
Nebbioso A, Tambaro F P, Dell'Aversana C, et al. Cancer epigenetics:Moving forward. PLoS Genet, 2018, 14(6):e1007362.
Vogel G. German law allows use of DNA to predict suspects' looks. Science, 2018, 360(6391):841-842.
Genomes Project Consortium, Abecasis G R, Altshuler D, et al. A map of human genome variation from population-scale sequencing. Nature, 2010, 467(7319):1061-1073.
Cancer Genome Atlas Research Network. Comprehensive genomic characterization defines human glioblastoma genes and core pathways. Nature, 2008, 455(7216):1061-1068.
Cancer Genome Atlas Research Network. Integrated genomic analyses of ovarian carcinoma. Nature, 2011, 474(7353):609-615.
Cancer Genome Atlas Research Network, Weinstein J N, Collisson E A, et al. The Cancer Genome Atlas Pan-Cancer analysis project. Nature Genetics, 2013, 45(10):1113-1120.
Turnbull C, Scott R H, Thomas E, et al. The 100 000 Genomes Project:Bringing whole genome sequencing to the NHS. BmjBritish Medical Journal, 2018, 361:k1687.
葛百川, 彭建雄, 刘冰. DNA数据库实战应用战法体系与能力建设研究.刑事技术, 2016, 41(4):259-264.
BIG Data Center Members. The BIG Data Center:From deposition to integration to translation. Nucleic Acids Research, 2017, 45(D1):D18-D24.
BIG Data Center Members. Database Resources of the BIG Data Center in 2018. Nucleic Acids Research, 2018, 46(D1):D14-D20.
Wang Y, Song F, Zhu J, et al. GSA:Genome Sequence Archive. Genomics Proteomics & Bioinformatics, 2017, 15(1):14-18.
Rigden D J, Fernandez X M. The 2018 Nucleic Acids Research database issue and the online molecular biology database collection. Nucleic Acids Res, 2018, 46(D1):D1-D7.
Yiming, BAO and Yongbiao, XUE
"Current Status and Prospect of Life and Health Big Data,"
Bulletin of Chinese Academy of Sciences (Chinese Version): Vol. 33
, Article 14.
Available at: https://bulletinofcas.researchcommons.org/journal/vol33/iss8/14