•  
  •  
 

Bulletin of Chinese Academy of Sciences (Chinese Version)

Keywords

big data; scientific big data; big earth data; data-intensive science

Document Type

Article

Abstract

Big data occupies the strategic high ground in the era of knowledge economies and also constitutes a new national and global strategic resource. It is a new pattern for scientific discovery with less dependence on causality and heavy dependence on data correlation. It has become a data-intensive scientific paradigm, following previous paradigms of empirical, theoretical and computational science. The paradigm has shifted the methodology of scientific research from theories and models based on causal analysis to comprehensive mechanistic scientific discovery including correlation analysis. As a branch of big data, scientific big data includes internal characteristics such as non-repeatability, high uncertainty, high dimensionality, and computational complexity. External characteristics include data type, data volume, data acquisition, and data analysis. All these characteristics bring new challenges for the techniques and methods of processing scientific big data. On the basis of the above analysis, we raise four recommendations:scientific cognition of scientific big data, construction of scientific big data infrastructure, establishment of a scientific data research center, and the structuring of a scientific big data academic platform.

First page

768

Last Page

773

Language

Chinese

Publisher

Bulletin of Chinese Academy of Sciences

References

Guo H D, Wang L Z, Chen F, et al. Scientific big data and digital earth. Science Bulletin, 2014, 59(35):5066-5073.

郭华东, 陈润生, 徐志伟, 等.自然科学与人文科学大数据——第六届中德前沿探索圆桌会议综述.中国科学院院刊, 2016, 31(6):707-716.

Reinsel D, Gantz J, Rydning J. Data age 2025: The evolution of data to life-critical don't focus on big data. Framingham: IDC Analyze the Future, 2017.

Gantz J, Reinsel D. The digital universe in 2020: big data, bigger digital shadows, and biggest growth in the far east. Framingham: IDC Analyze the Future, 2014.

GRDI2020. Global Research Data Infrastructures: Towards a 10-year vision for global research data infrastructures. [2018-08-16]. http://www.grdi2020.eu/Repository/FileScaricati/6bdc07fb-b21d-4b90-81d4-d909fdb96b87.pdf.

李学龙, 龚海刚.大数据系统综述.中国科学:信息科学, 2015, 45(1):1-44.

郭华东.大数据、大科学、大发现——大数据与科学发现国际研讨会综述.中国科学院院刊, 2014, 29(4):500-506.

Hey T, Tansley S, Tolle K. The fourth paradigm:Data-intensive scientific discovery. Washington DC:Microsoft Research, 2009.

郭华东, 王力哲, 陈方, 等.科学大数据与数字地球.科学通报, 2014, 59(12):1047-1054.

Guo H D. Steps to the digital Silk Road. Nature, 2018, 554:25-27.

Guo H D. Big Earth data:A new frontier in Earth and information sciences. Big Earth Data, 2017, 1(1-2):4-20.

Guo H D, Liu J, Qiu Y B, et al. The Digital Belt and Road program in support of regional sustainability. International Journal of Digital Earth, 2018, 11(7):657-669.

中华人民共和国国务院. 国务院关于印发促进大数据发展行动纲要的通知. [2015-09-05]. http://www.gov.cn/zhengce/content/2015-09/05/content_10137.htm.

Share

COinS