Data science in large cohort studies

Canqing Yu,Liming Li
DOI: https://doi.org/10.3760/cma.j.issn.0254-6450.2019.01.001
2019-01-01
Abstract:Large cohort study gained its popularity in biomedical research and demonstrated its application in exploring disease etiology and pathogenesis,improving the prognosis of disease,as well as reducing the burden of diseases.Data science is an interdisciplinary field that uses scientific methods from computer science and statistics to extract insights or knowledge from data in a specific domain.The results from the combination of the two would provide new evidence for developing the strategies and measures on disease prevention and control.This review included a brief introduction of data science,descriptions on characteristics of large cohort data according to the development of the study design,and application of data science at each stage of a large cohort study,as well as prospected the application of data science in the future large cohort studies.
What problem does this paper attempt to address?