What is Data Science? An Operational Definition based on Text Mining of Data Science Curricula

Zhiyong Zhang, Danyang Zhang
DOI: https://doi.org/10.35566/jbds/v1n1/p1
2021-01-01
Journal of Behavioral Data Science
Abstract: Data science has maintained its popularity for about 20 years. This study adopts a bottom-up approach to understand what data science is by analyzing the descriptions of courses offered by the data science programs in the United States. Through topic modeling, 14 topics are identified from the current curricula of 56 data science programs. These topics reiterate that data science is at the intersection of statistics, computer science, and substantive fields.