What Is Data Science? An Operational Definition Based on Text Mining of Data Science Curricula
Keywords: Data Science, Topic Modeling, Data Science Curriculum
Data science has remained popular for about 20 years. This study takes a bottom-up approach to understanding what data science is by analyzing the descriptions of courses offered by data science programs in the United States. Through topic modeling, 14 topics are identified from the current curricula of 56 data science programs. These topics reaffirm that data science lies at the intersection of statistics, computer science, and substantive fields.