C. Romero, S. Ventura
Abstract:This survey is an updated and improved version of the previous one published
in 2013 in this journal with the title data mining in education. It reviews in
a comprehensible and very general way how Educational Data Mining and Learning
Analytics have been applied over educational data. In the last decade, this
research area has evolved enormously and a wide range of related terms are now
used in the bibliography such as Academic Analytics, Institutional Analytics,
Teaching Analytics, Data-Driven Education, Data-Driven Decision-Making in
Education, Big Data in Education, and Educational Data Science. This paper
provides the current state of the art by reviewing the main publications, the
key milestones, the knowledge discovery cycle, the main educational
environments, the specific tools, the free available datasets, the most used
methods, the main objectives, and the future trends in this research area.
What problem does this paper attempt to address?
This paper is a recent review of the field of Educational Data Mining (EDM) and Learning Analytics (LA). Since the last survey in 2013, this research field has experienced significant development, leading to the emergence of new terms such as academic analytics, teaching analytics, and data-driven education. These areas all focus on how to use educational data to improve the quality of education and learning outcomes.
The paper provides an overview of the current research status, reviewing major publications, key milestones, the knowledge discovery cycle, major educational environments, specific tools, free datasets, commonly used methods, major goals, and future trends. Educational Data Mining focuses on developing methods to explore unique data in educational environments, while Learning Analytics focuses on understanding and optimizing the learning process, emphasizing data-driven decision making.
The paper also mentions other concepts related to these two fields, such as academic analytics, teaching analytics, data-driven education, the application of big data in education, and educational data science. Although these fields may have differences, their goals are all aimed at improving educational practices through data-intensive methods.
With the increasing availability of electronic learning resources and the establishment of student information databases, a large amount of educational data has been generated. Therefore, there is a need for tools to automatically analyze this data to reveal valuable information that can help understand student learning patterns. Despite differences in research focus and technological applications, the EDM and LA communities share common goals and methods.
Finally, the paper discusses various educational environments ranging from traditional classrooms to online education, as well as different systems used for data collection and analysis, such as Learning Management Systems (LMS), Intelligent Tutoring Systems (ITS), and Massive Open Online Courses (MOOCs). Additionally, it mentions methods for data preprocessing, analysis, and visualization, as well as tools and datasets used for educational research.