Developing Standards for Educational Datasets by School Level: A Framework for Sustainable K-12 Education

In-Seong Jeon,Shin-Yu Kim,Seong-Joo Kang
DOI: https://doi.org/10.3390/su16124954
IF: 3.9
2024-06-11
Sustainability
Abstract:As artificial intelligence (AI) and data science education gain importance in K-12 curricula, there is a growing need for well-designed sustainable educational datasets tailored to different school levels. Sustainable datasets should be reusable, adaptable, and accessible to support long-term AI and data science education goals. However, research on the systematic categorization of difficulty levels in educational datasets is limited. This study aims to address this gap by developing a framework for sustainable educational dataset standards based on learners' developmental stages and data preprocessing requirements. The proposed framework consists of five levels: Level 1 (grades 1–4), where data preprocessing is unnecessary; Level 2 (grades 5–6), involving basic data cleaning; Level 3 (grades 7–9), requiring attribute manipulation; Level 4 (grades 10–12), involving feature merging and advanced preprocessing; and Level 5 (teachers/adults), requiring the entire data science process. An expert validity survey was conducted with 22 elementary and secondary school teachers holding advanced degrees in AI education. The results showed high validity for Levels 1–4 but relatively lower validity for Level 5, suggesting the need for separate training and resources for teachers. Based on the CVR results and expert feedback, the standards for Educational Datasets were revised, particularly for Stage 5, which targets teachers and adult learners. The findings highlight the importance of expert validation, step-by-step experiences, and an interdisciplinary approach in developing educational datasets. This study contributes to the theoretical understanding of educational datasets and provides practical implications for teachers, students, educational institutions, and policymakers in implementing effective and sustainable AI and data science education in K-12 settings, ultimately fostering a more sustainable future.
environmental sciences,environmental studies,green & sustainable science & technology
What problem does this paper attempt to address?
This paper aims to address the challenges faced by K-12 education (kindergarten to high school) in AI and data science education, particularly the lack of sustainable education datasets suitable for different grade levels. The study proposes a framework for developing standardized education datasets based on learners' developmental stages and data preprocessing requirements. This framework consists of five levels, ranging from Level 1 where no data preprocessing is needed for the first grade to Level 5 involving the entire data science process for teachers or adults. Through an expert validation survey of 22 elementary and secondary school teachers with advanced degrees in AI education, the results show that Levels 1 to 4 are highly effective, but Level 5 requires separate training and resources. The paper points out that despite many countries worldwide incorporating AI into education policies, insufficient teacher training, limited educational resources, and the complexity of data preprocessing are the major obstacles currently faced. The study emphasizes the importance of expert validation, incremental experiences, and interdisciplinary approaches in developing education datasets. Through this approach, theoretical understanding can be provided for teachers, students, educational institutions, and policymakers, and practical guidance can be provided for the implementation of effective and sustainable AI and data science education, thus promoting a more sustainable future.