Classification model of air quality in Jakarta using decision tree algorithm based on air pollutant standard index

F M Putra,I S Sitanggang
DOI: https://doi.org/10.1088/1755-1315/528/1/012053
2020-07-01
IOP Conference Series: Earth and Environmental Science
Abstract:Abstract The level of air quality is getting lower because of high levels of air pollution in big cities. Big cities in Indonesia also experience air pollution, this is caused by the increase in road users who use motorized vehicle materials, industrial development, land burning, waste accumulation so that air quality changes quite dramatically. Daily air quality needs to be accurately measured and classified. Accurate classification results will help the government in making policy. The aim is to control pollution to get air quality standards that can be useful for survival, especially in Jakarta. Air pollutants contain various components of elemental compounds such as carbon monoxide (CO), nitrogen dioxide (NO;), sulfur dioxide (SO;), particulate matter (PM, ), ozone (O s ) and nitrogen monoxide (NO) This study aims to determine the parameters that affect air quality in Jakarta using the C5.0 algorithm and Random Forest based on the Air Pollution Standard Index (ISPU) category. The classification algorithms used are C5.0 and Random Forest which are categorized in the Decision Tree model. C5.0 also produces rule-based models. The accuracy of the decision tree model and the rule-based model from C5.0 and Random Forest on the dataset of 2017 is 99.74%, 99.22%, and 99.97% with 1412 training data and 389 testing data. The accuracy of the decision tree model and the rule-based model from C5.0 and Random Forest on the dataset of 2018 is 98.28%, 98.85%, and 97.42% with 1439 training data and 349 testing data. The most important variable to classify air quality is Ozone (O=) and air quality in Jakarta is dominated by the Moderate category in 2017 and 2018.
What problem does this paper attempt to address?