Water quality prediction and classification based on principal component regression and gradient boosting classifier approach

Saikat Islam Khan, Nazrul Islam, Jia Uddin, Sifatul Islam, Mostofa Kamal Nasir, Md. Saikat Islam Khan
DOI: https://doi.org/10.1016/j.jksuci.2021.06.003
2021-06-01
Abstract: Estimating water quality has been one of the significant challenges faced by the world in recent decades. This paper presents a water quality prediction model that uses the principal component regression technique. First, the water quality index (WQI) is calculated using the weighted arithmetic index method. Second, principal component analysis (PCA) is applied to the dataset to extract the most dominant WQI parameters. Third, different regression algorithms are applied to the PCA output to predict the WQI. Finally, a Gradient Boosting Classifier is used to classify the water quality status. The proposed system is evaluated experimentally on a dataset related to Gulshan Lake. The results show 95% prediction accuracy for the principal component regression method and 100% classification accuracy for the Gradient Boosting Classifier, which is credible performance compared with state-of-the-art models.
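The first step named in the abstract, the weighted arithmetic index method, can be illustrated with a minimal sketch. The abstract does not give the formula or the parameter set, so the sketch assumes the commonly cited form of the method: a quality rating q_i = 100 (V_i - V_0) / (S_i - V_0), a unit weight w_i = K / S_i with K = 1 / Σ(1 / S_i), and WQI = Σ(q_i w_i) / Σ w_i. The function name, the parameter names, and every numeric value below are hypothetical placeholders, not values taken from the paper or from the Gulshan Lake dataset.

```python
from typing import Dict, Optional


def weighted_arithmetic_wqi(
    measured: Dict[str, float],
    standards: Dict[str, float],
    ideals: Optional[Dict[str, float]] = None,
) -> float:
    """Compute a WQI with the weighted arithmetic index method (assumed form).

    measured  -- observed value V_i for each parameter
    standards -- permissible standard value S_i for each parameter
    ideals    -- ideal value V_0 for each parameter (defaults to 0.0)
    """
    ideals = ideals or {}

    # Proportionality constant K, chosen so the unit weights sum to 1.
    k = 1.0 / sum(1.0 / standards[name] for name in measured)

    weighted_rating_sum = 0.0
    weight_sum = 0.0
    for name, value in measured.items():
        s_i = standards[name]
        v_0 = ideals.get(name, 0.0)
        q_i = 100.0 * (value - v_0) / (s_i - v_0)  # quality rating
        w_i = k / s_i                              # unit weight
        weighted_rating_sum += q_i * w_i
        weight_sum += w_i

    return weighted_rating_sum / weight_sum


# Hypothetical standards and ideal values, for illustration only.
STANDARDS = {"pH": 8.5, "DO": 5.0, "BOD": 5.0}
IDEALS = {"pH": 7.0, "DO": 14.6, "BOD": 0.0}

# Hypothetical sample; not a measurement from the paper's dataset.
sample = {"pH": 7.8, "DO": 6.2, "BOD": 3.1}
sample_wqi = weighted_arithmetic_wqi(sample, STANDARDS, IDEALS)
```

Under this form, a sample in which every parameter sits at its ideal value scores 0 and one in which every parameter sits at its standard limit scores 100, so lower values indicate better water. In the pipeline the abstract describes, an index computed this way would serve as the regression target for the PCA-transformed features.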
computer science, information systems