Area and Feature Guided Regularised Random Forest: a novel method for predictive modelling of binary phenomena. The case of illegal landfill in Canary Island

Lorenzo Carlos Quesada-Ruiz, Victor Francisco Rodriguez-Galiano, Raúl Zurita-Milla, Emma Izquierdo-Verdiguier
DOI: https://doi.org/10.1080/13658816.2022.2075879
2022-06-10
International Journal of Geographical Information Science
Abstract: This paper presents a novel method, Area and Feature Guided Regularised Random Forest (AFGRRF), for modelling binary geographic phenomena (occurrence versus absence). AFGRRF is a wrapper feature-selection method based on an earlier modification of Random Forest (RF), namely the Guided Regularised Random Forest (GRRF). AFGRRF produces maps that minimise the affected area without a significant difference in accuracy. To do so, it tunes the GRRF hyper-parameters according to a trade-off between the True Positive Rate and the affected area (Success Rate). AFGRRF also addresses the 'Rashomon effect', that is, the multiplicity of good models. The proposed method was tested by modelling illegal landfills on Gran Canaria Island (Spain). AFGRRF performance was compared to that of other RF-based methods: (i) standard RF; (ii) Area Random Forest (ARF); (iii) Feature Random Forest (FRF); (iv) Area Feature Random Forest (AFRF); and (v) GRRF. AFGRRF predicted the smallest affected area, 19% of the island, at a similar True Positive Rate. This percentage is substantially smaller than those predicted by RF (27.43%), ARF (26%), FRF (27.78%), AFRF (23%) and GRRF (29.67%).
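The trade-off described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the candidate dictionaries and the fixed `tpr_tolerance` are hypothetical, and the tolerance is only a stand-in for the "no significant difference in accuracy" criterion, whose exact test is not given here. Among candidate models (for example, different GRRF hyper-parameter settings), the sketch keeps those whose True Positive Rate is close to the best one and returns the one that flags the smallest fraction of the study area.

```python
from typing import Dict, List, Sequence


def true_positive_rate(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Share of observed occurrences (label 1) that the model predicts as 1."""
    hits = [p for t, p in zip(y_true, y_pred) if t == 1]
    if not hits:
        raise ValueError("y_true contains no positive samples")
    return sum(hits) / len(hits)


def affected_area_fraction(map_pred: Sequence[int]) -> float:
    """Share of map cells predicted as occurrence (1), assuming equal-area cells."""
    if not map_pred:
        raise ValueError("map_pred is empty")
    return sum(map_pred) / len(map_pred)


def select_candidate(candidates: List[Dict], tpr_tolerance: float = 0.02) -> Dict:
    """Pick the smallest-area candidate among those with near-best TPR.

    Each candidate is a dict with hypothetical keys 'name', 'tpr' and 'area'.
    The tolerance is an assumption replacing a formal significance test.
    """
    best_tpr = max(c["tpr"] for c in candidates)
    eligible = [c for c in candidates if c["tpr"] >= best_tpr - tpr_tolerance]
    return min(eligible, key=lambda c: c["area"])


if __name__ == "__main__":
    # Made-up values for illustration only, not results from the paper.
    candidates = [
        {"name": "setting_a", "tpr": 0.90, "area": 0.30},
        {"name": "setting_b", "tpr": 0.89, "area": 0.20},
        {"name": "setting_c", "tpr": 0.70, "area": 0.10},
    ]
    print(select_candidate(candidates)["name"])
```

In this toy example the third candidate flags the least area but loses too much True Positive Rate, so it is excluded before the area comparison is made.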
Geography, Physical; Computer Science, Information Systems; Information Science & Library Science