Rent Price Prediction with Advanced Machine Learning Methods: A Comparison of California and Texas

Hong Ru Chan
DOI: https://doi.org/10.54097/84vvv580
2024-03-13
Abstract:The forecast of rent prices in dynamic housing markets is of fundamental importance to renters, landlords, investors, and politicians alike. Machine learning models offer flexibility, excel at modeling complex relationships, and provide outstanding forecast precision. This study compares advanced machine learning models, extreme gradient boosting regressor (XGBoost), light gradient boosting machine (LightGBM), random forest, ridge regression, and two ensemble approaches, to predict California and Texas rent prices. The two ensemble approaches include a hybrid regression of averaging base models and a 2-level stacked generalization model. The results revealed that a stacked generalization ensemble with base models random forest, XGBoost, LightGBM, and meta-model ridge regression achieved the best performance for the California dataset by generating the lowest MSE and highest R2 of 46116.3 and 0.8858, respectively. In contrast, random forest outperformed both ensemble models with the lowest MSE and MAPE of 18401.93 and 9.7003%, respectively, and the highest R2 of 0.7992. These methodologies can assess future rental property worth, serve as indicators for market dynamics, and aid in establishing real estate policies, thereby providing practical guidance to individuals, businesses, and policymakers.
What problem does this paper attempt to address?