Comparing predictive performance of general regression neural network (GRNN) and hedonic regression model for factors affecting housing prices in “Pune-India”

Abhijat Arun Abhyankar,Harish Kumar Singla
DOI: https://doi.org/10.1108/ijhma-01-2021-0003
2021-06-10
International Journal of Housing Markets and Analysis
Abstract:Purpose The purpose of this study is to compare the predictive performance of the hedonic multivariate regression model with the probabilistic neural network (PNN)-based general regression neural network (GRNN) model of housing prices in “Pune-India.” Design/methodology/approach Data on 211 properties across “Pune city-India” is collected. The price per square feet is considered as a dependent variable whereas distances from important landmarks such as railway station, fort, university, airport, hospital, temple, parks, solid waste site and stadium are considered as independent variables along with a dummy for amenities. The data is analyzed using a hedonic type multivariate regression model and GRNN. The GRNN divides the entire data set into two sets, namely, training set and testing set and establishes a functional relationship between the dependent and target variables based on the probability density function of the training data (Alomair and Garrouch, 2016). Findings While comparing the performance of the hedonic multivariate regression model and PNN-based GRNN, the study finds that the output variable (i.e. price) has been accurately predicted by the GRNN model. All the 42 observations of the testing set are correctly classified giving an accuracy rate of 100%. According to Cortez (2015), a value close to 100% indicates that the model can correctly classify the test data set. Further, the root mean square error (RMSE) value for the final testing for the GRNN model is 0.089 compared to 0.146 for the hedonic multivariate regression model. A lesser value of RMSE indicates that the model contains smaller errors and is a better fit. Therefore, it is concluded that GRNN is a better model to predict the housing price functions. The distance from the solid waste site has the highest degree of variable senstivity impact on the housing prices (22.59%) followed by distance from university (17.78%) and fort (17.73%). Research limitations/implications The study being a “case” is restricted to a particular geographic location hence, the findings of the study cannot be generalized. Further, as the objective of the study is restricted to just to compare the predictive performance of two models, it is felt appropriate to restrict the scope of work by focusing only on “location specific hedonic factors,” as determinants of housing prices. Practical implications The study opens up a new dimension for scholars working in the field of housing prices/valuation. Authors do not rule out the use of traditional statistical techniques such as ordinary least square regression but strongly recommend that it is high time scholars use advanced statistical methods to develop the domain. The application of GRNN, artificial intelligence or other techniques such as auto regressive integrated moving average and vector auto regression modeling helps analyze the data in a much more sophisticated manner and help come up with more robust and conclusive evidence. Originality/value To the best of the author’s knowledge, it is the first case study that compares the predictive performance of the hedonic multivariate regression model with the PNN-based GRNN model for housing prices in India.
English Else
What problem does this paper attempt to address?