Abstract:National land use policies and strategies worldwide have attempted to establish a healthy housing rental market towards urban sustainability. Monitoring fine-scale housing rental prices should provide essential implications for equitable housing policies. However, doing so remains a challenge because aggregated data were traditionally collected at a coarse scale through census or social surveys. On-line housing rental websites (OHRWs) have become popular social media platforms in the housing studies. This paper attempts to demonstrate how to monitor fine-scale housing rental prices based on OHRWs using the case of Shenzhen in China. Employing hedonic model, a set of housing rental determinants are initially selected from three characteristics (neighborhood, location and structure) and at three levels (nearest accessibility, 15-minute walking distance availability and sub-district availability). Housing rent prediction models are then established (respectively for October 2017 and February 2018) using the training samples collected from the OHRWs and six machine-learning algorithms, including random forest regression (RFR), extra-trees regression (ETR), gradient-boosting regression (GBR), support vector regression (SVR), multi-layer perceptron neural network (MLP-NN) and k-nearest neighbor algorithm (k-NN). Thereafter, the relative importance of the determinants is calculated and visualized using partial dependence plots. Finally, the models are used to monitor housing rental price dynamics for all of the communities within Shenzhen. Results show that all of the algorithms except SVR generally present good performance. Among them, RFR and ETR are the best one in October 2017 and February 2018, respectively. Concerning the spatial pattern of housing rental, the high-high clusters merge in the central districts, whereas the low-low clusters are located in the outskirts, and the growth rate is the greatest in the farthest outskirts from the central districts. Each determinant affects the housing rent across different scale and sub-district availability and nearest accessibility are more important than 15-minute walking distance availability. The two most influential determinants are sub-district job opportunity and nearest accessibility to health care facilities. The case of Shenzhen shows that the demonstrated framework, which integrates machine-learning algorithms and the hedonic modeling, is practical and efficient. The approach is believed to provide an essential tool to inform equitable housing policies.

Design and implementation of second-hand housing data statistical analysis system

Research on Data Collection and Analysis of Second Hand House in China Based on Python

A Visualization Data Analysis of Second-hand Houses in Zhuhai Based on Python and ECharts and Bootstrap

Understanding the Impacts of Public Facilities on Residential House Prices: Spatial Data-Driven Approach Applied in Hangzhou, China

Design and Implementation of Craweper Based on Scrapy

Spatial Variation of Housing Prices in Hangzhou City: Two-Dimensional Analysis Based on Hedonic Prices

Analysis and prediction of second-hand house price based on random forest

Design and Implementation of Crawler Program Based on Python

Design and Deployment of Django-based Housing Information Management System

An empirical analysis of second-hand house transaction prices based on machine learning

Research on the Spatial Distribution Characteristics and Built Environment Effects of Housing Prices in the Central Urban Area of Nanjing based on Big Data

Understanding Housing Prices Using Geographic Big Data: A Case Study in Shenzhen

Research on housing prices prediction based on multiple linear regression

The Analysis of Second-Hand Housing Price Influencing Factors Based on Hedonic Model and WEB Information

Monitoring housing rental prices based on social media:An integrated approach of machine-learning algorithms and hedonic modeling to inform equitable housing policies

Influence Factors and Regression Model of Urban Housing Prices Based on Internet Open Access Data

Spatiotemporal Analysis of Housing Prices in China: A Big Data Perspective

Computer mathematical statistics applied in the housing price investigation through machine learning and linear regression model

The Study On Adaptive Spatial Sampling Used In Allocation Of Housing Price Monitoring Sites - Taking Wujin Section Of Changzhou City As An Example

Market Segment and Hedonic Price Analysis of Urban Housing