Comparisons of Different Methods Used for Second-Hand Car Price Prediction

Jian Chen,Fangfang Li,Jing Xu,Qing Wang,Qingzhen Han,Ming Yan
DOI: https://doi.org/10.1117/12.2638739
2022-01-01
Abstract:By establishing correlation coefficient matrix, huge sample data of second-hand car trading was processed so that, the irrelevant variables were deleted, as well as the missing and outlier values were handled. Then, the main variables have been extracted by using Xgboost algorithm. 13 of 36 major characteristic variables affecting the second-hand car price were filtered out according to their importance ranking, which include the mileage, tradeTime, brand, model et al. With the selected variables as independent variables and the price of second-hand car as dependent variable, the BP neural network model, linear regression model and random forest model were established to predict the price of second-hand cars. Finally, the predicting results were compared, which show that the fit goodness of random forest model is 0.992, and the model evaluation is 0.527, which gives the best performance.
What problem does this paper attempt to address?