Predicting travel mode choice with a robust neural network and Shapley additive explanations analysis
Li Tang,Chuanli Tang,Qi Fu,Changxi Ma
DOI: https://doi.org/10.1049/itr2.12514
IF: 2.7
2024-04-24
IET Intelligent Transport Systems
Abstract:This study uses a neural network with feature selection for travel mode choice prediction and Shapley additive explanations (SHAP) analysis for model interpretation. It highlights that by applying feature selection using a joint result from two embedded methods, a more robust model in neural networks that improves the overfitting problem in mode choice prediction was able to be developed. Additionally, interpreting the neural network using SHAP overcomes the limitation of neural network models not being interpretable. Predicting and understanding travellers' mode choices is crucial to developing urban transportation systems and formulating traffic demand management strategies. Machine learning (ML) methods have been widely used as promising alternatives to traditional discrete choice models owing to their high prediction accuracy. However, a significant body of ML methods, especially the branch of neural networks, is constrained by overfitting and a lack of model interpretability. This study employs a neural network with feature selection for predicting travel mode choices and Shapley additive explanations (SHAP) analysis for model interpretation. A dataset collected in Chengdu, China was used for experimentation. The results reveal that the neural network achieves commendable prediction performance, with a 12% improvement over the traditional multinomial logit model. Also, feature selection using a combined result from two embedded methods can alleviate the overfitting tendency of the neural network, while establishing a more robust model against redundant or unnecessary variables. Additionally, the SHAP analysis identifies factors such as travel expenditure, age, driving experience, number of cars owned, individual monthly income, and trip purpose as significant features in our dataset. The heterogeneity of mode choice behaviour is significant among demographic groups, including different age, car ownership, and income levels.
engineering, electrical & electronic,transportation science & technology