Optimizing control variable selection with algorithms: Parsimony and precision in regression analysis

Fernando Campayo-Sanchez, Juan Luis Nicolau
DOI: https://doi.org/10.1177/13548166241287953
IF: 4.5817
2024-09-26
Tourism Economics (Ahead of Print)
Abstract: This research note explores the pivotal role of control variables in any tourism and hospitality research that utilizes regression models in statistical analyses. While theory-driven independent variables offer insight into expected effects, the inclusion of control variables is crucial for mitigating potential confounding factors. In an attempt to strike a balance between model complexity and parsimony, researchers face the challenge of selecting the optimal control variables. To address this issue, the study tests three alternative methods: genetic algorithms, lasso models, and the branch and bound algorithm. Despite their underutilization in tourism research, these methods offer efficient means of selecting control variables, enhancing model precision and interpretation without unnecessarily convoluting the model with irrelevant factors.
economics; hospitality, leisure, sport & tourism