Hotel Booking Cancellation Prediction Using Applied Bayesian Models

Md Asifuzzaman Jishan,Vikas Singh,Ayan Kumar Ghosh,Md Shahabub Alam,Khan Raqib Mahmud,Bijan Paul
2024-10-24
Abstract:This study applies Bayesian models to predict hotel booking cancellations, a key challenge affecting resource allocation, revenue, and customer satisfaction in the hospitality industry. Using a Kaggle dataset with 36,285 observations and 17 features, Bayesian Logistic Regression and Beta-Binomial models were implemented. The logistic model, applied to 12 features and 5,000 randomly selected observations, outperformed the Beta-Binomial model in predictive accuracy. Key predictors included the number of adults, children, stay duration, lead time, car parking space, room type, and special requests. Model evaluation using Leave-One-Out Cross-Validation (LOO-CV) confirmed strong alignment between observed and predicted outcomes, demonstrating the model's robustness. Special requests and parking availability were found to be the strongest predictors of cancellation. This Bayesian approach provides a valuable tool for improving booking management and operational efficiency in the hotel industry.
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?
This paper attempts to solve the problem of hotel reservation cancellation prediction, which is a key challenge faced by the hotel industry. Specifically: 1. **Problem Background**: - The hotel industry is significantly affected by reservation cancellations in terms of resource allocation, revenue management, and customer satisfaction. - Traditional methods such as decision trees, logistic regression, and machine - learning algorithms, although providing a certain predictive ability, have limitations in dealing with uncertainty, dynamically updating predictions, and interpretability. 2. **Research Objectives**: - Apply Bayesian models (especially Bayesian logistic regression and beta - binomial models) to predict hotel reservation cancellations more accurately. - Improve the flexibility and adaptability of predictions by introducing prior knowledge and real - time data adjustment. 3. **Data Sources**: - Use a data set containing 36,285 observations and 17 features from the Kaggle platform for analysis. 4. **Main Contributions**: - Compare the predictive performance of Bayesian logistic regression and beta - binomial models. - Evaluate the effectiveness of the models through leave - one - out cross - validation (LOO - CV), and the results show that the Bayesian logistic regression model has better predictive accuracy. - Discover that special requirements and parking space availability are the strongest predictors of reservation cancellations. 5. **Practical Applications**: - Provide a new tool to help hotel managers better manage reservations, optimize resource allocation, and improve operational efficiency and service quality. Through these efforts, this research aims to provide the hotel industry with a more flexible and adaptable prediction framework to deal with the challenges brought by reservation cancellations.