AI-based betting anomaly detection system to ensure fairness in sports and prevent illegal gambling

Changgyun Kim, Jae-Hyeon Park, Ji-Yong Lee
DOI: https://doi.org/10.1038/s41598-024-57195-8
IF: 4.6
2024-03-19
Scientific Reports
Abstract: This study develops a solution to sports match-fixing using various machine-learning models to detect match-fixing anomalies, based on betting odds. We use five models to distinguish between normal and abnormal matches: logistic regression (LR), random forest (RF), support vector machine (SVM), the k-nearest neighbor (KNN) classification, and the ensemble model, a model optimized from the previous four. The models classify normal and abnormal matches by learning their patterns using sports betting odds data. The database was developed based on the world football league match betting data of 12 betting companies, which offered a vast collection of data on players, teams, game schedules, and league rankings for football matches. We develop an abnormal match detection model based on the data analysis results of each model, using the match result dividend data. We then use data from real-time matches and apply the five models to construct a system capable of detecting match-fixing in real time. The RF, KNN, and ensemble models recorded a high accuracy, over 92%, whereas the LR and SVM models were approximately 80% accurate. In comparison, previous studies have used a single model to examine football match betting odds data, with an accuracy of 70–80%.
multidisciplinary sciences
What problem does this paper attempt to address?
This paper addresses match-fixing in sports by developing an AI-based anomaly detection system intended to keep competitions fair and prevent illegal gambling. Specifically, the research aims to:

1. **Identify and prevent match-fixing in sports competitions**: By analyzing betting odds data with machine-learning models to detect abnormal patterns, potential match-fixing can be identified.
2. **Improve the accuracy of the detection system**: Where previous studies used a single model, this study uses multiple machine-learning models (logistic regression, random forest, support vector machine, k-nearest neighbor classification, and an ensemble model) to improve the accuracy and reliability of detection.
3. **Construct a real-time detection system**: Using betting data from world football leagues, build a system that detects anomalies in real time, so that potential match-fixing is discovered and dealt with promptly during actual competitions.

### Research Background

Sports competitions should take place in a fair environment, with results determined jointly by the athletes' performance and external factors. However, some people fix outcomes in advance through illegal means (such as using banned drugs or manipulating results), which seriously damages the spirit of sport and harms the industry. An effective match-fixing detection system is therefore particularly important.

### Main Methods

This study uses the following five machine-learning models to distinguish between normal and abnormal matches:

- **Logistic Regression (LR)**
- **Random Forest (RF)**
- **Support Vector Machine (SVM)**
- **k-Nearest Neighbor (KNN)**
- **Ensemble Model**

These models learn from betting odds data to identify matches that may involve match-fixing. The database used in the study includes world football league data from 12 betting companies, covering information such as players, teams, schedules, and league rankings.

### Model Performance

- The accuracy of the Random Forest (RF), k-Nearest Neighbor (KNN), and Ensemble models exceeds 92%.
- The accuracy of the Logistic Regression (LR) and Support Vector Machine (SVM) models is approximately 80%.

### Conclusion

By combining multiple machine-learning models with rich betting data, this study developed a high-accuracy match-fixing detection system that supports the fairness and integrity of sports competitions.
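
The idea under Main Methods of combining several base classifiers into an ensemble can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the two features, the synthetic data points, the thresholds, and the majority-vote rule are hypothetical and are not taken from the paper, which does not specify here how its ensemble is optimized. Only the KNN model is one of the study's five; the two threshold rules are simple stand-ins for the other base models.

```python
from collections import Counter
from math import dist

# Synthetic, illustrative training data (NOT from the paper).
# Hypothetical features per match:
#   (largest relative odds move before kickoff, std. deviation of the odds series)
TRAIN_X = [(0.02, 0.01), (0.03, 0.02), (0.05, 0.02), (0.04, 0.03),
           (0.30, 0.12), (0.25, 0.10), (0.40, 0.15), (0.35, 0.11)]
TRAIN_Y = [0, 0, 0, 0, 1, 1, 1, 1]  # 0 = normal match, 1 = abnormal match


def knn_predict(x, k=3):
    """Label x with the majority class among its k nearest training points."""
    ranked = sorted(zip(TRAIN_X, TRAIN_Y), key=lambda pair: dist(pair[0], x))
    top_labels = [label for _, label in ranked[:k]]
    return Counter(top_labels).most_common(1)[0][0]


def move_predict(x, max_move=0.15):
    """Stand-in base model: flag a match whose odds moved more than max_move."""
    return 1 if x[0] > max_move else 0


def volatility_predict(x, max_std=0.06):
    """Stand-in base model: flag a match whose odds series is unusually volatile."""
    return 1 if x[1] > max_std else 0


def ensemble_predict(x):
    """Hard-voting ensemble: abnormal only if a strict majority of models agree."""
    votes = [knn_predict(x), move_predict(x), volatility_predict(x)]
    return 1 if sum(votes) * 2 > len(votes) else 0


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Synthetic held-out matches, again purely illustrative.
TEST_X = [(0.03, 0.02), (0.33, 0.13), (0.20, 0.03), (0.10, 0.07)]
TEST_Y = [0, 1, 1, 0]

predictions = [ensemble_predict(x) for x in TEST_X]
print("predictions:", predictions)
print("accuracy:", accuracy(TEST_Y, predictions))
```

In the last two test points the base models disagree, and the vote decides the final label. An odd number of voters is used here to avoid ties; combining four base models, as in the study, would need an explicit tie-breaking or weighting rule.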