Predicting soccer matches with complex networks and machine learning

Eduardo Alves Baratela, Felipe Jordão Xavier, Thomas Peron, Paulino Ribeiro Villas-Boas, Francisco Aparecido Rodrigues
2024-09-20
Abstract: Soccer attracts the attention of many researchers and professionals in the sports industry. As a result, science is increasingly incorporated into the sport, with growing investment in performance analysis and sports prediction. This study aims to (i) highlight the use of complex networks as an alternative tool for predicting soccer match outcomes, and (ii) show how combining the structural analysis of passing networks with match statistics can provide deeper insight into the game patterns and strategies used by teams. To do so, complex network metrics and match statistics were used to build machine learning models that predict the wins and losses of soccer teams in different leagues. The results showed that models based on passing networks were as effective as "traditional" models, which use general match statistics. Combining both approaches produced more accurate models than using either one separately, which suggests that fusing them offers a deeper understanding of game patterns: the tactics employed by teams, the relationships between players, their positions, and their interactions during matches. Both network metrics and match statistics were important features in the mixed model. Furthermore, networks built at a finer temporal granularity (such as one network for each half of the match) performed better than a single network for the entire game.
Social and Information Networks, Machine Learning, Applications
What problem does this paper attempt to address?
The paper evaluates how effective complex network metrics are as features in machine learning models that predict soccer match outcomes. Specifically, the research objectives are:

1. **Verify whether the analysis of passing network structures can enhance the performance of prediction models**: By constructing passing networks that capture interactions between players and extracting complex network metrics as input features for machine learning models, the study assesses whether these metrics improve prediction accuracy.
2. **Combine network metrics with traditional features**: Compare models based on passing-network metrics with traditional models that rely solely on historical match statistics, and test whether combining the two feature sets yields a synergistic gain in prediction accuracy.
3. **Explore differences between leagues**: Run a cluster analysis on match data from leagues in Spain, Germany, England, France, and Italy to determine whether network structures and match characteristics differ significantly between these leagues.

Through this research, the authors aim to demonstrate that passing network structure is an effective indicator for predicting match outcomes, and that combining both kinds of features gives a better understanding of team performance patterns and strategies.
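
To make the feature-construction idea concrete, the following is a minimal sketch of how per-half passing networks could be turned into features and merged with match statistics. It is not the authors' pipeline: the function names (`build_passing_network`, `network_metrics`, `match_features`), the chosen metrics (density and the largest share of passes received by one player), and the toy data are all assumptions made for illustration, and the paper's actual metric set may differ.

```python
from collections import Counter


def build_passing_network(passes):
    """Weighted directed network: (passer, receiver) -> number of passes.

    `passes` is a hypothetical list of (passer, receiver) pairs.
    """
    return Counter((src, dst) for src, dst in passes if src != dst)


def network_metrics(edges):
    """Compute a few simple, illustrative metrics of a passing network."""
    players = {player for edge in edges for player in edge}
    n = len(players)
    total_passes = sum(edges.values())

    # Density: fraction of possible directed links that carry at least one pass.
    density = len(edges) / (n * (n - 1)) if n > 1 else 0.0

    # In-strength: number of passes received by each player.
    in_strength = Counter()
    for (_, receiver), weight in edges.items():
        in_strength[receiver] += weight

    # Share of all passes received by the most-targeted player.
    max_in_share = max(in_strength.values()) / total_passes if total_passes else 0.0

    return {
        "n_players": n,
        "total_passes": total_passes,
        "density": density,
        "max_in_share": max_in_share,
    }


def match_features(passes_by_half, match_stats):
    """Merge match statistics with network metrics from one network per half."""
    features = dict(match_stats)
    for half, passes in passes_by_half.items():
        metrics = network_metrics(build_passing_network(passes))
        for name, value in metrics.items():
            features[f"{half}_{name}"] = value
    return features


# Toy data (made up): one team's passes in each half, plus two match statistics.
first_half = [("A", "B"), ("B", "C"), ("A", "B"), ("C", "A")]
second_half = [("B", "A"), ("B", "C")]
features = match_features(
    {"h1": first_half, "h2": second_half},
    {"shots": 12, "possession": 0.55},
)
```

The resulting `features` dictionary holds one row of a mixed feature table: the original match statistics alongside a separate set of network metrics for each half. Rows like this, one per team and match, could then be passed to any standard classifier to predict wins and losses.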