Improved sports image classification using deep neural network and novel tuna swarm optimization

Zetian Zhou, Heqing Zhang, Mehdi Effatparvar
DOI: https://doi.org/10.1038/s41598-024-64826-7
IF: 4.6
2024-06-21
Scientific Reports
Abstract: Sports image classification is a complex undertaking that necessitates the utilization of precise and robust techniques to differentiate between various sports activities. This study introduces a novel approach that combines the deep neural network (DNN) with a modified metaheuristic algorithm known as novel tuna swarm optimization (NTSO) for the purpose of sports image classification. The DNN is a potent technique capable of extracting high-level features from raw images, while the NTSO algorithm optimizes the hyperparameters of the DNN, including the number of layers, neurons, and activation functions. Through the application of NTSO to the DNN, a finely-tuned network is developed, exhibiting exceptional performance in sports image classification. Rigorous experiments have been conducted on an extensive dataset of sports images, and the obtained results have been compared against other state-of-the-art methods, including Attention-based graph convolution-guided third-order hourglass network (AGTH-Net), particle swarm optimization algorithm (PSO), YOLOv5 backbone and SPD-Conv, and Depth Learning (DL). According to a fivefold cross-validation technique, the DNN/NTSO model provided remarkable precision, recall, and F1-score results: 97.665 ± 0.352%, 95.400 ± 0.374%, and 0.8787 ± 0.0031, respectively. Detailed comparisons reveal the DNN/NTSO model's superiority across various performance metrics, solidifying its standing as a top choice for sports image classification tasks. Based on the practical dataset, the DNN/NTSO model has been successfully evaluated in real-world scenarios, showcasing its resilience and flexibility in various sports categories. Its capacity to uphold precision in dynamic settings, where elements like lighting, backdrop, and motion blur are prominent, highlights its utility.
The model's scalability and efficiency in analyzing images from live sports competitions additionally validate its suitability for integration into real-time sports analytics and media platforms. This research not only confirms the theoretical superiority of the DNN/NTSO model but also its pragmatic effectiveness in a wide array of demanding sports image classification assignments.
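The abstract reports each metric as a mean ± spread over five folds. As a minimal sketch of how such figures are typically produced, the Python below computes macro-averaged precision, recall, and F1 for one fold and then summarizes per-fold values. The function names, the toy labels, the choice of macro averaging, and the use of the sample standard deviation are all assumptions made for illustration; the abstract does not state which averaging or spread measure the authors used.

```python
from statistics import mean, stdev


def macro_scores(y_true, y_pred):
    """Return macro-averaged (precision, recall, F1) over the classes in y_true."""
    precisions, recalls, f1s = [], [], []
    for c in sorted(set(y_true)):
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == c and p == c)  # true positives
        fp = sum(1 for t, p in pairs if t != c and p == c)  # false positives
        fn = sum(1 for t, p in pairs if t == c and p != c)  # false negatives
        prec = tp / (tp + fp) if tp + fp else 0.0
        rec = tp / (tp + fn) if tp + fn else 0.0
        f1 = 2 * prec * rec / (prec + rec) if prec + rec else 0.0
        precisions.append(prec)
        recalls.append(rec)
        f1s.append(f1)
    return mean(precisions), mean(recalls), mean(f1s)


def summarize_folds(fold_values):
    """Return (mean, sample standard deviation) of one metric across folds."""
    return mean(fold_values), stdev(fold_values)


# Hypothetical toy labels for a single fold (not data from the paper).
y_true = ["run", "run", "swim", "swim"]
y_pred = ["run", "swim", "swim", "swim"]
precision, recall, f1 = macro_scores(y_true, y_pred)

# Hypothetical per-fold precision values, summarized as mean and spread.
fold_mean, fold_std = summarize_folds([0.97, 0.98, 0.975, 0.98, 0.978])
print(f"precision = {fold_mean:.3f} ± {fold_std:.3f}")
```

In a real evaluation, `macro_scores` would be called once per held-out fold and `summarize_folds` once per metric.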
multidisciplinary sciences
What problem does this paper attempt to address?
The paper addresses sports image classification. The authors propose a method that combines a deep neural network (DNN) with a modified metaheuristic algorithm, novel tuna swarm optimization (NTSO), to improve classification performance. Conventional image classification techniques often struggle on sports images because of dynamic motion, variable backgrounds, and changing lighting conditions.
To overcome these challenges, the DNN extracts high-level features from the raw images, while NTSO optimizes the DNN's hyperparameters, including the number of layers, the number of neurons, and the activation functions. Applying NTSO to the DNN yields a finely tuned network for sports image classification.
Under five-fold cross-validation, the DNN/NTSO model achieved a precision, recall, and F1-score of 97.665 ± 0.352%, 95.400 ± 0.374%, and 0.8787 ± 0.0031, respectively. The paper reports that it outperforms the compared methods, namely the attention-based graph convolution-guided third-order hourglass network (AGTH-Net), particle swarm optimization (PSO), the YOLOv5 backbone with SPD-Conv, and Depth Learning (DL), across multiple performance metrics.
The model was also evaluated on a practical dataset in real-world scenarios, where it proved robust and flexible across sports categories and maintained its precision under changing lighting, complex backgrounds, and motion blur. The authors therefore consider it suitable for integration into real-time sports analytics and media platforms that process images from live sports competitions.
In summary, this paper presents an effective solution for sports image classification, which not only has theoretical advantages but also has been validated in practice, making it applicable to various challenging sports image classification tasks.
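The hyperparameter search described above (number of layers, number of neurons, activation function) can be illustrated with a generic population-based search. The sketch below is not the paper's NTSO algorithm, whose update rules are not given here. It is a simple follow-the-best swarm with random perturbation, and `toy_validation_score` is a hypothetical stand-in for training a DNN and measuring validation performance. The search ranges and the activation list are likewise assumptions.

```python
import random

ACTIVATIONS = ["relu", "tanh", "sigmoid"]  # hypothetical candidate set


def decode(position):
    """Map a point in [0, 1]^3 to (num_layers, num_neurons, activation)."""
    layers = 1 + int(position[0] * 4.999)        # assumed range: 1..5
    neurons = 16 + int(position[1] * 240.999)    # assumed range: 16..256
    index = min(int(position[2] * len(ACTIVATIONS)), len(ACTIVATIONS) - 1)
    return layers, neurons, ACTIVATIONS[index]


def toy_validation_score(config):
    """Hypothetical objective that peaks at (3, 128, 'relu'); replaces real training."""
    layers, neurons, activation = config
    penalty = 0.0 if activation == "relu" else 0.05
    return 1.0 - 0.1 * abs(layers - 3) - abs(neurons - 128) / 512 - penalty


def swarm_search(objective, n_agents=10, n_iters=30, seed=0):
    """Generic swarm-style search: agents drift toward the best position found so far."""
    rng = random.Random(seed)
    swarm = [[rng.random() for _ in range(3)] for _ in range(n_agents)]
    best_pos = list(max(swarm, key=lambda p: objective(decode(p))))
    best_score = objective(decode(best_pos))
    for _ in range(n_iters):
        for i, pos in enumerate(swarm):
            # Step a random fraction toward the best position, plus Gaussian noise,
            # then clamp each coordinate back into [0, 1].
            new = [
                min(1.0, max(0.0, x + rng.random() * (b - x) + rng.gauss(0, 0.1)))
                for x, b in zip(pos, best_pos)
            ]
            swarm[i] = new
            score = objective(decode(new))
            if score > best_score:
                best_pos, best_score = list(new), score
    return decode(best_pos), best_score


best_config, best_score = swarm_search(toy_validation_score)
print(best_config, round(best_score, 4))
```

Encoding every hyperparameter as a coordinate in [0, 1] lets one update rule handle integer and categorical choices alike. Swapping in a real objective would mean building and validating a network inside the function passed as `objective`.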