Exploration of Network Optimization Strategies Based On the TSN Model.

Yifei Yuan,Zanxi Ruan,Yingmei Wei,Tingshuai Jiang
DOI: https://doi.org/10.1145/3639631.3639657
2023-01-01
Abstract:With the widespread application of deep learning in the field of computer vision, deep learning-based video behavior recognition models have become important tools for video recognition tasks. However, these models currently face two challenges: how to expand and enhance data to improve model accuracy, and how to achieve lightweight networks without increasing computational complexity. The Temporal Segment Network (TSN) model is a deep learning model used for video action recognition. It combines 2D and 3D convolutional neural networks and effectively captures temporal and spatial information in videos while maintaining the requirement for lightweight networks. This paper explores the optimal data augmentation strategy based on the TSN model and proposes a Temporal Feature Fusion Algorithm to optimize the TSN model, providing a solution for high-performance lightweight networks. These findings will contribute to the exploration of data augmentation techniques in future video behavior recognition models and provide insights for lightweight network research.
What problem does this paper attempt to address?