LOTUS: Improving Transformer Efficiency with Sparsity Pruning and Data Lottery Tickets

Ojasw Upadhyay
2024-05-02
Abstract:Vision transformers have revolutionized computer vision, but their computational demands present challenges for training and deployment. This paper introduces LOTUS (LOttery Transformers with Ultra Sparsity), a novel method that leverages data lottery ticket selection and sparsity pruning to accelerate vision transformer training while maintaining accuracy. Our approach focuses on identifying and utilizing the most informative data subsets and eliminating redundant model parameters to optimize the training process. Through extensive experiments, we demonstrate the effectiveness of LOTUS in achieving rapid convergence and high accuracy with significantly reduced computational requirements. This work highlights the potential of combining data selection and sparsity techniques for efficient vision transformer training, opening doors for further research and development in this area.
Computer Vision and Pattern Recognition,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
This paper aims to address the issue of low training efficiency in Transformer models in the field of computer vision. Although Transformer has shown excellent performance in tasks such as image recognition and object detection, its high computational requirements pose challenges during training and deployment. The paper proposes a new method called LOTUS (LOttery Transformers with Ultra Sparsity), which combines data selection (data lottery theory) and sparse pruning techniques to accelerate the training process of Transformers while maintaining or improving accuracy, and reducing computational demands. Through experiments, the paper demonstrates the effectiveness of LOTUS in achieving fast convergence and high accuracy while reducing computational resources, opening up new directions for research and development in this field in the future.