VISEM-Tracking, a human spermatozoa tracking dataset

Vajira Thambawita,Steven A. Hicks,Andrea M. Storås,Thu Nguyen,Jorunn M. Andersen,Oliwia Witczak,Trine B. Haugen,Hugo L. Hammer,Pål Halvorsen,Michael A. Riegler
DOI: https://doi.org/10.1038/s41597-023-02173-4
2023-05-10
Abstract:A manual assessment of sperm motility requires microscopy observation, which is challenging due to the fast-moving spermatozoa in the field of view. To obtain correct results, manual evaluation requires extensive training. Therefore, computer-assisted sperm analysis (CASA) has become increasingly used in clinics. Despite this, more data is needed to train supervised machine learning approaches in order to improve accuracy and reliability in the assessment of sperm motility and kinematics. In this regard, we provide a dataset called VISEM-Tracking with 20 video recordings of 30 seconds (comprising 29,196 frames) of wet sperm preparations with manually annotated bounding-box coordinates and a set of sperm characteristics analyzed by experts in the domain. In addition to the annotated data, we provide unlabeled video clips for easy-to-use access and analysis of the data via methods such as self- or unsupervised learning. As part of this paper, we present baseline sperm detection performances using the YOLOv5 deep learning (DL) model trained on the VISEM-Tracking dataset. As a result, we show that the dataset can be used to train complex DL models to analyze spermatozoa.
Computer Vision and Pattern Recognition,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the lack of data in sperm movement analysis. Specifically, Computer - Aided Sperm Analysis (CASA) systems require a large amount of training data to improve accuracy and reliability when evaluating sperm motility and kinetic characteristics. However, the existing publicly - labeled data sets are very limited. Most data sets only focus on fixed and stained static sperm images or very short sequences and cannot fully reflect the dynamic characteristics of sperm. Therefore, the author provides a data set named VISEM - Tracking, which contains 20 sperm videos of 30 seconds each (a total of 29,196 frames). Each video has manually - labeled bounding box coordinates and a set of sperm characteristics analyzed by domain experts. In addition, unlabeled video clips are also provided to facilitate data access and analysis through self - supervised or unsupervised learning methods. The provision of this data set aims to fill the gaps in existing data sets and support the training of more complex deep - learning models to achieve more accurate analysis of sperm movement.