Limitations of weak labels for embedding and tagging

Nicolas Turpault, Romain Serizel, Emmanuel Vincent
DOI: https://doi.org/10.48550/arXiv.2002.01687
2020-12-07
Abstract: Many datasets and approaches in ambient sound analysis use weakly labeled data. Weak labels are employed because annotating every data sample with a strong label is too expensive. Yet, their impact on the performance in comparison to strong labels remains unclear. Indeed, weak labels must often be dealt with at the same time as other challenges, namely multiple labels per sample, unbalanced classes and/or overlapping events. In this paper, we formulate a supervised learning problem which involves weak labels. We create a dataset that focuses on the difference between strong and weak labels as opposed to other challenges. We investigate the impact of weak labels when training an embedding or an end-to-end classifier. Different experimental scenarios are discussed to provide insights into which applications are most sensitive to weakly labeled data.
Sound, Artificial Intelligence, Machine Learning, Audio and Speech Processing