Editorial Editorial of Special Issue on Self-Supervised Learning for Speech and Audio Processing

Hung-Yi Lee,Shinji Watanabe,Karen Livescu,Abdelrahman Mohamed,Tara Sainath
DOI: https://doi.org/10.1109/jstsp.2022.3205434
IF: 7.695
2022-01-01
IEEE Journal of Selected Topics in Signal Processing
Abstract:The papers in this special section focus on self-supervised learning for speech and audio processing. A current trend in the machine learning community is the adoption of self-supervised approaches to pretrain deep networks. Self-supervised learning utilizes proxy-supervised learning tasks (or pretext tasks)—for example, distinguishing parts of the input signal from distractors or reconstructing masked input segments conditioned on unmasked segments—to obtain training data from unlabeled corpora. These approaches make it possible to use the tremendous amount of unlabeled data available on the web to train large neural models. Recent self-supervised approaches for speech and audio processing are also gaining attention.
What problem does this paper attempt to address?