Deep Transform: Time-Domain Audio Error Correction via Probabilistic Re-Synthesis

Andrew J.R. Simpson
DOI: https://doi.org/10.48550/arXiv.1503.05849
2015-03-20
Abstract:In the process of recording, storage and transmission of time-domain audio signals, errors may be introduced that are difficult to correct in an unsupervised way. Here, we train a convolutional deep neural network to re-synthesize input time-domain speech signals at its output layer. We then use this abstract transformation, which we call a deep transform (DT), to perform probabilistic re-synthesis on further speech (of the same speaker) which has been degraded. Using the convolutive DT, we demonstrate the recovery of speech audio that has been subject to extreme degradation. This approach may be useful for correction of errors in communications devices.
Sound,Machine Learning,Neural and Evolutionary Computing
What problem does this paper attempt to address?