Convolutional neural network for robust pitch determination.

Hong Su, Hui Zhang, Xueliang Zhang, Guanglai Gao
DOI: https://doi.org/10.1109/ICASSP.2016.7471741
2016-01-01
ICASSP
Abstract: Pitch is an important characteristic of speech and is useful for many applications. However, pitch determination in noisy conditions is difficult. In this paper, we propose a supervised learning algorithm to estimate pitch using a convolutional neural network (CNN). Specifically, we use a CNN for pitch candidate selection and dynamic programming for pitch tracking. Our experimental results show that the proposed method obtains accurate pitch estimates and generalizes well to new speakers and noisy conditions. We credit this success to the CNN, which is well suited to modeling the shift-invariant spectral features used for pitch detection.
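The two-stage idea in the abstract (per-frame candidate selection followed by dynamic-programming tracking) can be illustrated with a minimal sketch of the tracking stage alone. The abstract does not specify the transition model or the CNN output format, so everything here is an assumption: the function name `track_pitch`, the input layout (one list of probabilities over quantized pitch states per frame, standing in for CNN outputs), and the linear `jump_penalty` on state changes are all hypothetical choices, not the authors' method.

```python
import math


def track_pitch(posteriors, jump_penalty=1.0):
    """Viterbi-style search for the best pitch-state sequence.

    posteriors: list of frames, each a list of probabilities over
        quantized pitch states (assumed stand-in for CNN outputs).
    jump_penalty: hypothetical cost per state of jump between
        consecutive frames; not taken from the paper.
    """
    n_states = len(posteriors[0])
    eps = 1e-12  # avoids log(0) for zero-probability states

    # Log-score of the best path ending in each state at frame 0.
    score = [math.log(p + eps) for p in posteriors[0]]
    backptrs = []

    for frame in posteriors[1:]:
        new_score, ptr = [], []
        for j in range(n_states):
            # Best predecessor state for landing in state j.
            best_i = max(
                range(n_states),
                key=lambda i: score[i] - jump_penalty * abs(i - j),
            )
            transition = score[best_i] - jump_penalty * abs(best_i - j)
            new_score.append(transition + math.log(frame[j] + eps))
            ptr.append(best_i)
        score = new_score
        backptrs.append(ptr)

    # Backtrack from the best final state.
    state = max(range(n_states), key=lambda i: score[i])
    path = [state]
    for ptr in reversed(backptrs):
        state = ptr[state]
        path.append(state)
    path.reverse()
    return path


# Toy input: the middle frame has a spurious peak at state 3.
frames = [
    [0.05, 0.80, 0.10, 0.05],
    [0.05, 0.40, 0.10, 0.45],
    [0.05, 0.80, 0.10, 0.05],
]
smoothed = track_pitch(frames, jump_penalty=1.0)
framewise = track_pitch(frames, jump_penalty=0.0)
```

With the penalty enabled, the tracker keeps the contour on state 1 through the noisy middle frame; with the penalty set to zero it reduces to a frame-by-frame argmax and follows the spurious peak. This is the kind of continuity constraint that a tracking stage adds on top of per-frame candidates.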