Pvd: A New Pathological Voice Dataset For Intra-Speaker Recognition Research Interest

Dongdong Li,Jianyu Wang,Yingchun Yang
DOI: https://doi.org/10.1109/iscslp.2016.7918488
2016-01-01
Abstract:In this paper, a pathological voice dataset (PVD) is introduced. The dataset contains recordings of 14 speakers (9 female and 5male) and two health states: normal and unhealthy. Each speaker pronounces fixed words, prompted digits, reads sentences and gives free talking. These materials cover all the phonemes in Chinese. The dataset also considerate the channel variability and is recorded through three channels simultaneously for each speaker: mobile phone, microphone and digital voice recorder. This corpus is constructed for prosodic and linguistic investigation of pathological voice variability in Mandarin. It can also be used for recognition of speakers in unhealthy state. Furthermore, speaker recognition baseline experiments are performed on this database.
What problem does this paper attempt to address?