Cheating Heisenberg: Achieving certainty in wideband spectrography

Sean Fulop
DOI: https://doi.org/10.1121/1.4778123
2003-10-01
The Journal of the Acoustical Society of America
Abstract: The spectrographic analysis of sound has been with us for some 58 years, and one of the key properties of the process is the trade-off in resolution between the time and frequency dimensions in the computed graph. While spectrography has greatly advanced the development of phonetics, the uncertainty principle has always been a source of frustration to phoneticians, because so many of the interesting features of speech must be observed by computing Fourier spectra over very short time frames, i.e., using a “wideband” spectrogram. Since the uncertainty relation between time and frequency is unbreakable, the only option for improvement is to make a new kind of spectrogram that does not graph time and frequency. An algorithm is described and demonstrated which computes a new kind of spectrogram in which Fourier transform frequency is replaced by the channelized instantaneous frequency, and time is adjusted by the local group delay. The theory behind this procedure was clarified in Nelson [J. Acoust. Soc. Am. 110, 2575–2592 (2001)]. The resulting wideband spectrograms show dramatically improved resolution of speech features, which will be demonstrated with sample figures. It is thus suggested that phoneticians should be more interested in the instantaneous frequency spectrum than in the Fourier transform.
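The two substitutions named in the abstract can be illustrated with a minimal sketch. It is not the author's implementation: it assumes that the channelized instantaneous frequency is estimated from the phase advance of each short-time Fourier channel over one sample, and that the local group delay is estimated from the phase difference between adjacent channels. This finite-difference choice, the function names `stft_frames` and `reassign`, and the parameters `n_win` and `hop` are all assumptions made for illustration.

```python
import numpy as np

def stft_frames(x, win, hop):
    """Short-time Fourier transform: one row per frame, one column per bin."""
    n = len(win)
    starts = np.arange(0, len(x) - n + 1, hop)
    frames = np.stack([x[s:s + n] * win for s in starts])
    return np.fft.rfft(frames, axis=1), starts

def reassign(x, fs, n_win=64, hop=8):
    """Estimate instantaneous frequency (Hz) and delay-corrected time (s)
    for every short-time Fourier cell of a wideband (short-window) analysis.
    Hypothetical helper; the finite-difference estimates are an assumption."""
    win = np.hanning(n_win)
    # Same analysis applied to the signal and to a copy delayed by one sample.
    s_now, starts = stft_frames(x[1:], win, hop)
    s_prev, _ = stft_frames(x[:-1], win, hop)

    # Phase advance per sample in each channel, converted to Hz.
    cif_hz = np.angle(s_now * np.conj(s_prev)) * fs / (2 * np.pi)

    # Phase difference between adjacent channels. Negating the product
    # shifts the phase by pi so that the delay is measured from the
    # centre of the frame rather than from its first sample.
    cross = s_prev[:, 1:] * np.conj(s_prev[:, :-1])
    offset = -np.angle(-cross) * n_win / (2 * np.pi)  # samples from centre
    t_hat = (starts[:, None] + n_win / 2 + offset) / fs

    # Simplification: each delay estimate is assigned to the upper of the
    # two channels it was computed from, so bin 0 is dropped throughout.
    power = np.abs(s_prev[:, 1:]) ** 2
    return cif_hz[:, 1:], t_hat, power
```

Plotting `power` at the coordinates (`t_hat`, `cif_hz`) instead of on the regular frame-by-bin grid gives the kind of display the abstract describes. As a check on the idea, a steady 1040 Hz cosine sampled at 8 kHz should yield an estimate close to 1040 Hz near the spectral peak, even though the 64-point analysis channels are 125 Hz apart.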
acoustics, audiology & speech-language pathology