Abstract: In this paper, we propose a wideband (WB) to super-wideband audio bandwidth extension (BWE) method based on temporal smoothing cepstral coefficients (TSCC). The temporal relationship of audio signals is incorporated into feature extraction in the BWE frontend so that the extended spectra evolve more smoothly over time. In the proposed scheme, a Gammatone auditory filter bank decomposes the audio signal, and the energy of each frequency band is long-term smoothed using minima controlled recursive averaging (MCRA) to suppress transient components. The resulting ‘steady-state’ spectrum is frequency weighted, and the TSCC are then obtained by applying a power-law loudness function followed by cepstral normalization. The extracted TSCC are fed into a Gaussian mixture model (GMM)-based Bayesian estimator to estimate the high-frequency (HF) spectral envelope, while the fine structure is restored by spectral translation. Evaluation results show that the TSCC exploit the temporal relationship of audio signals and provide higher mutual information between the low- and high-frequency parameters without increasing the dimension of the input vectors in the BWE frontend. In addition, when applied to the G.729.1 wideband codec, the proposed method outperforms the Mel frequency cepstral coefficient (MFCC)-based method in terms of log spectral distortion (LSD), cosh measure, and differential log spectral distortion. Furthermore, the proposed method improves the smoothness of the reconstructed spectrum over time and performs well in subjective listening tests.
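The frontend steps named in the abstract (temporal smoothing of band energies, power-law compression, cepstral normalization) and the LSD metric can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the Gammatone band energies are already available as a frames-by-bands array, replaces MCRA with a plain first-order recursive average, omits frequency weighting, and uses placeholder values for the smoothing factor `alpha`, the exponent `power`, and the number of coefficients `n_coeffs`. All function names are hypothetical.

```python
import numpy as np


def recursive_smooth(band_energy, alpha=0.9):
    """First-order recursive averaging over time (axis 0).

    Simplified stand-in for MCRA: no minimum tracking, fixed alpha (assumed).
    """
    band_energy = np.asarray(band_energy, dtype=float)
    smoothed = np.empty_like(band_energy)
    smoothed[0] = band_energy[0]
    for t in range(1, len(band_energy)):
        smoothed[t] = alpha * smoothed[t - 1] + (1.0 - alpha) * band_energy[t]
    return smoothed


def dct_ii(x):
    """Unnormalized DCT-II along the last axis (bands -> cepstral index)."""
    n = x.shape[-1]
    k = np.arange(n)[:, None]   # cepstral index
    m = np.arange(n)[None, :]   # band index
    basis = np.cos(np.pi * k * (2 * m + 1) / (2 * n))
    return x @ basis.T


def smoothed_cepstra(band_energy, alpha=0.9, power=0.1, n_coeffs=13):
    """Smooth band energies, compress with a power law, take cepstra,
    then subtract the per-coefficient mean over time."""
    smoothed = recursive_smooth(band_energy, alpha)
    compressed = np.power(smoothed, power)      # power-law loudness (exponent assumed)
    cepstra = dct_ii(compressed)[:, :n_coeffs]
    return cepstra - cepstra.mean(axis=0, keepdims=True)


def log_spectral_distortion(ref_power, est_power, eps=1e-12):
    """Mean over frames of the RMS difference between log power spectra, in dB."""
    ref_db = 10.0 * np.log10(np.asarray(ref_power, dtype=float) + eps)
    est_db = 10.0 * np.log10(np.asarray(est_power, dtype=float) + eps)
    per_frame = np.sqrt(np.mean((ref_db - est_db) ** 2, axis=-1))
    return float(np.mean(per_frame))


# Usage with synthetic data: 50 frames, 24 bands (both sizes are arbitrary).
rng = np.random.default_rng(0)
energies = rng.uniform(0.1, 1.0, size=(50, 24))
features = smoothed_cepstra(energies)
```

In a full system, `features` would be the input to the GMM-based envelope estimator, and `log_spectral_distortion` would compare the reference and reconstructed high-band power spectra frame by frame.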

Nonlinear bandwidth extension of audio signals based on hidden Markov model

Blind Bandwidth Extension Of Audio Signals Based On Non-Linear Prediction And Hidden Markov Model

Nonlinear bandwidth extension based on nearest-neighbor matching

Blind Bandwidth Extension Of Audio Signals Based On Harmonic Mapping In Phase Space

Audio Bandwidth Extension Using Ensemble of Recurrent Neural Networks

Audio Bandwidth Extension Based on Ensemble Echo State Networks with Temporal Evolution

Audio bandwidth extension based on RBF neural network

Audio Bandwidth Extension Method Based on Local Least Square Support Vector Machine

Bandwidth extension of audio signal based on SOM prediction model

Bandwidth Extension Method Based on Nonlinear Audio Characteristics Classification

A Blind Bandwidth Extension Method for Audio Signals Based on Phase Space Reconstruction

Audio Bandwidth Extension Method Based on Echo State Network

Audio Bandwidth Extension based on Maximum Lyapunov Prediction

Analysis and Forecast of Audio Bandwidth Extending Techniques

A Blind Bandwidth Extension Method of Audio Signals Based on Volterra Series

Audio Bandwidth Extension Based on Grey Model

Spectral Envelope Estimation Used for Audio Bandwidth Extension Based on RBF Neural Network

Audio Bandwidth Extension Based on Temporal Smoothing Cepstral Coefficients

Audio Bandwidth Extension Based on Volterra Series

Bandwidth extension of audio signals based on cochlear filter cepstral coefficients

Artificial Bandwidth Extension For Speech Signals Using Speech Recognition