An energy-efficient voice activity detector using reconfigurable Gaussian base normalization deep neural network
Anu Samanta,Indranil Hatai,Ashis Kumar Mal
DOI: https://doi.org/10.1007/s11042-023-14699-1
IF: 2.577
2023-02-23
Multimedia Tools and Applications
Abstract:This research paper proposed deep neural networks and approximation computation are used to create an energy-efficient voice activity detector (VDA). The proposed technique is split up into two parts: feature extraction and voice/noise classification using a deep neural network with Gaussian basis normalization (GNDNN). Pre-processing of input data initially: the digitalized speech signal’s high-frequency components are pre-emphasized, trying to make it a little less susceptible to finite precision impacts later inside the signal processing. The feature extraction module uses Mel-frequency cepstral coefficients (MFCC), time-frequency non-negative matrix factorization (TFNMF), to extract the input speech signals feature value. The TFNMF, MFCC output from feature extraction is classified by the GNDNN speech prediction phase, which evaluates whether the signal is indeed a voice or noise. The proposed approach can be dynamically changed to meet various computing accuracy demands. Our proposed approach most exciting accuracy result of 98.75%. Comparable to the CNN and DNN, which achieves the accuracy of 97.25%, 95.25%, and EERA had the worst accuracy 88.75%. The results of the experiments show that our proposed strategy outperforms previous methods.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering