Abstract:Clustering algorithms have the characteristics of being simple and efficient and can complete calculations without a large number of datasets, making them suitable for application in noise reduction processing for audio module mass production testing. In order to solve the problems of the NMF algorithm easily getting stuck in local optimal solutions and difficult feature signal extraction, an improved NMF audio denoising algorithm combined with K-means initialization was designed. Firstly, the Euclidean distance formula of K-means has been improved to extract audio signal features from multiple dimensions. Combined with the initialization strategy of K-means decomposition, the initialization dictionary matrix of the NMF algorithm has been optimized to avoid getting stuck in local optimal solutions and effectively improve the robustness of the algorithm. Secondly, in the sparse coding part of the NMF algorithm, feature extraction expressions are added to solve the problem of noise residue and partial spectral signal loss in audio signals during the operation process. At the same time, the size of the coefficient matrix is limited to reduce operation time and improve the accuracy of feature extraction in high-precision audio signals. Then, comparative experiments were conducted using the NOIZEUS and NOISEX-92 datasets, as well as random noise audio signals. This algorithm improved the signal-to-noise ratio by 10–20 dB and reduced harmonic distortion by approximately −10 dB. Finally, a high-precision audio acquisition unit based on FPGA was designed, and practical applications have shown that it can effectively improve the signal-to-noise ratio of audio signals and reduce harmonic distortion.

Audio hash function based on non-negative matrix factorisation of mel-frequency cepstral coefficients.

Robust audio hashing based on discrete-wavelet-transform and non-negative matrix factorisation

Audio Perceptual Hashing Based on Nmf and Mdct Coefficients

Hash Authentication Algorithm of Compressed Domain Speech Perception Based on MFCC and NMF

Robust Audio Hashing Scheme Based on Cochleagram and Cross Recurrence Analysis

Key-Dependent Compressed Domain Audio Hashing

Compressed Domain Robust Hashing For Aac Audio

Perceptual Hashing Based on Correlation Coefficient of MFCC for Speech Authentication

ZERO-WATERMARKING ALGORITHM BASED ON AUDIO FEATURES OF MFCC

Robust Speech Hash Function

Daubechies Wavelets Based Robust Audio Fingerprinting for Content-Based Audio Retrieval

Robust Mel-Frequency Cepstral coefficients feature detection and dual-tree complex wavelet transform for digital audio watermarking

New Authentication Algorithm for Audio Content

An Improved Nonnegative Matrix Factorization Algorithm Combined with K-Means for Audio Noise Reduction

Music Content Authentication Based on Beat Segmentation and Fuzzy Classification

On the music content authentication.

Robust Speech Hashing for Content Authentication

Content-Based Audio Retrieval Using Perceptual Hash

A Fingerprint-Based Audio Authentication Scheme Using Frequency Domain Statistical Characteristic

Robust Hashing for Music Copyright Protection by Combining Beat Segmentation and Chroma.

Robust and lightweight audio fingerprint for Automatic Content Recognition