Abstract: Early detection of factory machinery malfunctions is crucial in industrial applications. In machine anomalous sound detection (ASD), different machines exhibit unique vibration-frequency ranges determined by their physical properties. Meanwhile, the human auditory system is adept at tracking both the temporal and spectral dynamics of machine sounds. Consequently, integrating computational models of the human auditory system with machine-specific properties can be an effective approach to machine ASD. We first quantified the frequency importance for four types of machines using the Fisher ratio (F-ratio). The quantified frequency importance was then used to design machine-specific non-uniform filterbanks (NUFBs), which extract the log non-uniform spectrum (LNS) feature. The designed NUFBs have narrower bandwidths and a higher filter density in frequency regions with relatively high F-ratios. Finally, spectral and temporal modulation representations derived from the LNS feature were proposed. The proposed LNS feature and modulation representations are fed into an autoencoder neural-network-based detector for ASD. The quantification results on the training set of the Malfunctioning Industrial Machine Investigation and Inspection dataset, at a signal-to-noise ratio (SNR) of 6 dB, reveal that the information distinguishing normal from anomalous sounds is encoded non-uniformly in the frequency domain, and differently for each machine. By emphasizing these important frequency regions with NUFBs, the LNS feature significantly improves performance, measured by the area under the receiver operating characteristic curve (AUC), under various SNR conditions. The modulation representations improve performance further: temporal modulation is effective for fans, pumps, and sliders, while spectral modulation is particularly effective for valves.
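To make the first two steps concrete, the following is a minimal sketch of how a per-frequency F-ratio could be computed and then turned into non-uniformly spaced filter center frequencies. It is not the authors' implementation: the abstract does not give the exact F-ratio formula or the filterbank design rule, so the two-class form (squared mean difference over summed variances), the cumulative-density placement of centers, the function names `fisher_ratio` and `nonuniform_centers`, and the synthetic data are all assumptions made for illustration.

```python
import numpy as np


def fisher_ratio(normal, anomalous, eps=1e-12):
    """Two-class F-ratio per frequency bin (assumed form, not from the paper).

    normal, anomalous: arrays of shape (n_frames, n_bins) holding
    log-magnitude spectra of each class.
    """
    mu_n, mu_a = normal.mean(axis=0), anomalous.mean(axis=0)
    var_n, var_a = normal.var(axis=0), anomalous.var(axis=0)
    # Between-class separation divided by within-class spread.
    return (mu_n - mu_a) ** 2 / (var_n + var_a + eps)


def nonuniform_centers(f_ratio, freqs, n_filters, floor=1e-3):
    """Place filter centers densely where the F-ratio is high.

    Hypothetical design rule: treat the F-ratio as a density over
    frequency and put centers at equal steps of its cumulative sum.
    """
    # A small floor keeps the density positive so the CDF is invertible.
    density = f_ratio + floor * f_ratio.max()
    cdf = np.cumsum(density)
    cdf = cdf / cdf[-1]
    targets = (np.arange(n_filters) + 0.5) / n_filters
    return np.interp(targets, cdf, freqs)


# Synthetic example: 257 bins up to 8 kHz; the classes differ only
# in bins 100..119 (roughly 3.1 to 3.7 kHz).
rng = np.random.default_rng(0)
n_bins = 257
freqs = np.linspace(0.0, 8000.0, n_bins)
normal = rng.normal(0.0, 1.0, size=(200, n_bins))
anomalous = rng.normal(0.0, 1.0, size=(200, n_bins))
anomalous[:, 100:120] += 3.0

f = fisher_ratio(normal, anomalous)
centers = nonuniform_centers(f, freqs, n_filters=32)
```

In this toy setup most of the 32 centers land in the band where the two classes differ, which mirrors the abstract's description of a higher filter density in regions with relatively high F-ratios. Filter bandwidths could then be set from the spacing between neighboring centers, which would also make them narrower in those regions.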

Machine Anomalous Sound Detection Using Spectral-temporal Modulation Representations Derived from Machine-specific Filterbanks
