Dimension Reduction Method of High-Dimensional Fault Datasets Based on C_M_t-SNE under Unsupervised Background

Sencai Ma,Gang Cheng,Yong Li,Rongzhen Zhao
DOI: https://doi.org/10.1016/j.measurement.2023.112835
IF: 5.6
2023-01-01
Measurement
Abstract:The unlabeled fault datasets often contain much non-sensitive redundant, and uncertain information. This study designs a novel interpretable and unsupervised dimension reduction method for unlabeled data containing redundancy and uncertainty. Firstly, a fuzzy-based way for pseudo-label generation is given, and feature cloud models under pseudo labels are established; Secondly, this study takes the expectation, entropy, and hyper entropy of the cloud models representing uncertainty in features as spatial vectors. The difference degree between vectors is treated as the evaluation standard to filter out non-sensitive features based on the maximum initial difference; Moreover, redundant elements are fused by t-SNE, and lower dimensional feature components conducive for fault classification are obtained; Finally, the effectiveness of the method is demonstrated by comparative experiments. The results show that this method has a higher factor, which means that the method can better mine the difference among different faults and improve the performance of fault identification.
What problem does this paper attempt to address?