EVNet: An Explainable Deep Network for Dimension Reduction

Zelin Zang, Shenghui Cheng, Linyan Lu, Hanchen Xia, Liangyu Li, Yaoting Sun, Yongjie Xu, Lei Shang, Baigui Sun, Stan Z. Li
DOI: https://doi.org/10.48550/arXiv.2211.15478
2022-11-21
Abstract: Dimension reduction (DR) is commonly used to capture the intrinsic structure of high-dimensional data and transform it into a low-dimensional space while retaining meaningful properties of the original data. It is used in various applications, such as image recognition, single-cell sequencing analysis, and biomarker discovery. However, contemporary parametric-free and parametric DR techniques suffer from several significant shortcomings, such as the inability to preserve both global and local features and poor generalization performance. Regarding explainability, it is crucial to comprehend the embedding process, especially the contribution of each part to it, and to understand how each feature affects the embedding results; this identifies critical components and helps diagnose the embedding process. To address these problems, we have developed a deep neural network method called EVNet, which provides not only excellent structure-preservation performance but also explainability of the DR process. EVNet starts with data augmentation and a manifold-based loss function to improve embedding performance. The explanation is based on saliency maps and aims to examine the trained EVNet parameters and the contributions of components during the embedding process. The proposed techniques are integrated with a visual interface that helps the user adjust EVNet to achieve better DR performance and explainability. The interactive visual interface makes it easier to illustrate the data features, compare different DR techniques, and investigate DR. An in-depth experimental comparison shows that EVNet consistently outperforms state-of-the-art methods in both performance measures and explainability.
Machine Learning, Artificial Intelligence
What problem does this paper attempt to address?
The paper addresses two shortcomings of existing dimension reduction (DR) techniques: poor preservation of global and local features, and a lack of interpretability. Specifically:

1. **Retention of global and local features**: Existing non-parametric and parametric DR techniques do not preserve the global and local features of the data well, so the reduced-dimension data loses important characteristics of the original data.
2. **Interpretability**: Traditional DR methods behave like "black boxes". It is difficult to understand their embedding process, especially how each feature influences the DR result. This matters most in application scenarios that require interpretability, such as biomedical research.

To solve these problems, the authors propose EVNet (Explainable Visualization Network), a DR method based on deep neural networks that aims to improve both DR performance and the interpretability of the model. EVNet pursues these goals in the following ways:

- **Data augmentation**: Introduces data augmentation as prior knowledge to improve the generalization ability and interpretability of the model.
- **New loss function**: Designs a manifold-based loss function \(L_{sp}\) that is compatible with data augmentation and computes data similarity in the latent space, avoiding direct use of similarity in the input space.
- **Explanation mechanism**: Uses tools such as saliency maps to explain the trained EVNet parameters and the contribution of each component to the embedding process.
- **Interactive visualization interface**: Integrates an interactive visualization interface that helps users adjust EVNet to obtain better DR performance and interpretability.

Through these improvements, EVNet can show better performance than existing methods on multiple image and biological datasets and provide clearer interpretability.
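
To make the saliency-map idea concrete, the following is a minimal sketch and not the authors' implementation. It assumes a toy encoder with random, untrained weights standing in for a trained embedding network, and it approximates the sensitivity of the low-dimensional embedding to each input feature with central finite differences instead of backpropagated gradients. The names `make_encoder` and `saliency` are hypothetical and chosen for illustration only.

```python
import math
import random


def make_encoder(n_in, n_hidden, n_out, seed=0):
    """Build a toy two-layer encoder with random weights (stand-in for a trained network)."""
    rng = random.Random(seed)
    w1 = [[rng.gauss(0, 1) for _ in range(n_in)] for _ in range(n_hidden)]
    w2 = [[rng.gauss(0, 1) for _ in range(n_hidden)] for _ in range(n_out)]

    def encode(x):
        # Hidden layer with tanh activation, then a linear projection to the embedding.
        h = [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in w1]
        return [sum(w * hi for w, hi in zip(row, h)) for row in w2]

    return encode


def saliency(encode, x, eps=1e-5):
    """Per-feature saliency: sum over embedding dimensions of |d z_k / d x_j|.

    Derivatives are approximated with central finite differences.
    """
    scores = []
    for j in range(len(x)):
        x_hi, x_lo = list(x), list(x)
        x_hi[j] += eps
        x_lo[j] -= eps
        z_hi, z_lo = encode(x_hi), encode(x_lo)
        scores.append(sum(abs(a - b) / (2 * eps) for a, b in zip(z_hi, z_lo)))
    return scores


# Hypothetical 5-feature sample embedded into 2 dimensions.
encode = make_encoder(n_in=5, n_hidden=8, n_out=2)
x = [0.5, -1.0, 0.25, 2.0, 0.0]
scores = saliency(encode, x)

# Rank features from most to least influential on this sample's embedding.
ranking = sorted(range(len(x)), key=lambda j: scores[j], reverse=True)
print(ranking)
```

A larger score means that a small change in that feature moves the embedded point further, which is the kind of per-feature contribution a saliency map visualizes. With a real trained network, automatic differentiation would normally replace the finite-difference loop.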