Deep Optimal Transport for Domain Adaptation on SPD Manifolds

Ce Ju,Cuntai Guan
2024-06-03
Abstract:The machine learning community has shown increasing interest in addressing the domain adaptation problem on symmetric positive definite (SPD) manifolds. This interest is primarily driven by the complexities of neuroimaging data generated from brain signals, which often exhibit shifts in data distribution across recording sessions. These neuroimaging data, represented by signal covariance matrices, possess the mathematical properties of symmetry and positive definiteness. However, applying conventional domain adaptation methods is challenging because these mathematical properties can be disrupted when operating on covariance matrices. In this study, we introduce a novel geometric deep learning-based approach utilizing optimal transport on SPD manifolds to manage discrepancies in both marginal and conditional distributions between the source and target domains. We evaluate the effectiveness of this approach in three cross-session brain-computer interface scenarios and provide visualized results for further insights. The GitHub repository of this study can be accessed at <a class="link-external link-https" href="https://github.com/GeometricBCI/Deep-Optimal-Transport-for-Domain-Adaptation-on-SPD-Manifolds" rel="external noopener nofollow">this https URL</a>.
Machine Learning,Artificial Intelligence,Signal Processing
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the domain adaptation problem on symmetric positive - definite (SPD) manifolds, especially in the cross - session motor imagery classification tasks in brain - computer interfaces (BCI). Specifically: 1. **Domain Adaptation Problem**: The data distributions of the source domain and the target domain are different. This is particularly evident in neuroimaging data because these data usually show distribution changes between different recording sessions. Such changes make it difficult for traditional domain adaptation methods to be directly applied to covariance matrices because their mathematical properties (symmetry and positive - definiteness) may be destroyed during the operation. 2. **Differences in Marginal and Conditional Distributions**: There are differences between the marginal distribution \(P_S(X_S)\neq P_T(X_T)\) and the conditional distribution \(Q_S(Y_S|X_S)\neq Q_T(Y_T|X_T)\) of the source domain and the target domain. This places higher requirements on the model, which needs to adapt to the changes in both distributions simultaneously. 3. **Limitations of Existing Methods**: The existing optimal transport (OT) framework has theoretical and practical deficiencies when dealing with data on SPD manifolds. For example, the cost function based on Euclidean distance is inconsistent in differential geometry and fails to fully consider the differences in conditional distributions. To solve these problems, the authors propose a new geometric deep - learning method - Deep Optimal Transport (DOT), which uses the optimal transport theory to perform domain adaptation on SPD manifolds. This method achieves joint distribution adaptation by minimizing the Riemannian distance between the source domain and the target domain, thereby effectively addressing the domain adaptation challenges in cross - session brain - computer interface scenarios. In summary, this paper aims to develop a domain adaptation method that can effectively handle the differences in marginal and conditional distributions on SPD manifolds to improve the performance of cross - session brain - computer interface tasks.