Clément Bonet, Benoît Malézieux, Alain Rakotomamonjy, Lucas Drumetz, Thomas Moreau, Matthieu Kowalski, Nicolas Courty
Abstract: When dealing with electro or magnetoencephalography records, many supervised prediction tasks are solved by working with covariance matrices to summarize the signals. Learning with these matrices requires using Riemannian geometry to account for their structure. In this paper, we propose a new method to deal with distributions of covariance matrices and demonstrate its computational efficiency on M/EEG multivariate time series. More specifically, we define a Sliced-Wasserstein distance between measures of symmetric positive definite matrices that comes with strong theoretical guarantees. Then, we take advantage of its properties and kernel methods to apply this distance to brain-age prediction from MEG data and compare it to state-of-the-art algorithms based on Riemannian geometry. Finally, we show that it is an efficient surrogate to the Wasserstein distance in domain adaptation for Brain Computer Interface applications.
What problem does this paper attempt to address?
The paper addresses how to work efficiently with distributions of covariance matrices, which are commonly used to summarize electroencephalography (EEG) and magnetoencephalography (MEG) recordings in supervised prediction tasks. Because covariance matrices are symmetric positive definite (SPD), learning with them requires Riemannian geometry, and comparing distributions of them with the Wasserstein distance is computationally costly. The paper defines a Sliced-Wasserstein distance between measures of SPD matrices that comes with strong theoretical guarantees and demonstrates its computational efficiency on M/EEG multivariate time series. Combined with kernel methods, this distance is applied to brain-age prediction from MEG data and compared with state-of-the-art algorithms based on Riemannian geometry. Finally, the paper shows that the distance is an efficient surrogate for the Wasserstein distance in domain adaptation for brain-computer interface (BCI) applications.
### Main Contributions
1. **Introduction of SPDSW**: The paper proposes a Sliced-Wasserstein distance between measures of symmetric positive definite matrices (SPDSW) that is well suited to numerical approximation.
2. **Theoretical Results**: The paper derives topological, statistical, and computational properties of SPDSW. In particular, it proves that SPDSW is a distance that is topologically equivalent to the Wasserstein distance in this setting.
3. **Extension to Distribution Regression of SPD Matrices**: The paper extends distribution regression with Sliced-Wasserstein kernels to SPD matrices and applies it to brain-age prediction from MEG data, reporting better performance than other methods based on Riemannian geometry.
4. **Application in BCI**: The paper shows that SPDSW is an efficient surrogate for the Wasserstein distance in domain adaptation for brain-computer interface applications.
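To make contribution 3 more concrete, here is a minimal sketch of how a matrix of pairwise distances between subjects' distributions could feed a kernel regressor. The Gaussian form of the kernel, the ridge solver, and the names `gaussian_kernel`, `fit_kernel_ridge`, `sigma`, and `lam` are illustrative assumptions, not the paper's exact pipeline; the distance matrix `D` stands in for pairwise SPDSW values and is replaced here by a toy one-dimensional distance.

```python
import numpy as np

def gaussian_kernel(D, sigma):
    """Turn a matrix of pairwise distances into a Gaussian kernel matrix.

    Assumption: the kernel has the form exp(-D^2 / (2 sigma^2)).
    """
    return np.exp(-D ** 2 / (2.0 * sigma ** 2))

def fit_kernel_ridge(K_train, y, lam):
    """Solve (K + lam * I) alpha = y for the dual coefficients alpha."""
    n = K_train.shape[0]
    return np.linalg.solve(K_train + lam * np.eye(n), y)

def predict(K_test_train, alpha):
    """Predict targets from the kernel between test and training items."""
    return K_test_train @ alpha

# Toy stand-in: each "subject" is a point on a line, so the distance is |x_i - x_j|.
# In the setting described above, D would hold distances between distributions.
x = np.arange(5, dtype=float)
ages = np.array([20.0, 35.0, 50.0, 65.0, 80.0])
D = np.abs(x[:, None] - x[None, :])

K = gaussian_kernel(D, sigma=1.0)
alpha = fit_kernel_ridge(K, ages, lam=1e-8)
print(predict(K, alpha))  # close to the training ages, since lam is tiny
```

Only the distance matrix changes when a different distance between distributions is plugged in, which is why a cheap distance matters when there are many subjects.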
### Method Overview
- **Sliced-Wasserstein Distance**: The paper first reviews the Sliced-Wasserstein distance in Euclidean space and then extends it to the space of symmetric positive definite matrices. SPDSW is defined by averaging the one-dimensional Wasserstein distances between the measures projected onto geodesics passing through the origin.
- **Background of SPD Matrices**: The paper introduces the basic concepts of symmetric positive definite matrices, including their Riemannian manifold structure and different choices of metric, such as the affine-invariant metric and the log-Euclidean metric.
- **Construction of SPDSW**: The paper describes in detail how to construct SPDSW on the space of symmetric positive definite matrices, including how to project onto geodesics and how to compute the resulting coordinates.
- **Properties of SPDSW**: The paper derives the theoretical properties of SPDSW, including that it is a distance, that it metrizes weak convergence, and how it relates to the Wasserstein distance.
- **Numerical Experiments**: The paper reports numerical experiments on the Cam-CAN dataset that evaluate SPDSW on the brain-age prediction task.
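The construction outlined above can be sketched as a Monte Carlo estimate. This is a minimal sketch under stated assumptions, not the authors' implementation: it assumes the log-Euclidean metric, under which the coordinate of an SPD matrix X along the geodesic with direction A (symmetric, unit Frobenius norm) is taken to be Tr(A log X); it draws directions as A = Q diag(θ) Qᵀ with Q a random orthogonal matrix and θ a random unit vector; and it requires both samples to have the same size so that the one-dimensional Wasserstein distance reduces to comparing sorted projections. The names `spd_log`, `sample_directions`, `spdsw`, and `random_spd` are hypothetical.

```python
import numpy as np

def spd_log(X):
    """Matrix logarithm of an SPD matrix via its eigendecomposition."""
    w, V = np.linalg.eigh(X)
    return (V * np.log(w)) @ V.T

def sample_directions(d, n_proj, rng):
    """Draw symmetric d x d matrices with unit Frobenius norm.

    Assumed scheme: A = Q diag(theta) Q^T, Q orthogonal, theta on the unit sphere.
    """
    dirs = np.empty((n_proj, d, d))
    for k in range(n_proj):
        Q, R = np.linalg.qr(rng.standard_normal((d, d)))
        Q = Q * np.sign(np.diag(R))  # sign fix so Q is uniformly distributed
        theta = rng.standard_normal(d)
        theta /= np.linalg.norm(theta)
        dirs[k] = (Q * theta) @ Q.T
    return dirs

def spdsw(Xs, Ys, n_proj=200, p=2, seed=0):
    """Monte Carlo sliced distance between two equal-size sets of SPD matrices."""
    if Xs.shape != Ys.shape:
        raise ValueError("this sketch assumes equal sample sizes and dimensions")
    rng = np.random.default_rng(seed)
    A = sample_directions(Xs.shape[-1], n_proj, rng)
    log_X = np.stack([spd_log(X) for X in Xs])
    log_Y = np.stack([spd_log(Y) for Y in Ys])
    # Tr(A log X) equals the entrywise inner product since both are symmetric.
    proj_X = np.sort(np.einsum("kij,nij->kn", A, log_X), axis=1)
    proj_Y = np.sort(np.einsum("kij,nij->kn", A, log_Y), axis=1)
    # 1D Wasserstein between equal-size samples: match sorted values.
    return np.mean(np.abs(proj_X - proj_Y) ** p) ** (1.0 / p)

def random_spd(n, d, rng):
    """Toy generator of n well-conditioned SPD matrices of size d x d."""
    M = rng.standard_normal((n, d, d))
    return M @ M.transpose(0, 2, 1) + d * np.eye(d)

rng = np.random.default_rng(1)
Xs = random_spd(30, 4, rng)
Ys = 3.0 * random_spd(30, 4, rng)
print(spdsw(Xs, Xs), spdsw(Xs, Ys))
```

Each projection costs one sort, so the estimate avoids solving a full optimal transport problem between the two sets of matrices; the number of projections `n_proj` trades accuracy for run time.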
### Conclusion
The proposed method combines theoretical guarantees for handling distributions of covariance matrices from M/EEG data with good performance on practical tasks. In brain-age prediction, it is reported to perform better than other methods based on Riemannian geometry, and in domain adaptation for brain-computer interfaces, SPDSW serves as an efficient surrogate for the Wasserstein distance.