Two-sided Matrix Regression

Nayel Bettache,Cristina Butucea
2023-03-09
Abstract:The two-sided matrix regression model $Y = A^*X B^* +E$ aims at predicting $Y$ by taking into account both linear links between column features of $X$, via the unknown matrix $B^*$, and also among the row features of $X$, via the matrix $A^*$. We propose low-rank predictors in this high-dimensional matrix regression model via rank-penalized and nuclear norm-penalized least squares. Both criteria are non jointly convex; however, we propose explicit predictors based on SVD and show optimal prediction bounds. We give sufficient conditions for consistent rank selector. We also propose a fully data-driven rank-adaptive procedure. Simulation results confirm the good prediction and the rank-consistency results under data-driven explicit choices of the tuning parameters and the scaling parameter of the noise.
Statistics Theory
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to predict the target matrix \(Y\) through the Two - sided Matrix Regression (2MR) model in the context of high - dimensional data. Specifically, the model \(Y = A^*XB^*+E\) aims to predict the matrix \(Y\) by considering the linear relationships between the column features of \(X\) (through the unknown matrix \(B^*\)) and the linear relationships between the row features of \(X\) (through the matrix \(A^*\)). In the paper, low - rank predictors are proposed and these predictors are implemented by rank - penalty and nuclear - norm - penalty least - squares methods. Although the proposed criteria are not jointly convex, the authors propose explicit predictors based on Singular Value Decomposition (SVD) and show the optimal prediction bounds. In addition, the paper also provides sufficient conditions to ensure the consistency of the rank selector and proposes a fully data - driven rank - adaptive procedure. ### Key Points Summary: 1. **Model Background**: The paper discusses how to use the two - sided matrix regression model to handle high - dimensional data in the context of big data, especially when the data matrix \(Y\) can be explained by the design matrix \(X\) and two parameter matrices \(A^*\) and \(B^*\). 2. **Model Form**: The form of the two - sided matrix regression model is \(Y = A^*XB^* + E\), where \(E\) is the noise matrix and its entries are assumed to be independent, centered \(\sigma\)-sub - Gaussian distributions. 3. **Low - Rank Predictors**: The paper proposes a method to construct low - rank predictors by rank - penalty and nuclear - norm - penalty least - squares methods. Although these criteria are not jointly convex, the authors give explicit predictors through Singular Value Decomposition (SVD). 4. **Theoretical Results**: The authors prove that these predictors can reach the optimal prediction bounds under certain conditions and provide a data - driven rank - adaptive procedure. 5. **Numerical Simulation**: The numerical simulation results verify the good prediction performance and rank consistency of the proposed predictors under different noise levels. ### Main Contributions: - Propose low - rank predictors and implement these predictors by rank - penalty and nuclear - norm - penalty least - squares methods. - Demonstrate the excellent performance of these predictors in high - dimensional data and provide theoretical guarantees. - Propose a data - driven rank - adaptive procedure that can effectively select the appropriate rank in practical applications. ### Formula Summary: - **Model**: \[ Y = A^* X B^* + E \] - **Low - Rank Predictors**: \[ \hat{A}_r = U_Y \hat{A}_0^r U_X^T \] and \[ \hat{B}_r = V_X \hat{B}_0^r V_Y^T \] - **Nuclear - Norm - Penalty Least - Squares**: \[ \min_{A, B} \| Y - A X B \|_F^2+ 2\lambda\cdot\| A X B \|_* \] Through these methods, the paper provides an effective solution set for the matrix regression problem of high - dimensional data.