Linear RNNs Provably Learn Linear Dynamical Systems
Lifu Wang,Tianyu Wang,Shengwei Yi,Bo Shen,Bo Hu,Xing Cao
DOI: https://doi.org/10.1007/s10957-024-02521-3
2024-10-16
Journal of Optimization Theory and Applications
Abstract:In this paper, we investigate the learning abilities of linear recurrent neural networks (RNNs) trained using Gradient Descent. We present a theoretical guarantee demonstrating that these linear RNNs can effectively learn any stable linear dynamical system with polynomial complexity. Importantly, our derived generalization error bound is independent of the episode length. For any stable linear system with a transition matrix C characterized by a parameter related to the spectral radius, we prove that despite the non-convexity of the parameter optimization loss, a linear RNN can learn the system with polynomial sample and time complexity in , provided that the RNN has sufficient width. Notably, the required width of the hidden layers does not depend on the length of the input sequence. This work provides the first rigorous theoretical foundation for learning linear RNNs. Our findings suggest that linear RNNs are capable of efficiently learning complex dynamical systems, paving the way for further research into the learning capabilities of more general RNN architectures.
mathematics, applied,operations research & management science