Characterizing Deep Gaussian Processes via Nonlinear Recurrence Systems

Anh Tong, Jaesik Choi
DOI: https://doi.org/10.48550/arXiv.2010.09301
2020-12-21
Abstract: Recent advances in Deep Gaussian Processes (DGPs) show their potential for more expressive representations than those of traditional Gaussian Processes (GPs). However, DGPs suffer from a pathology: their learning capacity drops significantly as the number of layers increases. In this paper, we present a new analysis of DGPs that explains this issue by studying their corresponding nonlinear dynamic systems. Existing work reports the pathology for the squared exponential kernel function. We extend the investigation to four common types of stationary kernel functions. The recurrence relations between layers are derived analytically, providing a tighter bound and the rate of convergence of the dynamic systems. We demonstrate our findings with a number of experimental results.
Machine Learning