StarNet: Gradient-free Training of Deep Generative Models using Determined System of Linear Equations

Amir Zadeh,Santiago Benoit,Louis-Philippe Morency
DOI: https://doi.org/10.48550/arXiv.2101.00574
2021-01-03
Abstract:In this paper we present an approach for training deep generative models solely based on solving determined systems of linear equations. A network that uses this approach, called a StarNet, has the following desirable properties: 1) training requires no gradient as solution to the system of linear equations is not stochastic, 2) is highly scalable when solving the system of linear equations w.r.t the latent codes, and similarly for the parameters of the model, and 3) it gives desirable least-square bounds for the estimation of latent codes and network parameters within each layer.
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?