A resource-efficient model for deep kernel learning

Luisa D'Amore
2024-10-14
Abstract:According to the Hughes phenomenon, the major challenges encountered in computations with learning models comes from the scale of complexity, e.g. the so-called curse of dimensionality. There are various approaches for accelerate learning computations with minimal loss of accuracy. These approaches range from model-level to implementation-level approaches. To the best of our knowledge, the first one is rarely used in its basic form. Perhaps, this is due to theoretical understanding of mathematical insights of model decomposition approaches, and thus the ability of developing mathematical improvements has lagged behind. We describe a model-level decomposition approach that combines both the decomposition of the operators and the decomposition of the network. We perform a feasibility analysis on the resulting algorithm, both in terms of its accuracy and scalability.
Machine Learning,Numerical Analysis
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the excessive consumption of computing resources when deep learning (DL) processes large - scale data. Specifically, with the increase in the complexity of deep - learning models, especially under the influence of the curse of dimensionality (that is, the complexity of processing high - dimensional data rises sharply), the training of DL models becomes extremely time - consuming and computationally costly. This not only limits the scale and performance of the models but also makes it more difficult to deploy these models in resource - constrained environments. To address this challenge, the author proposes a new model - level technique - D3L (Decomposed Deep Learning). This method is based on the idea of kernel decomposition and aims to improve the resource efficiency of the model through data reduction, localization of prediction functions, and reduction of error functions. The main feature of D3L lies in the appropriate modification of the local error function. By imposing regularization constraints, the consistency of solutions to adjacent sub - problems is enforced. As a result, a large - scale DL problem that originally needed to be solved can be decomposed into multiple smaller sub - problems. Each sub - problem can be solved independently, and these sub - problems can be efficiently solved in a parallel computing framework, thereby improving the accuracy - per - parameter metric while maintaining the scalability of the algorithm. In addition, D3L also designs a neighbor communication scheme that allows local exchange of parameter updates between worker nodes without the need for a fully - connected network structure, further reducing communication overhead and improving computational efficiency. This method not only helps to alleviate the computational bottleneck of deep - learning models but also provides a feasible solution for applying complex DL models in resource - constrained environments.