On the spectral bias of coupled frequency predictor–corrector triangular DNN: The convergence analysis

Rui Zhang
DOI: https://doi.org/10.1007/s13160-023-00617-3
2023-09-23
Japan Journal of Industrial and Applied Mathematics
Abstract: Data-driven neural network algorithms show extraordinary promise in scientific computing and artificial intelligence, but they still fall short when the target functions to be approximated exhibit high frequencies. In previous work (Zhang in Jpn J Ind Appl Math, https://doi.org/10.1007/s13160-023-00577-8, 2023), we proposed a novel architecture, the coupled frequency predictor–corrector triangular DNN (cFPCT-DNN), which couples a frequency predictor network and a corrector network. Various numerical experiments show that cFPCT-DNNs are efficient and robust in approximating high-frequency functions. In this paper, we investigate the spectral bias of the cFPCT-DNN through the lens of neural tangent kernel (NTK) theory and elucidate how the hyperparameter that is obtained through the predictor network and used in the activation function of the corrector network can lead to robust and accurate DNN models. We verify, both theoretically and experimentally, that this hyperparameter determines the frequency of the eigenfunctions of the limiting NTK and that the spectrum of the NTK decays exceedingly slowly for the cFPCT-DNN, which proves that the convergence rate can be significantly improved in the training process.
mathematics, applied