Seeing Implicit Neural Representations as Fourier Series

Nuri Benbarka,Timon Höfer,Hamd ul-moqeet Riaz,Andreas Zell
DOI: https://doi.org/10.48550/arXiv.2109.00249
2021-09-01
Abstract:Implicit Neural Representations (INR) use multilayer perceptrons to represent high-frequency functions in low-dimensional problem domains. Recently these representations achieved state-of-the-art results on tasks related to complex 3D objects and scenes. A core problem is the representation of highly detailed signals, which is tackled using networks with periodic activation functions (SIRENs) or applying Fourier mappings to the input. This work analyzes the connection between the two methods and shows that a Fourier mapped perceptron is structurally like one hidden layer SIREN. Furthermore, we identify the relationship between the previously proposed Fourier mapping and the general d-dimensional Fourier series, leading to an integer lattice mapping. Moreover, we modify a progressive training strategy to work on arbitrary Fourier mappings and show that it improves the generalization of the interpolation task. Lastly, we compare the different mappings on the image regression and novel view synthesis tasks. We confirm the previous finding that the main contributor to the mapping performance is the size of the embedding and standard deviation of its elements.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problems that this paper attempts to solve mainly focus on the following aspects: 1. **Understanding the relationship between implicit neural representations (INR) and Fourier mappings**: The paper explores the structural similarities between a perceptron using Fourier mappings and a single - hidden - layer SIREN with a periodic activation function. Specifically, the paper proves that a perceptron with a Fourier mapping is structurally similar to a single - hidden - layer SIREN, but the mapping in SIREN is trainable and represented in amplitude - phase form, while the Fourier mapping is represented in sine - cosine form. 2. **Exploring the application of Fourier series in signal representation**: The paper assumes that the functions to be learned have a finite input domain (such as the height and width of an image), and their values are defined on a finite set, so these functions can be considered continuous and periodic on their input boundaries, satisfying all the conditions for representation by Fourier series. The paper further determines the trigonometric form of the d - dimensional Fourier series and proves that a perceptron with an integer lattice - point mapping can represent any periodic signal if it contains infinitely many frequencies in the mapping. However, in practical applications, the Fourier coefficients are finite and can be obtained by uniformly sampling the signal and applying the fast Fourier transform (FFT). 3. **Improving the generalization ability of interpolation tasks**: The paper modifies a progressive training strategy, that is, training low - frequency components in the early stage of training and gradually adding high - frequency components as the training progresses. This method helps to avoid overfitting and improve the generalization ability of the model on interpolation tasks. 4. **Comparing the performance of different mapping methods**: The paper compares different mapping methods on image regression and new - view - synthesis tasks, verifying the conclusion in previous studies that the main contributing factors to mapping performance are the number and standard deviation of mapping parameters. Through the above research, the paper aims to provide a new perspective for understanding the connection between implicit neural representations and Fourier analysis, and propose effective techniques to improve the performance and generalization ability of models based on Fourier mappings.