Neural Experts: Mixture of Experts for Implicit Neural Representations

Yizhak Ben-Shabat, Chamin Hewa Koneputugodage, Sameera Ramasinghe, Stephen Gould
2024-10-29
Abstract: Implicit neural representations (INRs) have proven effective in various tasks including image, shape, audio, and video reconstruction. These INRs typically learn the implicit field from sampled input points. This is often done using a single network for the entire domain, imposing many global constraints on a single function. In this paper, we propose a mixture of experts (MoE) implicit neural representation approach that enables learning local piece-wise continuous functions by simultaneously learning to subdivide the domain and fit locally. We show that incorporating a mixture of experts architecture into existing INR formulations provides a boost in speed, accuracy, and memory requirements. Additionally, we introduce novel conditioning and pretraining methods for the gating network that improve convergence to the desired solution. We evaluate the effectiveness of our approach on multiple reconstruction tasks, including surface reconstruction, image reconstruction, and audio signal reconstruction, and show improved performance compared to non-MoE methods.
Computer Vision and Pattern Recognition
### What problems does this paper attempt to solve?

This paper addresses two limitations that Implicit Neural Representations (INRs) encounter when modeling complex signals: limited parallelization and scalability, and limited locality.

1. **Parallelization and scale limitations**:
   - Traditional INRs usually use a single Multi-Layer Perceptron (MLP) to represent the entire domain, so every parameter contributes to the output at every input point, which makes the parameters difficult to optimize.
   - This design restricts the model's ability to parallelize and to scale.
2. **Locality limitations**:
   - An ideal INR should be able to change its features quickly in order to model sharp boundaries, but because of spectral bias this behavior does not emerge naturally in an optimized MLP.
   - A pure MLP therefore limits local control over the signal, which reduces the model's flexibility and adaptability.

To address these problems, the authors introduce an implicit neural representation method based on the Mixture of Experts (MoE) architecture, called "Neural Experts". It improves on existing INRs in the following ways:

- **Learning of local piecewise continuous functions**: The MoE architecture lets the model divide the input space into multiple regions and learn a local continuous function within each region, which captures local features better.
- **Enhanced parallelization and efficiency**: Assigning different regions to different expert sub-networks uses computational resources more effectively, improving training speed and memory utilization.
- **Improved initialization and pre-training**: New conditioning and pre-training methods help the manager (gating) network converge to the desired solution and avoid getting trapped in local minima.

### Main contributions of the paper

1. **Introducing the mixture of experts architecture for implicit neural representation**: The paper proposes an MoE INR method that reconstructs a signal while simultaneously segmenting the domain, and that lends itself to parallelization.
2. **Designing a novel manager network architecture and initialization method**: These allow the domain segmentation to be learned without ground-truth labels.
3. **Demonstrating effectiveness in multiple applications**: Experiments on image, audio, and 3D surface reconstruction show that the method improves reconstruction performance while using fewer parameters.

According to the paper, these changes improve reconstruction performance over non-MoE INR baselines on all three tasks.
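To make the architecture described above more concrete, here is a minimal, untrained sketch of an MoE forward pass for a 1D signal. It is an illustration only, not the authors' implementation: the names `TinyMLP` and `MoEINR`, the sine activation, the layer sizes, and the choice between a hard (argmax) and a soft (probability-weighted) combination of experts are all assumptions made for this example. The summary does not say which combination rule the paper uses, nor does the sketch cover its conditioning and pre-training schemes.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


class TinyMLP:
    """Hypothetical one-hidden-layer MLP: scalar input, `out_dim` outputs."""

    def __init__(self, hidden, out_dim, rng):
        self.w1 = [rng.uniform(-1.0, 1.0) for _ in range(hidden)]
        self.b1 = [rng.uniform(-1.0, 1.0) for _ in range(hidden)]
        self.w2 = [
            [rng.uniform(-1.0, 1.0) / hidden for _ in range(hidden)]
            for _ in range(out_dim)
        ]

    def __call__(self, x):
        # Sine activation is an assumption (common in INRs such as SIREN).
        h = [math.sin(w * x + b) for w, b in zip(self.w1, self.b1)]
        return [sum(w * hi for w, hi in zip(row, h)) for row in self.w2]


class MoEINR:
    """Several expert networks plus a manager (gating) network."""

    def __init__(self, num_experts=4, hidden=8, seed=0):
        rng = random.Random(seed)
        self.experts = [TinyMLP(hidden, 1, rng) for _ in range(num_experts)]
        # The manager maps a coordinate to one logit per expert.
        self.manager = TinyMLP(hidden, num_experts, rng)

    def gate(self, x):
        """Probability that each expert is responsible for coordinate x."""
        return softmax(self.manager(x))

    def forward(self, x, hard=True):
        probs = self.gate(x)
        outputs = [expert(x)[0] for expert in self.experts]
        if hard:
            # Hard routing: each coordinate is owned by one expert, so the
            # overall function is piecewise continuous across regions.
            k = max(range(len(probs)), key=probs.__getitem__)
            return outputs[k], k
        # Soft routing: convex combination of all expert outputs.
        return sum(p * o for p, o in zip(probs, outputs)), None


if __name__ == "__main__":
    model = MoEINR(num_experts=4, hidden=8, seed=0)
    for x in (-0.5, 0.0, 0.5):
        value, expert_id = model.forward(x, hard=True)
        print(f"x={x:+.1f} -> expert {expert_id}, value {value:.4f}")
```

With hard routing, the set of coordinates assigned to each expert is the learned subdivision of the domain, and only the selected expert's parameters determine the output at a given point, in contrast to the single-MLP case where every parameter affects every point.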