Deep Unrolling Networks with Recurrent Momentum Acceleration for Nonlinear Inverse Problems

Qingping Zhou, Jiayu Qian, Junqi Tang, Jinglai Li
2024-03-31
Abstract: Combining the strengths of model-based iterative algorithms and data-driven deep learning solutions, deep unrolling networks (DuNets) have become a popular tool to solve inverse imaging problems. While DuNets have been successfully applied to many linear inverse problems, nonlinear problems tend to impair the performance of the method. Inspired by momentum acceleration techniques that are often used in optimization algorithms, we propose a recurrent momentum acceleration (RMA) framework that uses a long short-term memory recurrent neural network (LSTM-RNN) to simulate the momentum acceleration process. The RMA module leverages the ability of the LSTM-RNN to learn and retain knowledge from the previous gradients. We apply RMA to two popular DuNets -- the learned proximal gradient descent (LPGD) and the learned primal-dual (LPD) methods, resulting in LPGD-RMA and LPD-RMA respectively. We provide experimental results on two nonlinear inverse problems: a nonlinear deconvolution problem, and an electrical impedance tomography problem with limited boundary measurements. In the first experiment we have observed that the improvement due to RMA largely increases with respect to the nonlinearity of the problem. The results of the second example further demonstrate that the RMA schemes can significantly improve the performance of DuNets in strongly ill-posed problems.
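The data flow described in the abstract (gradients passed through an LSTM whose state persists across unrolled iterations) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: the scalar forward operator `forward`, the helper `update_placeholder` (standing in for a learned proximal network), the step size, and the randomly initialized, untrained `ScalarLSTMCell` weights are all hypothetical. In a real DuNet these weights would be learned from training data.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


class ScalarLSTMCell:
    """Tiny LSTM cell with scalar input, hidden state and cell state.

    Weights are random and untrained (hypothetical); this only shows data flow.
    """

    def __init__(self, seed=0):
        rng = random.Random(seed)
        # One (w_x, w_h, bias) triple per gate: input, forget, output, candidate.
        self.params = {
            gate: (rng.uniform(-0.5, 0.5), rng.uniform(-0.5, 0.5), 0.0)
            for gate in ("i", "f", "o", "g")
        }

    def _pre(self, gate, x, h):
        w_x, w_h, b = self.params[gate]
        return w_x * x + w_h * h + b

    def step(self, x, h, c):
        i = sigmoid(self._pre("i", x, h))    # input gate
        f = sigmoid(self._pre("f", x, h))    # forget gate
        o = sigmoid(self._pre("o", x, h))    # output gate
        g = math.tanh(self._pre("g", x, h))  # candidate cell update
        c_new = f * c + i * g                # cell state retains past gradients
        h_new = o * math.tanh(c_new)         # hidden state acts as the "momentum"
        return h_new, c_new


def forward(x, a):
    # Hypothetical scalar forward operator; `a` controls the nonlinearity.
    return x + a * x * x


def data_gradient(x, y, a):
    # Gradient of 0.5 * (forward(x, a) - y)**2 with respect to x.
    return (forward(x, a) - y) * (1.0 + 2.0 * a * x)


def update_placeholder(x, m, step_size=0.1):
    # Stand-in for the learned update network of an unrolled scheme.
    return x - step_size * m


def unrolled_rma(y, a, x0=0.0, n_iter=5, seed=0):
    cell = ScalarLSTMCell(seed)
    x, h, c = x0, 0.0, 0.0
    xs, hs = [x], []
    for _ in range(n_iter):
        grad = data_gradient(x, y, a)
        h, c = cell.step(grad, h, c)  # (h, c) carry over between iterations
        x = update_placeholder(x, h)
        xs.append(x)
        hs.append(h)
    return xs, hs


if __name__ == "__main__":
    iterates, momenta = unrolled_rma(y=2.0, a=1.0)
    print(iterates)
    print(momenta)
```

Because the weights are untrained, the iterates are not expected to converge to a solution. The point is only that the quantity driving each update depends on the whole gradient history through `(h, c)`, rather than on the current gradient alone.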
Machine Learning
What problem does this paper attempt to address?
### Problems Addressed by the Paper

This paper targets the performance degradation of deep unrolling networks (DuNets) on nonlinear inverse problems. DuNets perform well on many linear inverse problems, but their performance often suffers when the forward model is nonlinear. To improve this, the authors propose a Recurrent Momentum Acceleration (RMA) framework based on a Long Short-Term Memory Recurrent Neural Network (LSTM-RNN). The RMA module uses the LSTM-RNN's ability to learn and retain information from previous gradients, thereby improving the performance of DuNets on nonlinear inverse problems.

### Main Contributions

1. **Proposing the RMA Framework**: An LSTM-RNN simulates the momentum acceleration process, so that the RMA module can learn and retain knowledge from previous gradients.
2. **Applying RMA to Two Popular DuNets**: RMA is applied to the learned proximal gradient descent (LPGD) and learned primal-dual (LPD) methods, resulting in LPGD-RMA and LPD-RMA, respectively.
3. **Experimental Validation**: The RMA framework is evaluated on two nonlinear inverse problems: a nonlinear deconvolution problem and an electrical impedance tomography problem with limited boundary measurements. In both, the RMA module significantly improves the performance of DuNets.

### Experimental Results

1. **Nonlinear Deconvolution Problem**:
   - When the nonlinearity parameter \( a = 0 \), the Mean Squared Error (MSE) values of the various methods are almost the same, because the gradient of the forward operator is constant.
   - When \( a > 0 \), methods with the RMA module significantly outperform the other methods. For example, the MSE of LPD-RMA is 8.0%, 12.0%, and 16.0% lower than that of LPD when \( a = 1, 2, 4 \), respectively.
   - Traditional momentum acceleration modules can also improve performance, but the RMA module outperforms them in all nonlinear cases.
   - LPD-RMA consistently achieves the lowest MSE.
2. **Electrical Impedance Tomography Problem**:
   - The RMA module is validated on a synthetic 2D dataset.
   - The RMA module significantly improves the performance of DuNets across different numbers of training samples.
   - The improvement is especially pronounced in low-data scenarios and persists in high-data scenarios.

### Conclusion

By introducing the RMA module, the paper addresses the performance degradation of DuNets on nonlinear inverse problems. The experiments show that RMA improves accuracy in the nonlinear deconvolution problem and significantly improves reconstruction quality in the electrical impedance tomography problem, offering a new tool for future research on nonlinear inverse problems.
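A small worked example may help with the remark in the experimental results that the gradient of the forward operator is constant when \( a = 0 \). Assume, purely for illustration (the paper's exact operator is not reproduced in this summary), a forward operator built from a linear blur \( K \) and an elementwise quadratic term scaled by \( a \), with Jacobian \( J_a \):

\[
F_a(x) = Kx + a\,(Kx)\odot(Kx),
\qquad
J_a(x) = \bigl(I + 2a\,\mathrm{diag}(Kx)\bigr)\,K .
\]

Under this assumption \( J_0(x) = K \) does not depend on \( x \), so every unrolled iteration sees the same linearization. For \( a > 0 \) the Jacobian changes with the iterate \( x \), and the change grows with \( a \), which is consistent with the reported trend that the gain from RMA increases with the nonlinearity.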