Migrating Knowledge between Physical Scenarios based on Artificial Neural Networks

Yurui Qu,Li Jing,Yichen Shen,Min Qiu,Marin Soljacic
DOI: https://doi.org/10.48550/arXiv.1809.00972
2019-05-03
Abstract:Deep learning is known to be data-hungry, which hinders its application in many areas of science when datasets are small. Here, we propose to use transfer learning methods to migrate knowledge between different physical scenarios and significantly improve the prediction accuracy of artificial neural networks trained on a small dataset. This method can help reduce the demand for expensive data by making use of additional inexpensive data. First, we demonstrate that in predicting the transmission from multilayer photonic film, the relative error rate is reduced by 46.8% (26.5%) when the source data comes from 10-layer (8-layer) films and the target data comes from 8-layer (10-layer) films. Second, we show that the relative error rate is decreased by 22% when knowledge is transferred between two very different physical scenarios: transmission from multilayer films and scattering from multilayer nanoparticles. Finally, we propose a multi-task learning method to improve the performance of different physical scenarios simultaneously in which each task only has a small dataset.
Computer Vision and Pattern Recognition,Machine Learning,Computational Physics
What problem does this paper attempt to address?
The core problem that this paper attempts to solve is: How to improve the prediction accuracy of artificial neural networks in different physical scenarios with a small - data set. Specifically, the paper proposes a transfer - learning method, which can transfer knowledge from one physical scenario to another, thereby significantly improving the prediction performance of artificial neural networks trained on a small - data set. This method aims to reduce the need for expensive data and improve the performance of the model by using additional inexpensive data. The paper mainly focuses on two aspects of problems: 1. **Knowledge transfer between similar physical scenarios**: For example, the transfer prediction from a 10 - layer multi - layer thin film to an 8 - layer multi - layer thin film, or knowledge transfer in the opposite direction. Experimental results show that through transfer learning, the relative error rate can be significantly reduced. For example, the relative error rate from a 10 - layer thin film to an 8 - layer thin film is reduced by 50.5%. 2. **Knowledge transfer between very different physical scenarios**: For example, from the scattering problem of an 8 - layer multi - layer nanoparticle to the transmission problem of an 8 - layer multi - layer thin film. Although these two scenarios are very different, transfer learning can still reduce the relative error rate, achieving a 19.7% decrease. In addition, the paper also explores the multi - task learning method, that is, simultaneously handling multiple related tasks, each with only a small amount of data. By sharing some hidden layers of the neural network, multi - task learning can simultaneously improve performance on multiple tasks, even if the amount of data for each task is very small. In conclusion, by introducing transfer - learning and multi - task - learning methods, the paper solves the challenges of deep - learning applications in physical problems under the condition of a small - data set, and shows the potential of these methods in improving the prediction accuracy of models.