Data Efficient Lithography Modeling with Transfer Learning and Active Data Selection

Yibo Lin,Meng Li,Yuki Watanabe,Taiki Kimura,Tetsuaki Matsunawa,Shigeki Nojima,David Z. Pan
DOI: https://doi.org/10.48550/arXiv.1807.03257
2018-06-28
Abstract:Lithography simulation is one of the key steps in physical verification, enabled by the substantial optical and resist models. A resist model bridges the aerial image simulation to printed patterns. While the effectiveness of learning-based solutions for resist modeling has been demonstrated, they are considerably data-demanding. Meanwhile, a set of manufactured data for a specific lithography configuration is only valid for the training of one single model, indicating low data efficiency. Due to the complexity of the manufacturing process, obtaining enough data for acceptable accuracy becomes very expensive in terms of both time and cost, especially during the evolution of technology generations when the design space is intensively explored. In this work, we propose a new resist modeling framework for contact layers, utilizing existing data from old technology nodes and active selection of data in a target technology node, to reduce the amount of data required from the target lithography configuration. Our framework based on transfer learning and active learning techniques is effective within a competitive range of accuracy, i.e., 3-10X reduction on the amount of training data with comparable accuracy to the state-of-the-art learning approach.
Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the low data efficiency in lithography simulation. Specifically, although the existing learning - based resist modeling methods can achieve high precision, they require a large amount of training data, and this data can only be used for specific lithography configurations, resulting in low data utilization efficiency. As the technology nodes continue to evolve (for example, from 10 nanometers to 7 nanometers), obtaining sufficient training data becomes very expensive and time - consuming. To solve these problems, the author proposes a new resist modeling framework, aiming to reduce the amount of training data required for the target lithography configuration in the following two ways: 1. **Transfer Learning**: Utilize the data and knowledge from old technology nodes and transfer them to the models of new technology nodes. 2. **Active Learning**: Actively select the most representative data samples in the target technology node for labeling. ### Main Contributions - Propose a high - performance resist modeling technique based on Residual Neural Network (ResNet). - Design a transfer learning scheme to reduce the amount of data required to reach the target precision by using data from other configurations. - Propose an active learning scheme based on the K - Medoids algorithm, which is suitable for CNN and ResNet in combination with theoretical analysis. - The experimental results show that compared with the state - of - the - art learning methods, this method can reduce the amount of training data by 3 to 10 times while maintaining similar precision. ### Problem Definition The paper defines two main problems: 1. **Learning - based Resist Modeling Problem**: Given a data set containing aerial images and their central position thresholds, train a resist model to maximize the precision of predicting thresholds. 2. **Data - efficient Resist Modeling Problem**: Given a labeled N10 data set containing aerial images and thresholds, and an unlabeled N7 data set containing only aerial images, train a resist model for the N7 data set so that it queries the labels of N7 data samples as little as possible while achieving high precision. ### Definition of Data Efficiency Data efficiency refers to the amount of target - domain data required to learn a model under a given precision. If a model can obtain higher precision than another model at a specific amount of data, it means that this model has higher data efficiency. ### Training Process Figure 5 shows the overall training process: 1. Use the labeled source - domain data to train the source - domain model. 2. Before training the target - domain model, apply active learning to select data samples for label query. 3. Finally, use the selected data samples and the knowledge transferred from the source - domain model to train the target - domain model. In this way, this method not only improves the precision of the model, but also significantly reduces the amount of required training data, thereby enhancing data efficiency.