On Robustness and Generalization of ML-Based Congestion Predictors to Valid and Imperceptible Perturbations

Chester Holtz,Yucheng Wang,Chung-Kuan Cheng,Bill Lin
2024-03-01
Abstract:There is substantial interest in the use of machine learning (ML)-based techniques throughout the electronic computer-aided design (CAD) flow, particularly methods based on deep learning. However, while deep learning methods have achieved state-of-the-art performance in several applications, recent work has demonstrated that neural networks are generally vulnerable to small, carefully chosen perturbations of their input (e.g. a single pixel change in an image). In this work, we investigate robustness in the context of ML-based EDA tools -- particularly for congestion prediction. As far as we are aware, we are the first to explore this concept in the context of ML-based EDA.
Machine Learning,Hardware Architecture
What problem does this paper attempt to address?
The paper mainly discusses the application of machine learning (ML) in electronic design automation (EDA) tools, especially how deep learning-based networks can predict congestion in circuit layouts. The study found that although these deep learning methods perform well in tasks such as image classification, they are very sensitive to small, carefully selected disturbances in the input. The paper proposes a novel concept of imperceptibility for VLSI layout problems, which allows for small changes to the layout without altering the global wiring. The paper demonstrates that even small movements of a few cell positions (e.g., 1%-5%) can significantly reduce the predicted global congestion, without actually affecting the congestion. This indicates that existing CNN and GNN congestion models are fragile when faced with such imperceptible perturbations. To enhance the robustness of the models, the paper proposes a simple training method that modifies the training process to improve resistance to such disturbances. The main contributions of the paper include: 1. Defining a feasible neighborhood for input design specific to VLSI layout problems, i.e., determining what small disturbances to the layout can be made while keeping relevant congestion metrics unchanged. 2. Proposing a supervised and unsupervised algorithm to efficiently search the neighborhood of layouts. 3. Exploring adversarial training as a means to enhance robustness and generalization. 4. Demonstrating that benchmark layouts can be deteriorated in congestion prediction quality through perturbations. Additionally, the paper discusses the generalization problem of integrating machine learning into EDA toolsets, emphasizing the need for careful evaluation of the generalization performance of ML tools, not just relying on standard performance metrics on training and testing sets. In the experimental section, the paper uses the CircuitNet dataset to demonstrate the impact of the proposed imperceptible perturbation method on congestion predictors and verifies that adversarial training can significantly improve the robustness of the models.