DeepGate3: Towards Scalable Circuit Representation Learning

Zhengyuan Shi, Ziyang Zheng, Sadaf Khan, Jianyuan Zhong, Min Li, Qiang Xu
2024-07-15
Abstract: Circuit representation learning has shown promising results in advancing the field of Electronic Design Automation (EDA). Existing models, such as DeepGate Family, primarily utilize Graph Neural Networks (GNNs) to encode circuit netlists into gate-level embeddings. However, the scalability of GNN-based models is fundamentally constrained by architectural limitations, impacting their ability to generalize across diverse and complex circuit designs. To address these challenges, we introduce DeepGate3, an enhanced architecture that integrates Transformer modules following the initial GNN processing. This novel architecture not only retains the robust gate-level representation capabilities of its predecessor, DeepGate2, but also enhances them with the ability to model subcircuits through a novel pooling transformer mechanism. DeepGate3 is further refined with multiple innovative supervision tasks, significantly enhancing its learning process and enabling superior representation of both gate-level and subcircuit structures. Our experiments demonstrate marked improvements in scalability and generalizability over traditional GNN-based approaches, establishing a significant step forward in circuit representation learning technology.
Machine Learning, Artificial Intelligence
What problem does this paper attempt to address?
The paper addresses a limitation in electronic design automation (EDA): existing circuit representation learning models based on graph neural networks (GNNs) scale and generalize poorly on large, complex circuit designs. Specifically:

1. **Scalability limitations**: Although GNNs handle structured data effectively, their performance does not improve in proportion to the amount of training data, so simply adding more data yields limited gains. In addition, the message-passing mechanism of GNNs tends to distort information along long paths in large graphs, which makes similar graph structures hard to distinguish.
2. **Insufficient generalization ability**: The discriminative power of existing GNN models is usually not sufficient for large-scale, complex circuit designs. Performance is often unsatisfactory when a model has to generalize across different types of circuits.

To overcome these challenges, the paper proposes **DeepGate3**, an enhanced circuit representation learning architecture. By integrating Transformer modules after the initial GNN processing, it retains the gate-level representation ability of its predecessor, DeepGate2, and adds the ability to model subcircuits. New supervision tasks further improve the learning process, so that the model represents both gate-level and subcircuit structures well.

### Main contributions:

- **Architectural innovation**: DeepGate3 combines the advantages of GNNs and Transformers. Its Refine Transformer (RT) and Pooling Transformer (PT) modules improve the scalability and generalization ability of the model.
- **Improved pooling mechanism**: DeepGate3 introduces a new pooling Transformer mechanism that uses special [CLS] tokens to aggregate information between input and output, which helps it detect and emphasize subtle differences between circuits.
- **Multi-level pre-training strategy**: DeepGate3 is pre-trained on tasks at several levels, including gate-level and circuit-level tasks, so that the model learns circuit characteristics from multiple perspectives and performs better on practical EDA tasks.

In the paper's experiments, these changes yield marked improvements in scalability and generalization ability over traditional GNN-based approaches, which the authors present as a significant step forward for circuit representation learning.
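
The [CLS]-token pooling idea described above can be illustrated with a minimal sketch. This is not DeepGate3's Pooling Transformer: it assumes a single attention head, no learned projections, and no stacked layers, and the names `cls_pool`, `gates`, and `cls` are hypothetical, with made-up embedding values. It only shows how one query vector can turn a variable number of gate embeddings into a single fixed-size subcircuit vector.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cls_pool(gate_embeddings, cls_token):
    """Pool gate embeddings into one vector, using cls_token as the only query.

    Hypothetical helper for illustration: no learned projections, heads,
    or stacked layers, unlike a full Transformer.
    """
    dim = len(cls_token)
    scale = 1.0 / math.sqrt(dim)
    # Scaled dot-product score between the [CLS] query and each gate embedding.
    scores = [scale * sum(c * g for c, g in zip(cls_token, emb))
              for emb in gate_embeddings]
    weights = softmax(scores)
    # The pooled vector is the attention-weighted average of the gate embeddings.
    pooled = [sum(w * emb[i] for w, emb in zip(weights, gate_embeddings))
              for i in range(dim)]
    return pooled, weights


# Toy subcircuit with three gates and made-up 2-d embeddings.
gates = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
cls = [0.5, 0.5]
pooled, weights = cls_pool(gates, cls)
print(weights)  # the third gate aligns best with cls, so it gets the largest weight
print(pooled)
```

Because the weights come from a softmax over all gates, the pooled vector does not depend on the order in which the gates are listed, and gates that align more closely with the query contribute more. In a trained model the query and the projections would be learned, which is what would let the pooling emphasize the gates that distinguish one subcircuit from another.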