Neural Architecture Search for Text Classification with Limited Computing Resources Using Efficient Cartesian Genetic Programming
Xuan Wu,Di Wang,Huanhuan Chen,Lele Yan,Yubin Xiao,Chunyan Miao,Hongwei Ge,Dong Xu,Yanchun Liang,Kangping Wang,Chunguo Wu,You Zhou
DOI: https://doi.org/10.1109/tevc.2023.3346969
IF: 16.497
2024-01-01
IEEE Transactions on Evolutionary Computation
Abstract:Cartesian Genetic Programming (CGP) has often been applied for Neural Architecture Search (NAS). However, the performance of CGP is less than ideal when searching for architectures with limited computing resources. To better facilitate NAS with limited computing resources, this paper proposes a crossover operator, a light-weighted age mechanism, and two adaptive mutation operators as the novel components in our Efficient Cartesian Genetic Programming (ECGP) method. To assess the performance of ECGP, we conduct extensive experiments on three text classification task datasets. The experimental results demonstrate that ECGP outperforms other NAS methods, requiring only hundreds of fitness evaluations to find architectures with competitive accuracy compared with human-designed models. Additionally, the ECGP-evolved architectures are shown as converging fast and stably, and having high-level transferability with merely a 1-2% accuracy drop. Ablation studies demonstrate the effectiveness of the proposed operators and age mechanism, and identify GRU as the most critical function in the text classification task. Finally, we summarize three design principles observed from the ECGP-evolved architectures that are in line with human-design strategies. To the best of our knowledge, this work introduces the first attention-derived NAS benchmark for the text classification task.