Automatically Design Convolutional Neural Networks by Optimization With Submodularity and Supermodularity

Wenzheng Hu,Junqi Jin,Tie-Yan Liu,Changshui Zhang
DOI: https://doi.org/10.1109/tnnls.2019.2939157
IF: 14.255
2020-09-01
IEEE Transactions on Neural Networks and Learning Systems
Abstract:The architecture of convolutional neural networks (CNNs) is a key factor of influencing their performance. Although deep CNNs perform well in many difficult problems, how to intelligently design the architecture is still a challenging problem. Focusing on two practical architectural design problems: to maximize the accuracy with a given forward running time and to minimize the forward running time with a given accuracy requirement, we innovatively utilize prior knowledge to convert architecture optimization problems into submodular optimization problems. We propose efficient Greedy algorithms to solve them and give theoretical bounds of our algorithms. Specifically, we employ the techniques on some public data sets and compare our algorithms with some other hyperparameter optimization methods. Experiments show our algorithms' efficiency.
computer science, artificial intelligence, theory & methods,engineering, electrical & electronic, hardware & architecture
What problem does this paper attempt to address?