A New Optimization Model for MLP Hyperparameter Tuning: Modeling and Resolution by Real-Coded Genetic Algorithm

Fatima Zahrae El-Hassani,Meryem Amri,Nour-Eddine Joudar,Khalid Haddouch
DOI: https://doi.org/10.1007/s11063-024-11578-0
IF: 2.565
2024-03-15
Neural Processing Letters
Abstract:This paper introduces an efficient real-coded genetic algorithm (RCGA) evolved for constrained real-parameter optimization. This novel RCGA incorporates three specially crafted evolutionary operators: Tournament Selection (RS) with elitism, Simulated Binary Crossover (SBX), and Polynomial Mutation (PM). The application of this RCGA is directed toward optimizing the MLPRGA+5 model. This model is designed to configure Multilayer Perceptron neural networks by optimizing both their architecture and associated hyperparameters, including learning rates, activation functions, and regularization hyperparameters. The objective function employed is the widely recognized learning loss function, commonly used for training neural networks. The integration of this objective function is supported by the introduction of new variables representing MLP hyperparameter values. Additionally, a set of constraints is thoughtfully designed to align with the structure of the Multilayer Perceptron (MLP) and its corresponding hyperparameters. The practicality and effectiveness of the MLPRGA+5 approach are demonstrated through extensive experimentation applied to four datasets from the UCI machine learning repository. The results highlight the remarkable performance of MLPRGA+5, characterized by both complexity reduction and accuracy improvement.
computer science, artificial intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the problem of optimizing hyper - parameters in the multi - layer perceptron (MLP) neural network. Specifically, the paper introduces a new real - coded genetic algorithm (Real - Coded Genetic Algorithm, RCGA), which is specially designed to solve the constrained real - parameter optimization problem. This newly proposed RCGA contains three specially designed evolutionary operators: Tournament Selection with Elitism (RS), Simulated Binary Crossover (SBX) and Polynomial Mutation (PM). These operators work together to optimize the MLPRGA + 5 model, which aims to configure the multi - layer perceptron neural network by optimizing the architecture of the MLP and its related hyper - parameters (such as the learning rate, activation function and regularization hyper - parameters). The objective function of the paper is the widely recognized learning loss function, which is a commonly used metric when training neural networks. To support the application of this objective function, the paper introduces new variables to represent the hyper - parameter values of the MLP and designs a set of constraint conditions that match the MLP structure and its corresponding hyper - parameters. Through these methods, the paper demonstrates the effectiveness and practicality of the MLPRGA + 5 method on datasets in four UCI machine - learning databases. The results show that this method can not only reduce the complexity of the model, but also improve the accuracy.