Abstract:The ability to find optimal molecular structures with desired properties is a popular challenge, with applications in areas such as drug discovery. Genetic algorithms are a common approach to global minima molecular searches due to their ability to search large regions of the energy landscape and decrease computational time via parallelization. In order to decrease the amount of unstable intermediate structures being produced and increase the overall efficiency of an evolutionary algorithm, clustering was introduced in multiple instances. However, there is little literature detailing the effects of differentiating the selection frequencies between clusters. In order to find a balance between exploration and exploitation in our genetic algorithm, we propose a system of clustering the starting population and choosing clusters for an evolutionary algorithm run via a dynamic probability that is dependent on the fitness of molecules generated by each cluster. We define four parameters, MFavOvrAll-A, MFavClus-B, NoNewFavClus-C, and Select-D, that correspond to a reward for producing the best structure overall, a reward for producing the best structure in its own cluster, a penalty for not producing the best structure, and a penalty based on the selection ratio of the cluster, respectively. A reward increases the probability of a cluster's future selection, while a penalty decreases it. In order to optimize these four parameters, we used a Gaussian distribution to approximate the evolutionary algorithm performance of each cluster and performed a grid search for different parameter combinations. Results show parameter MFavOvrAll-A (rewarding clusters for producing the best structure overall) and parameter Select-D (appearance penalty) have a significantly larger effect than parameters MFavClus-B and NoNewFavClus-C. In order to produce the most successful models, a balance between MFavOvrAll-A and Select-D must be made that reflects the exploitation vs exploration trade-off often seen in reinforcement learning algorithms. Results show that our reinforcement-learning-based method for selecting clusters outperforms an unclustered evolutionary algorithm for quinoline-like structure searches.

Online Deterministic Annealing for Classification and Clustering

Annealing Optimization for Progressive Learning With Stochastic Approximation

Annealed discriminant analysis

Stochastic Subnetwork Annealing: A Regularization Technique for Fine Tuning Pruned Subnetworks

A Regularization Framework for Multiclass Classification: A Deterministic Annealing Approach.

A simulated annealing algorithm with a dual perturbation method for clustering

Stochastic Annealing for Variational Inference

Cluster Resource Management for Dynamic Workloads by Online Optimization

Online Hyperparameter Optimization for Class-Incremental Learning

Tuning Reinforcement Learning Parameters for Cluster Selection to Enhance Evolutionary Algorithms

Simple on-the-fly parameter selection mechanisms for two classical discrete black-box optimization benchmark problems

The BYY annealing learning algorithm for Gaussian mixture with automated model selection

Noisy Batch Active Learning with Deterministic Annealing

An efficient optimization approach for designing machine learning models based on genetic algorithm

Parallel Clustering Algorithm by Deterministic Annealing

An Annealing Approach to Byy Harmony Learning on Gaussian Mixture with Automated Model Selection

Evolving Restricted Boltzmann Machine-Kohonen Network for Online Clustering

Partial distortion entropy maximization for online data clustering

Cyclical Log Annealing as a Learning Rate Scheduler

Variable Annealing Length and Parallelism in Simulated Annealing

On the Effectiveness of Simple Success-Based Parameter Selection Mechanisms for Two Classical Discrete Black-Box Optimization Benchmark Problems