Abstract:By efficiently building and exploiting surrogates, data-driven evolutionary algorithms (DDEAs) can be very helpful in solving expensive and computationally intensive problems. However, they still often suffer from two difficulties. First, many existing methods for building a single ad hoc surrogate are suitable for some special problems but may not work well on some other problems. Second, the optimization accuracy of DDEAs deteriorates if available data are not enough for building accurate surrogates, which is common in expensive optimization problems. To this end, this article proposes a novel DDEA with two efficient components. First, a boosting strategy (BS) is proposed for self-aware model managements, which can iteratively build and combine surrogates to obtain suitable surrogate models for different problems. Second, a localized data generation (LDG) method is proposed to generate synthetic data to alleviate data shortage and increase data quantity, which is achieved by approximating fitness through data positions. By integrating the BS and the LDG, the BDDEA-LDG algorithm is able to improve model accuracy and data quantity at the same time automatically according to the problems at hand. Besides, a tradeoff is empirically considered to strike a better balance between the effectiveness of surrogates and the time cost for building them. The experimental results show that the proposed BDDEA-LDG algorithm can generally outperform both traditional methods without surrogates and other state-of-the-art DDEA son widely used benchmarks and an arterial traffic signal timing real-world optimization problem. Furthermore, the proposed BDDEA-LDG algorithm can use only about 2% computational budgets of traditional methods for producing competitive results.

Sample-Based Attribute Selective A$n$ DE for Large Data

Selective AnDE for Large Data Learning: a Low-Bias Memory Constrained Approach

Learning by Extrapolation from Marginal to Full-Multivariate Probability Distributions: Decreasingly Naive Bayesian Classification

MiniAnDE: a reduced AnDE ensemble to deal with microarray data

Alleviating the Attribute Conditional Independence and I.I.D. Assumptions of Averaged One-Dependence Estimator by Double Weighting

Averaged Tree-Augmented One-Dependence Estimators

Big Models for Big Data using Multi objective averaged one dependence estimators

Selective AnDE Based on Attributes Ranking by Maximin Conditional Mutual Information (MMCMI)

Self-Adaptive Attribute Value Weighting for Averaged One-Dependence Estimators.

Accelerating large-scale DEA computation using sequential categorization and dynamic reference set selection

Efficient Data-aware Distance Comparison Operations for High-Dimensional Approximate Nearest Neighbor Search

Extracting Credible Dependencies for Averaged One-Dependence Estimator Analysis

Adaptive Feature Selection With Augmented Attributes

K-Dependence Bayesian Classifier Ensemble

Scalable Bayesian regression in high dimensions with multiple data sources

Attribute Value Weighted Average of One-Dependence Estimators

To Select or to Weigh: A Comparative Study of Linear Combination Schemes for SuperParent-One-Dependence Estimators

AdaSelection: Accelerating Deep Learning Training through Data Subsampling

Alleviating the independence assumptions of averaged one-dependence estimators by model weighting

Boosting Data-Driven Evolutionary Algorithm With Localized Data Generation

Adaptive Data Optimization: Dynamic Sample Selection with Scaling Laws