Abstract:The genetic algorithm is a kind of evolutionary algorithms, which is widely used to solve optimization problems. It still faces the challenge of computing efficiency when dealing with large-scale optimization problems, which motivates the research on parallel genetic algorithm (PGA) to accelerate it. Sunway TaihuLight system is the first supercomputer with a peak performance of over 125 PFlops, which is powered by SW26010 heterogeneous many-core processors. Among the common PGA models, the hierarchical model fits the SW26010 best due to its heterogeneity. In this paper, we propose a heterogeneous parallel genetic algorithm (swGA) on Sunway TaihuLight system, which implements process-level and thread-level parallelism. In the process-level parallelism, we propose an island model based on a migration pool, which preserves the elitists and provides the globally stochastic individuals to be immigrated for islands. In addition, an interleaving emigration scheme and a dynamic immigration scheme are proposed to reduce the communication overhead of migration. In the thread-level parallelism, we propose the island/master-slave, island/cellular, and island/island models based on the unique mechanisms of SW26010. We optimize these models by using some optimization methods such as direct memory access, register-level communication, double buffering, vectorization. The experimental results show that the island model of swGA has a better search ability compared with a directed migration based island model and a serial GA. Based on the island model, the three hierarchical models all have satisfied search ability and acceleration effect. Among them, the island/island model has the best search ability, and the island/cellular model has the best acceleration effect, which achieves a speedup up to 3603.13× on 64 CGs.

Parallel Design and Optimization of Galaxy Group Finding Algorithm on Comparation of SGI and Distributed-memory Cluster

A parallel computing method for irregular work

Optimizing Irregular Memory Access in Astrophysical Clustering Studies

An Implementation of Parallel MLFMA on a Cluster of Computers with Distributed Memory

A Hierarchical Grid Algorithm for Accelerating High-Performance Conjugate Gradient Benchmark on Sunway Many-Core Processor

Enabling and Scaling the HPCG Benchmark on the Newest Generation Sunway Supercomputer with 42 Million Heterogeneous Cores

AN PARALLEL AND DISTRIBUTED PROGRAMMING SOLUTION BASED ON HETEROGENEOUS GPU CLUSTER

Highly Scalable Parallel Genetic Algorithm on Sunway Many-Core Processors

Parallelization and Optimization of NSGA-II on Sunway TaihuLight System

A New Hybrid Hierarchical Parallel Algorithm to Enhance the Performance of Large-Scale Structural Analysis Based on Heterogeneous Multicore Clusters

A Heterogeneous Parallel Genetic Algorithm Based on SW26010 Processors

Parallel-SG: research of parallel graphics rendering system on PC-Cluster.

Parallel Genetic Algorithm for Design of HEN Based on OpenMP System

Pipelining Computation and Optimization Strategies for Scaling GROMACS on the Sunway Many-Core Processor.

Efficient Hierarchical Parallel Genetic Algorithms Using Grid Computing

Effectively Utilizing Global Cluster Memory for Large Data-Intensive Parallel Programs

Acceleration of a Production-Level Unstructured Grid Finite Volume CFD Code on GPU

Parallelization of Spectral Clustering Algorithm on Multi-Core Processors and GPGPU

Parallel Implementation for Dynamic Mesh Optimization on Distributed Computer System.

Development and performance optimization of a parallel computing infrastructure for an unstructured-mesh modelling framework

Parallel Multiple Sequences Alignment in SMP Cluster