Protein structure prediction as a hard optimization problem: the genetic algorithm approach

Mehul M. Khimasia,Peter V. Coveney
DOI: https://doi.org/10.48550/arXiv.physics/9708012
1997-08-11
Abstract:Protein structure prediction can be shown to be an NP-hard problem; the number of conformations grows exponentially with the number of residues. The native conformations of proteins occupy a very small subset of these, hence an exploratory, robust search algorithm, such as a genetic algorithm (GA), is required. The dynamics of GAs tend to be complicated and problem-specific. However, their empirical success warrants their further study. In this paper, guidelines for the design of genetic algorithms for protein structure prediction are determined. To accomplish this, the performance of the simplest genetic algorithm is investigated for simple lattice-based protein structure prediction models (which is extendible to real-space), using energy minimization. The study has led us to two important conclusions for `protein-structure-prediction-genetic-algorithms'. Firstly, they require high resolution building blocks attainable by multi-point crossovers and secondly they require a local dynamics operator to `fine tune' good conformations. Furthermore, we introduce a statistical mechanical approach to analyse the genetic algorithm dynamics and suggest a convergence criterion using a quantity analogous to the free energy of population.
Chemical Physics
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is the modeling of price fluctuations in financial markets, especially how to more accurately describe the price fluctuation characteristics in financial time series. Specifically, the authors are concerned with the complexity of price fluctuations in financial markets, including the correlation of volatility and the fat - tail characteristics of the probability distribution function (pdf). Traditional methods, such as the Gaussian assumption and ARCH/GARCH models, have limitations in capturing these characteristics. Therefore, this paper proposes using wavelet analysis to decompose the volatility of intraday (S&P500) return data, and by studying the two - point - half correlation functions on different time scales, reveals the information cascade phenomenon from large - scale (low - frequency, i.e., "infrared") to fine - scale (high - frequency, i.e., "ultraviolet"). The authors also quantify and visualize the information flow across scales and attempt to explain their findings from the perspective of market dynamics. They introduce a stochastic multiplicative cascade model to explain the observed long - term correlations, which is similar to the concepts in turbulence theory but is applied to the volatility of financial markets rather than price increments. This model can better explain the multi - scale characteristics of volatility and provides a new perspective for understanding financial market dynamics.