Visualizing genetic constraints

Travis L. Gaydos,Nancy E. Heckman,Mark Kirkpatrick,J. R. Stinchcombe,Johanna Schmitt,Joel Kingsolver,J. S. Marron
DOI: https://doi.org/10.1214/12-AOAS603
2013-12-06
Abstract:Principal Components Analysis (PCA) is a common way to study the sources of variation in a high-dimensional data set. Typically, the leading principal components are used to understand the variation in the data or to reduce the dimension of the data for subsequent analysis. The remaining principal components are ignored since they explain little of the variation in the data. However, evolutionary biologists gain important insights from these low variation directions. Specifically, they are interested in directions of low genetic variability that are biologically interpretable. These directions are called genetic constraints and indicate directions in which a trait cannot evolve through selection. Here, we propose studying the subspace spanned by low variance principal components by determining vectors in this subspace that are simplest. Our method and accompanying graphical displays enhance the biologist's ability to visualize the subspace and identify interpretable directions of low genetic variability that align with simple directions.
Applications
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to explore genetic constraints, that is, in evolutionary biology, how low genetic variation in certain directions limits the possibility of specific traits evolving through natural selection or artificial selection. Specifically, the paper proposes a method for studying the sub - spaces spanned by low - variance principal components. By determining the simplest vectors in these sub - spaces, it enhances biologists' understanding of these sub - spaces and identifies directions of low genetic variation with biological significance. The main contributions of the paper are as follows: 1. **Proposing a method**: A new method has been developed to analyze and visualize the directions of low genetic variation, which helps to understand which traits are genetically restricted during the evolutionary process. 2. **Biological background**: It details the roles of genetic variation and selection in evolution, especially explaining how genetic variation affects evolutionary changes caused by selection through the Breeder's equation. 3. **Data analysis**: The proposed method is applied to analyze two data sets, one is the height measurement data of Impatiens capensis at different ages, and the other is the growth rate data of Pieris rapae larvae at different temperatures. Through these analyses, the paper aims to help biologists better understand the existence of genetic constraints and their biological significance, thereby providing new perspectives and tools for evolutionary biology research.