Abstract:Reaction coordinates (RCs) are the few essential coordinates of a protein that control its functional processes, such as allostery, enzymatic reaction, and conformational change. They are critical for understanding protein function and provide optimal enhanced sampling of protein conformational changes and states. Since the pioneering works in the late 1990s, identifying the correct and objectively provable RCs has been a central topic in molecular biophysics and chemical physics. This review summarizes the major advances in identifying RCs over the past 25 years, focusing on methods aimed at finding RCs that meet the rigorous committor criterion, widely accepted as the true RCs. Importantly, the newly developed physics-based energy flow theory and generalized work functional method provide a general and rigorous approach for identifying true RCs, revealing their physical nature as the optimal channels of energy flow in biomolecules.
What problem does this paper attempt to address?
The paper is primarily dedicated to addressing the problem of identifying reaction coordinates (RCs) in protein molecules. Specifically, the article explores the following points:
1. **The Concept and Importance of Reaction Coordinates**:
- Reaction coordinates are the key coordinates in proteins that control their functional processes (such as allosteric effects, enzymatic reactions, and conformational changes).
- They are crucial for understanding protein function and provide the best enhanced sampling methods for protein conformational changes and states.
2. **Theoretical Foundations and Challenges**:
- The paper reviews significant progress in identifying reaction coordinates over the past 25 years, with a particular focus on methods that meet the stringent "commitment operator criteria."
- Newly developed energy flow theory and generalized work functional methods offer a universal and rigorous approach to identifying true reaction coordinates, revealing their physical nature as the optimal channels for energy flow in biomolecules.
3. **Understanding Activation Processes**:
- Protein functions (such as enzymatic reactions, allosteric effects, etc.) are controlled by transitions between conformations, which often require overcoming energy barriers.
- Understanding these transition processes requires an understanding of the dynamics of activation processes.
4. **Definition and Validation of Reaction Coordinates**:
- Reaction coordinates are defined as a few key coordinates that fully determine the commitment operator value of the system in any conformation.
- The existence and correctness of reaction coordinates can be objectively verified through commitment operator tests.
5. **Application of Machine Learning Methods**:
- The paper discusses several attempts to identify reaction coordinates using machine learning methods, including genetic algorithms combined with neural network (GNN) methods.
- These methods optimize candidate reaction coordinates by training on the distribution of commitment operators in the dataset.
6. **Methods Based on Physical Principles**:
- In addition to machine learning methods, the paper also explores methods based on physical principles of reaction rate theory, such as minimizing recrossing, maximizing reaction flux paths, and the lowest energy or free energy paths.
In summary, the paper aims to provide a systematic framework for identifying reaction coordinates in proteins through theoretical analysis, experimental validation, and machine learning methods, thereby better understanding the functional mechanisms of proteins.