MuLAN: Mutation-driven Light Attention Networks for investigating protein-protein interactions from sequences

Gianluca Lombardi, Alessandra Carbone
DOI: https://doi.org/10.1101/2024.08.24.609515
2024-08-26
Abstract: Understanding how proteins interact and how mutations affect these interactions is crucial for unraveling the complexities of biological systems and their evolution. Mutations can significantly alter protein behavior, impacting stability, interactions, and activity, thereby affecting cellular functions and influencing disease development and treatment effectiveness. Experimental methods for examining protein interactions are often slow and costly, highlighting the need for efficient computational strategies. We present MuLAN, a groundbreaking deep learning method that leverages light attention networks and the power of pre-trained protein language models to infer protein interactions, predict binding affinity changes, and reconstruct mutational landscapes for proteins involved in binary interactions, starting from mutational changes and directly using sequence data only. Unlike previous methods that depend heavily on structural information, MuLAN's sequence-based approach offers faster and more accessible predictions. This innovation allows for variations in predictions based on specific partners, opening new possibilities for understanding protein behavior through their sequences. The potential implications for disease research and drug development mark a significant step forward in the computational analysis of protein interactions.
Bioinformatics
What problem does this paper attempt to address?
The paper addresses how proteins interact and how mutations affect those interactions, using sequence data alone. Specifically, the study focuses on three tasks:

1. **Predicting changes in binding affinity**: Predicting the change in binding affinity (ΔΔG) of a binary complex caused by mutations, using sequence data.
2. **Identifying interaction interfaces**: Identifying the interaction interfaces of proteins in complexes from sequence information.
3. **Reconstructing mutational landscapes**: Reconstructing the mutational landscape of a protein in a binary complex for a specific interaction partner, to better understand the impact of mutations on protein behavior.

Experimental methods, while reliable, are time-consuming and costly, and many earlier computational methods depend heavily on structural information. The paper therefore proposes MuLAN, a deep learning method built on light attention networks that makes predictions from sequence data only. MuLAN uses pre-trained protein language models to extract features and passes them through convolutional light attention blocks, which produce interpretable representations used for the three tasks above. According to the authors, this sequence-based approach gives faster and more accessible predictions than structure-based methods, and it allows predictions to vary with the specific interaction partner.
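The summary does not spell out MuLAN's layers, so the following is only a minimal NumPy sketch of the general light-attention pooling idea behind the attention blocks described above: two 1D convolutions over per-residue embeddings produce values and attention scores, a softmax over the sequence axis turns the scores into weights, and the weighted sum is concatenated with a max-pool. Random arrays stand in for protein language model embeddings, and every name and size here (`light_attention_pool`, `init_params`, the kernel width, the dimensions) is an illustrative assumption, not the authors' code.

```python
import numpy as np


def conv1d_same(x, w, b):
    """1D convolution along the sequence axis with zero padding.

    x: (L, d_in) per-residue features
    w: (k, d_in, d_out) kernel with odd width k
    b: (d_out,) bias
    Returns an array of shape (L, d_out).
    """
    k = w.shape[0]
    pad = k // 2
    xp = np.pad(x, ((pad, pad), (0, 0)))
    length = x.shape[0]
    out = np.empty((length, w.shape[2]))
    for i in range(length):
        window = xp[i:i + k]  # (k, d_in) slice centred on residue i
        out[i] = np.tensordot(window, w, axes=([0, 1], [0, 1])) + b
    return out


def softmax(z, axis):
    """Numerically stable softmax along the given axis."""
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)


def init_params(d_in, d_out, k, rng):
    """Random weights for the value and attention convolutions (hypothetical)."""
    scale = 1.0 / np.sqrt(k * d_in)
    return {
        "w_val": rng.normal(scale=scale, size=(k, d_in, d_out)),
        "b_val": np.zeros(d_out),
        "w_att": rng.normal(scale=scale, size=(k, d_in, d_out)),
        "b_att": np.zeros(d_out),
    }


def light_attention_pool(emb, params):
    """Pool a variable-length embedding (L, d_in) into a fixed-size vector.

    Returns (vector of size 2 * d_out, per-residue weights of shape (L, d_out)).
    """
    values = conv1d_same(emb, params["w_val"], params["b_val"])
    scores = conv1d_same(emb, params["w_att"], params["b_att"])
    weights = softmax(scores, axis=0)  # each channel sums to 1 over residues
    attended = (weights * values).sum(axis=0)
    pooled_max = values.max(axis=0)
    return np.concatenate([attended, pooled_max]), weights


rng = np.random.default_rng(0)
params = init_params(d_in=16, d_out=8, k=9, rng=rng)
embedding = rng.normal(size=(50, 16))  # stand-in for a 50-residue embedding
vector, weights = light_attention_pool(embedding, params)
```

In this sketch the pooled vector has the same size for any sequence length, and the per-residue `weights` can be inspected to see which positions contribute most to each channel, which is one plausible reading of the interpretability the summary mentions.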