Towards the accurate modelling of antibody-antigen complexes from sequence using machine learning and information-driven docking

Marco Giulini,Constantin Schneider,Daniel Cutting,Nikita Desai,Charlotte M Deane,Alexandre M J J Bonvin
DOI: https://doi.org/10.1093/bioinformatics/btae583
IF: 5.8
2024-10-01
Bioinformatics
Abstract:Motivation: Antibody-antigen complex modelling is an important step in computational workflows for therapeutic antibody design. While experimentally determined structures of both antibody and the cognate antigen are often not available, recent advances in machine learning-driven protein modelling have enabled accurate prediction of both antibody and antigen structures. Here, we analyse the ability of protein-protein docking tools to use machine learning generated input structures for information-driven docking. Results: In an information-driven scenario, we find that HADDOCK can generate accurate models of antibody-antigen complexes using an ensemble of antibody structures generated by machine learning tools and AlphaFold2 predicted antigen structures. Targeted docking using knowledge of the complementary determining regions on the antibody and some information about the targeted epitope allows the generation of high-quality models of the complex with reduced sampling, resulting in a computationally cheap protocol that outperforms the ZDOCK baseline. Availability and implementation: The source code of HADDOCK3 is freely available at github.com/haddocking/haddock3. The code to generate and analyse the data is available at github.com/haddocking/ai-antibodies. The full runs, including docking models from all modules of a workflow have been deposited in our lab collection (data.sbgrid.org/labs/32/1139) at the SBGRID data repository.
What problem does this paper attempt to address?