De novo antibody design with SE(3) diffusion

Daniel Cutting,Frédéric A. Dreyer,David Errington,Constantin Schneider,Charlotte M. Deane
2024-05-13
Abstract:We introduce IgDiff, an antibody variable domain diffusion model based on a general protein backbone diffusion framework which was extended to handle multiple chains. Assessing the designability and novelty of the structures generated with our model, we find that IgDiff produces highly designable antibodies that can contain novel binding regions. The backbone dihedral angles of sampled structures show good agreement with a reference antibody distribution. We verify these designed antibodies experimentally and find that all express with high yield. Finally, we compare our model with a state-of-the-art generative backbone diffusion model on a range of antibody design tasks, such as the design of the complementarity determining regions or the pairing of a light chain to an existing heavy chain, and show improved properties and designability.
Biomolecules,Machine Learning
What problem does this paper attempt to address?
This paper introduces a novel antibody variable region design model called IgDiff, which is based on the SE(3) diffusion framework and extends to handle multi-chain structures. The goal of IgDiff is to generate antibody structures with strong designability and novel binding regions, whose main chain dihedral angles are well consistent with the distribution of reference antibodies. The designed antibodies have been experimentally validated and shown to be highly expressible. In the study, the authors evaluated the designability and novelty of structures generated by the IgDiff model and found that they contain designable antibodies and unique binding sites. The main chain dihedral angles of the model align with the distribution of antibodies in the reference dataset, and experimental evidence demonstrates high production expression of all designed antibodies. Furthermore, IgDiff outperforms state-of-the-art generative main chain diffusion models in a range of antibody design tasks, such as complementarity-determining region design or pairing of light chains with existing heavy chains, exhibiting superior properties and designability. IgDiff is adapted from the recent main chain diffusion model FrameDiff specifically for antibody variable regions. It is fine-tuned on a synthetic antibody structural dataset and uses the antibody-specific inverse folding model AbMPNN to predict the corresponding sequences. Through comparisons on various design tasks, the superiority of IgDiff in antibody engineering tasks is proven, providing a data-driven AI approach for drug design, especially in designing novel proteins with specific functionalities.