Protein structure, a genetic encoding for glycosylation
Ben Kellman,Daniel Sandoval,Olga Zaytseva,Kelly Brock,Sabyasachi Baboo,Daniela Nachmanson,Edward Irvine,Erick Armingol,Nathan Mih,Yujie Zhang,Mia Jeffris,Philip Bartels,Thi Nguyen,Amy Tam,Sarah Gasman,Shlomi Ilan,Isaac Shamie,Jolene Diedrich,Xiaoning Wang,Ester van Woudenbergh,Meghan Altman,Anthony Aylward,Bokan Bao,Andrea Castro,James Sorrentino,Austin Chiang,Matt Campbell,Yannic Bartsch,Patricia Aguilar-Calvo,Christina Sigurdson,Galit Alter,Gordan Lauc,John Yates III,Debora Marks,Frederique Lisacek,Nathan Lewis
DOI: https://doi.org/10.1101/2024.05.15.594261
2024-05-23
Abstract:DNA, RNA, and proteins are synthesized using template molecules, but glycosylation is not believed to be constrained by a template. However, if cellular environment is the only determinant of glycosylation, all sites should receive the same glycans on average. This template-free assertion is inconsistent with observations of microheterogeneity-wherein each site receives distinct and reproducible glycan structures. Here, we test the assumption of template-free glycan biosynthesis. Through structural analysis of site-specific glycosylation data, we find protein-sequence and structural features that predict specific glycan features. To quantify these relationships, we present a new amino acid substitution matrix that describes glycoimpact-how glycosylation varies with protein structure. High-glycoimpact amino acids co-evolve with glycosites, and glycoimpact is high when estimates of amino acid conservation and variant pathogenicity diverge. We report hundreds of disease variants near glycosites with high-glycoimpact, including several with known links to aberrant glycosylation (e.g., Oculocutaneous Albinism, Jakob-Creutzfeldt disease, Gerstmann-Straussler-Scheinker, and Gaucher's Disease). Finally, we validate glycoimpact quantification by studying oligomannose-complex glycan ratios on HIV ENV, differential sialylation on IgG3 Fc, differential glycosylation on SARS-CoV-2 Spike, and fucose-modulated function of a tuberculosis monoclonal antibody. In all, we show glycan biosynthesis is accurately guided by specific, genetically-encoded rules, and this presents a plausible refutation to the assumption of template-free glycosylation.
Genetics