Community evaluation of glycoproteomics informatics solutions reveals high-performance search strategies for serum glycopeptide analysis

Rebeca Kawahara,Anastasia Chernykh,Kathirvel Alagesan,Marshall Bern,Weiqian Cao,Robert J. Chalkley,Kai Cheng,Matthew S. Choo,Nathan Edwards,Radoslav Goldman,Marcus Hoffmann,Yingwei Hu,Yifan Huang,Jin Young Kim,Doron Kletter,Benoit Liquet,Mingqi Liu,Yehia Mechref,Bo Meng,Sriram Neelamegham,Terry Nguyen-Khuong,Jonas Nilsson,Adam Pap,Gun Wook Park,Benjamin L. Parker,Cassandra L. Pegg,Josef M. Penninger,Toan K. Phung,Markus Pioch,Erdmann Rapp,Enes Sakalli,Miloslav Sanda,Benjamin L. Schulz,Nichollas E. Scott,Georgy Sofronov,Johannes Stadlmann,Sergey Y. Vakhrushev,Christina M. Woo,Hung-Yi Wu,Pengyuan Yang,Wantao Ying,Hui Zhang,Yong Zhang,Jingfu Zhao,Joseph Zaia,Stuart M. Haslam,Giuseppe Palmisano,Jong Shin Yoo,Göran Larson,Kai-Hooi Khoo,Katalin F. Medzihradszky,Daniel Kolarich,Nicolle H. Packer,Morten Thaysen-Andersen
DOI: https://doi.org/10.1038/s41592-021-01309-x
IF: 48
2021-11-01
Nature Methods
Abstract:Abstract Glycoproteomics is a powerful yet analytically challenging research tool. Software packages aiding the interpretation of complex glycopeptide tandem mass spectra have appeared, but their relative performance remains untested. Conducted through the HUPO Human Glycoproteomics Initiative, this community study, comprising both developers and users of glycoproteomics software, evaluates solutions for system-wide glycopeptide analysis. The same mass spectrometry based glycoproteomics datasets from human serum were shared with participants and the relative team performance for N- and O -glycopeptide data analysis was comprehensively established by orthogonal performance tests. Although the results were variable, several high-performance glycoproteomics informatics strategies were identified. Deep analysis of the data revealed key performance-associated search parameters and led to recommendations for improved ‘high-coverage’ and ‘high-accuracy’ glycoproteomics search solutions. This study concludes that diverse software packages for comprehensive glycopeptide data analysis exist, points to several high-performance search strategies and specifies key variables that will guide future software developments and assist informatics decision-making in glycoproteomics.
biochemical research methods
What problem does this paper attempt to address?