Worldwide Protein Data Bank Biocuration Supporting Open Access to High-Quality 3D Structural Biology Data

Jasmine Y. Young, John D. Westbrook, Zukang Feng, Ezra Peisach, Irina Persikova, Raul Sala, Sanchayita Sen, John M. Berrisford, G. Jawahar Swaminathan, Thomas J. Oldfield, Aleksandras Gutmanas, Reiko Igarashi, David R. Armstrong, Kumaran Baskaran, Li Chen, Minyu Chen, Alice R. Clark, Luigi Di Costanzo, Dimitris Dimitropoulos, Guanghua Gao, Sutapa Ghosh, Swanand Gore, Vladimir Guranovic, Pieter M. S. Hendrickx, Brian P. Hudson, Yasuyo Ikegawa, Yumiko Kengaku, Catherine L. Lawson, Yuhe Liang, Lora Mak, Abhik Mukhopadhyay, Buvaneswari Narayanan, Kayoko Nishiyama, Ardan Patwardhan, Gaurav Sahni, Eduardo Sanz-Garcia, Junko Sato, Monica R. Sekharan, Chenghua Shao, Oliver S. Smart, Lihua Tan, Glen van Ginkel, Huanwang Yang, Marina A. Zhuravleva, John L. Markley, Haruki Nakamura, Genji Kurisu, Gerard J. Kleywegt, Sameer Velankar, Helen M. Berman, Stephen K. Burley
DOI: https://doi.org/10.1093/database/bay002
2018-01-01
Database
Abstract: The Protein Data Bank (PDB) is the single global repository for experimentally determined 3D structures of biological macromolecules and their complexes with ligands. The worldwide PDB (wwPDB) is the international collaboration that manages the PDB archive according to the FAIR principles: Findability, Accessibility, Interoperability and Reusability. The wwPDB recently developed OneDep, a unified tool for deposition, validation and biocuration of structures of biological macromolecules. All data deposited to the PDB undergo critical review by wwPDB Biocurators. This article outlines the importance of biocuration for structural biology data deposited to the PDB and describes wwPDB biocuration processes and the role of expert Biocurators in sustaining a high-quality archive. Structural data submitted to the PDB are examined for self-consistency, standardized using controlled vocabularies, cross-referenced with other biological data resources and validated for scientific/technical accuracy. We illustrate how biocuration is integral to PDB data archiving, as it facilitates accurate, consistent and comprehensive representation of biological structure data, allowing efficient and effective usage by research scientists, educators, students and the curious public worldwide.