ModelCIF: an Extension of PDBx/mmCIF Data Representation for Computed Structure Models

Brinda Vallat, Gerardo Tauriello, Stefan Bienert, Juergen Haas, Benjamin M. Webb, Augustin Zidek, Wei Zheng, Ezra Peisach, Dennis W. Piehl, Ivan Anischanka, Ian Sillitoe, James Tolchard, Mihaly Varadi, David Baker, Christine Orengo, Yang Zhang, Jeffrey C. Hoch, Genji Kurisu, Ardan Patwardhan, Sameer Velankar, Stephen K. Burley, Andrej Sali, Torsten Schwede, Helen M. Berman, John D. Westbrook
DOI: https://doi.org/10.1016/j.jmb.2023.168021
IF: 6.151
2023-01-01
Journal of Molecular Biology
Abstract: ModelCIF (github.com/ihmwg/ModelCIF) is a data information framework developed for and by computational structural biologists to enable delivery of Findable, Accessible, Interoperable, and Reusable (FAIR) data to users worldwide. ModelCIF describes the specific set of attributes and metadata associated with macromolecular structures modeled solely by computational methods and provides an extensible data representation for deposition, archiving, and public dissemination of predicted three-dimensional (3D) models of macromolecules. It is an extension of the Protein Data Bank Exchange/macromolecular Crystallographic Information Framework (PDBx/mmCIF), which is the global data standard for representing experimentally determined 3D structures of macromolecules and associated metadata. The PDBx/mmCIF framework and its extensions (e.g., ModelCIF) are managed by the Worldwide Protein Data Bank partnership (wwPDB, wwpdb.org) in collaboration with relevant community stakeholders such as the wwPDB ModelCIF Working Group (wwpdb.org/task/modelcif). This semantically rich and extensible data framework for representing computed structure models (CSMs) accelerates the pace of scientific discovery. Herein, we describe the architecture, contents, and governance of ModelCIF, and tools and processes for maintaining and extending the data standard. Community tools and software libraries that support ModelCIF are also described. © 2023 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).