MatrisomeDB 2.0: 2023 updates to the ECM-protein knowledge database

Xinhao Shao, Clarissa D Gomez, Nandini Kapoor, James M Considine, Christopher Grams, Alexandra Naba, Yu (Tom) Gao
DOI: https://doi.org/10.1093/nar/gkac1009
IF: 14.9
2022-11-20
Nucleic Acids Research
Abstract: The extracellular matrix (ECM) is a complex assembly of proteins that constitutes the scaffold organizing cells, tissues, and organs. Over the past decade, mass-spectrometry-based proteomics has become the method of choice to profile the composition of the ECM, or the matrisome, of tissues. To assist non-specialists with the reuse of ECM proteomic datasets, we released MatrisomeDB (https://matrisomedb.org) in 2020. Here, we report the expansion of the database to include 25 new curated studies on the ECM of 24 new tissues, in addition to datasets on tissues previously included, more than doubling the size of the original database and achieving near-complete coverage of the in-silico predicted matrisome. We further enhanced data visualization by mapping detected peptides and post-translational modifications onto domain-based representations and 3D structures of ECM proteins. We also referenced external resources to facilitate the design of targeted mass spectrometry assays. Last, we implemented an abstract-mining tool that generates an enrichment word cloud from abstracts of studies in which a queried protein is found with higher confidence and higher abundance relative to other studies in MatrisomeDB.
Graphical abstract: The main steps of the workflow developed to expand the content and functionalities of MatrisomeDB.
biochemistry & molecular biology