The W3C Data Catalog Vocabulary, Version 2: Rationale, Design Principles, and Uptake

Riccardo Albertoni,David Browning,Simon Cox,Alejandra N. Gonzalez-Beltran,Andrea Perego,Peter Winstanley
DOI: https://doi.org/10.1162/dint_a_00241
2023-12-18
Data Intelligence
Abstract:Abstract DCAT is an RDF vocabulary designed to facilitate interoperability between data catalogs published on the Web. Since its first release in 2014 as a W3C Recommendation, DCAT has seen a wide adoption across communities and domains, particularly in conjunction with implementing the FAIR data principles (for findable, accessible, interoperable and reusable data). These implementation experiences, besides demonstrating the fitness of DCAT to meet its intended purpose, helped identify existing issues and gaps. Moreover, over the last few years, additional requirements emerged in data catalogs, given the increasing practice of documenting not only datasets but also data services and APIs. This paper illustrates the new version of DCAT, explaining the rationale behind its main revisions and extensions, based on the collected use cases and requirements, and outlines the issues yet to be addressed in future versions of DCAT.
computer science, information systems
What problem does this paper attempt to address?