FAIR Digital Objects for the Realization of Globally Aligned Data Spaces

Nicolas Blumenroehr, Philipp-Joachim Ost, Felix Kraus, Achim Streit
2024-11-27
Abstract: The FAIR principles are globally accepted guidelines for improved data management practices with the potential to align data spaces on a global scale. In practice, this is only marginally achieved through the different ways in which organizations interpret and implement these principles. The concept of FAIR Digital Objects provides a way to realize a domain-independent abstraction layer that could solve this problem, but its specifications are currently diverse, contradictory, and restricted to semantic models. In this work, we introduce a rigorously formalized data model with a set of assertions using formal expressions to provide a common baseline for the implementation of FAIR Digital Objects. The model defines how these objects enable machine-actionable decisions based on the principles of abstraction, encapsulation, and entity relationship to fulfill FAIR criteria for the digital resources they represent. We provide implementation examples in the context of two use cases and explain how our model can facilitate the (re)use of data across domains. We also compare how our model assertions are met by FAIR Digital Objects as they have been described in other projects. Finally, we discuss our results' adoption criteria, limitations, and perspectives in the big data context. Overall, our work represents an important milestone for various communities working towards globally aligned data spaces through FAIRification.
Databases
What problem does this paper attempt to address?
The paper addresses the fact that data spaces are not aligned on a global scale, because organizations interpret and implement the FAIR principles (findability, accessibility, interoperability, and reusability of data) in different ways. Specifically:

1. **Interoperability problems in data spaces**: The lack of interoperability between data spaces stems mainly from technical, semantic, and governance challenges. These problems are particularly prominent in the big data context, where the "5V" characteristics (volume, variety, velocity, veracity, value) make interoperability even harder to achieve.
2. **Diversity in the implementation of FAIR principles**: The FAIR principles provide guidelines, but each organization interprets and implements them according to its own needs and practices. As a result, data spaces are only partially interoperable with one another, and in some cases not at all.
3. **Lack of a standardized abstraction layer**: There is currently no high-level, domain-independent abstraction layer that provides a unified FAIR-compatible representation without changing the native configuration of individual data spaces. Such a layer would let each data space keep full control over its digital resources while still enabling cross-domain data sharing and reuse.

To address these problems, the paper introduces a rigorously formalized data model with a set of assertions, stated as formal expressions, that serves as a common baseline for implementing FAIR Digital Objects (FDOs). The model defines how these objects enable machine-actionable decisions based on the principles of abstraction, encapsulation, and entity relationship. The paper also provides implementation examples for two use cases and explains how the model facilitates cross-domain data reuse.

### Specific problem summary

- **Insufficient interoperability**: Data spaces interoperate poorly with one another, which makes data sharing and reuse inefficient.
- **Diverse implementation of FAIR principles**: Organizations interpret and implement the FAIR principles differently, which makes it difficult to align data spaces globally.
- **Lack of a standardized abstraction layer**: A high-level, domain-independent abstraction layer is needed to implement the FAIR principles uniformly without changing the native configuration of each data space.

By introducing a rigorously formalized FDO data model, the paper aims to provide an important milestone towards globally aligned data spaces.
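As a rough illustration of the abstraction-layer idea summarized above, the following Python sketch shows how a client might make a machine-actionable decision from an FDO record alone, without inspecting the underlying resource. The class name, field names, type labels, and operation names are all assumptions made for illustration; they are not taken from the paper's formal model or from any FDO specification.

```python
from dataclasses import dataclass, field


@dataclass
class FairDigitalObject:
    """Hypothetical, simplified FDO record (not the paper's formal model)."""

    pid: str                      # persistent identifier of the object
    fdo_type: str                 # type label a machine can dispatch on
    attributes: dict = field(default_factory=dict)  # typed metadata (abstraction)
    resource_location: str = ""   # pointer to the native resource (encapsulation)
    related: tuple = ()           # PIDs of related FDOs (entity relationship)


# Hypothetical mapping from type labels to operations a client supports.
SUPPORTED_OPERATIONS = {
    "image/tiff": "render_thumbnail",
    "text/csv": "load_table",
}


def choose_operation(fdo: FairDigitalObject):
    """Select an operation using only the FDO record.

    Returns the operation name, or None if the client has no
    operation registered for this FDO type.
    """
    return SUPPORTED_OPERATIONS.get(fdo.fdo_type)


if __name__ == "__main__":
    table = FairDigitalObject(
        pid="example/123",  # placeholder identifier
        fdo_type="text/csv",
        resource_location="https://example.org/data.csv",  # placeholder URL
    )
    print(choose_operation(table))
```

In this sketch the native resource is never opened: the decision depends only on the uniform record, which is the property a domain-independent layer is meant to provide while each data space keeps control of its own storage.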