Fedora: An Architecture for Complex Objects and their Relationships

Carl Lagoze, Sandy Payette, Edwin Shin, Chris Wilper
DOI: https://doi.org/10.48550/arXiv.cs/0501012
2005-08-23
Abstract: The Fedora architecture is an extensible framework for the storage, management, and dissemination of complex objects and the relationships among them. Fedora accommodates the aggregation of local and distributed content into digital objects and the association of services with objects. This allows an object to have several accessible representations, some of them dynamically produced. The architecture includes a generic RDF-based relationship model that represents relationships among objects and their components. Queries against these relationships are supported by an RDF triple store. The architecture is implemented as a web service, with all aspects of the complex object architecture and related management functions exposed through REST and SOAP interfaces. The implementation is available as open-source software, providing the foundation for a variety of end-user applications for digital libraries, archives, institutional repositories, and learning object systems.
Digital Libraries, Multimedia
What problem does this paper attempt to address?
This paper addresses how to store, manage, and disseminate complex digital objects and the relationships among them. Specifically, the Fedora architecture provides an extensible framework for handling complex objects and their interrelationships. These objects can aggregate local and distributed content and can have services associated with them, so a single object can have multiple accessible representations, some of which are generated dynamically.

### Main Problems

1. **Representation and Management of Complex Objects**: How to represent and manage digital content with complex structures and semantic relationships.
2. **Flexibility and Scalability**: How to build a flexible and scalable system that adapts to different types and sources of digital content.
3. **Integration of Distributed Content**: How to integrate content held in different locations so that it can be aggregated seamlessly.
4. **Dynamic Content Generation**: How to support dynamic or computed content produced by service operations.
5. **Relationship Query**: How to query and manage the complex relationships among objects.

### Solutions

The Fedora architecture addresses these problems in the following ways:

- **RDF-based Relationship Model**: Use RDF (Resource Description Framework) to represent the relationships among objects and their components, and answer queries over those relationships with an RDF triple store.
- **Web Service Interfaces**: Expose all aspects of the complex object architecture and the related management functions through REST and SOAP interfaces, which supports interoperability.
- **Open-source Implementation**: Provide an open-source implementation as a foundation for end-user applications such as digital libraries, archives, institutional repositories, and learning object systems.
- **Datastreams and Disseminators**: Support the representation of static and dynamic content through the Datastream and Disseminator mechanisms.
- **Integrity Management**: Define entities such as relationship metadata, access control policies, and audit trails to help maintain the integrity and security of digital objects.

### Formula Representation

Some of the key concepts can be written compactly as formulas:

- **RDF Triple**: \[ (S, P, O) \] where \(S\) is the subject, \(P\) is the predicate, and \(O\) is the object.
- **Object Representation**: \[ \text{Object} = (\text{PID}, \text{Properties}, \text{Datastreams}, \text{Disseminators}) \]

These formulas summarize the core components of the Fedora architecture and how they relate to one another.

Taken together, these mechanisms let the Fedora architecture manage complex digital objects and combine content management with semantic web technologies, which serves the needs of digital libraries, archives, and related fields.
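
To make the triple \((S, P, O)\) and the object tuple above concrete, here is a minimal Python sketch of a digital object and an in-memory triple store with pattern matching. It is an illustration only and not Fedora's API: the class names `DigitalObject` and `TripleStore`, the `demo:` identifiers, the predicate names `isMemberOf` and `hasPart`, and the choice to store each datastream as an ID mapped to a location string are all assumptions made for this example.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple

# An RDF-style triple: (subject, predicate, object).
Triple = Tuple[str, str, str]


@dataclass
class DigitalObject:
    """Hypothetical stand-in for Object = (PID, Properties, Datastreams, Disseminators)."""
    pid: str
    properties: Dict[str, str] = field(default_factory=dict)
    # Assumption: each datastream is an ID mapped to a content location.
    datastreams: Dict[str, str] = field(default_factory=dict)
    # Assumption: each disseminator is named by a plain string.
    disseminators: List[str] = field(default_factory=list)


class TripleStore:
    """Toy in-memory triple store; a real deployment would use an RDF triple store."""

    def __init__(self) -> None:
        self._triples: List[Triple] = []

    def add(self, s: str, p: str, o: str) -> None:
        self._triples.append((s, p, o))

    def match(self, s: Optional[str] = None, p: Optional[str] = None,
              o: Optional[str] = None) -> List[Triple]:
        """Return triples matching the pattern; None acts as a wildcard."""
        return [
            t for t in self._triples
            if (s is None or t[0] == s)
            and (p is None or t[1] == p)
            and (o is None or t[2] == o)
        ]


# Build one object and a few relationships (all identifiers are made up).
article = DigitalObject(
    pid="demo:article1",
    properties={"label": "Sample article"},
    datastreams={"PDF": "http://example.org/article1.pdf"},
    disseminators=["demo:thumbnailService"],
)

store = TripleStore()
store.add(article.pid, "isMemberOf", "demo:collection1")
store.add("demo:article2", "isMemberOf", "demo:collection1")
store.add(article.pid, "hasPart", "demo:image1")

# Which objects are members of demo:collection1?
members = [s for s, _, _ in store.match(p="isMemberOf", o="demo:collection1")]
print(members)
```

Leaving a position as `None` makes it a wildcard, so the same `match` call answers both "what belongs to this collection?" and "what relationships does this object have?". A production triple store would typically offer a query language and indexing rather than this linear scan.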