Abstract:

Background: Robust, extensible, and distributed databases that integrate clinical, imaging, and molecular data represent a substantial challenge for modern neuroscience. It is even more difficult to provide extensible software environments that can effectively target the rapidly changing data requirements and structures of research experiments. The neuroscience community increasingly requests software tools that address three technical challenges: (i) supporting researchers in the medical field in carrying out data analysis using integrated bioinformatics services and tools; (ii) handling multimodal and multiscale data and metadata, enabling the injection of several different data types according to structured schemas; (iii) providing high extensibility, so that the different requirements arising from a large variety of applications can be addressed simply through user configuration at runtime.

Methods: We defined and implemented a dynamically extensible data structure supporting collaborative multidisciplinary research projects in neuroscience. We considered extensibility from two points of view. First, we improved data flexibility by developing a methodology for the dynamic creation and use of data types and related metadata, based on the definition of a "meta" data model. This way, users are not constrained to a set of predefined data types, and the model can be easily extended and applied to different contexts. Second, we enabled users to easily customize and extend the experimental procedures in order to track each step of acquisition or analysis. This was achieved through a process-event data structure, a multipurpose taxonomic schema composed of two generic main objects: events and processes. We then built a repository based on this data model and structure and deployed it on distributed resources using a Grid-based approach. Finally, we addressed data integration by providing the repository application with an efficient dynamic interface that lets the user both query the data by defined data type and view all the data of every patient in an integrated and simple way.

Results: The results of our work are twofold. First, a dynamically extensible data model was implemented and tested, based on a "meta" data model that enables users to define their own data types independently of the application context. This data model allows users to dynamically include additional data types without rebuilding the underlying database. A complex process-event data structure was then built on this data model, describing patient-centered diagnostic processes and merging information from data and metadata. Second, a repository implementing this data structure was deployed on a distributed Data Grid in order to provide scalability in terms of both data input and data storage, and to exploit distributed data and computational approaches so that resources are shared more efficiently. Moreover, data management is made possible through a user-friendly web interface.

Conclusions: The repository satisfies the driving principle of not forcing users into preconfigured data types. It is up to users to dynamically configure the data model for the given experiment or data acquisition program, which makes the repository potentially suitable for customized applications.
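As an illustration only, the two ideas described in the Methods (data types defined at runtime through a "meta" data model, and a process-event structure that tracks each acquisition or analysis step per patient) can be sketched in a few lines of Python. The abstract does not detail the actual implementation, so everything here is an assumption made for the sketch: the class names `DataType`, `Event`, `Process`, and `Repository`, the in-memory dictionaries standing in for the Grid-based storage, and the example MRI and genetic-test fields.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List, Tuple


@dataclass
class DataType:
    """Hypothetical user-defined data type: a name plus a metadata schema."""
    name: str
    schema: Dict[str, type]  # metadata field name -> expected Python type

    def validate(self, metadata: Dict[str, Any]) -> None:
        # Reject records that omit a declared field or use the wrong type.
        missing = set(self.schema) - set(metadata)
        if missing:
            raise ValueError(f"missing metadata fields: {sorted(missing)}")
        for key, expected in self.schema.items():
            if not isinstance(metadata[key], expected):
                raise TypeError(f"{key!r} must be of type {expected.__name__}")


@dataclass
class Event:
    """One acquisition or analysis step, carrying data of a registered type."""
    data_type: str
    metadata: Dict[str, Any]


@dataclass
class Process:
    """An ordered group of events for one patient."""
    patient_id: str
    events: List[Event] = field(default_factory=list)


class Repository:
    """In-memory stand-in for the repository (an assumption of this sketch)."""

    def __init__(self) -> None:
        self.types: Dict[str, DataType] = {}
        self.processes: Dict[str, Process] = {}

    def register_type(self, name: str, schema: Dict[str, type]) -> DataType:
        # New types are added at runtime; no storage layout is rebuilt.
        if name in self.types:
            raise ValueError(f"data type {name!r} is already registered")
        self.types[name] = DataType(name, dict(schema))
        return self.types[name]

    def add_event(self, patient_id: str, type_name: str,
                  metadata: Dict[str, Any]) -> Event:
        # Raises KeyError if the data type was never registered.
        self.types[type_name].validate(metadata)
        process = self.processes.setdefault(patient_id, Process(patient_id))
        event = Event(type_name, dict(metadata))
        process.events.append(event)
        return event

    def query(self, type_name: str) -> List[Tuple[str, Event]]:
        # Query across patients by data type.
        return [(p.patient_id, e)
                for p in self.processes.values()
                for e in p.events if e.data_type == type_name]

    def patient_view(self, patient_id: str) -> List[Event]:
        # Integrated view of every event recorded for one patient.
        return list(self.processes[patient_id].events)


repo = Repository()
repo.register_type("MRI", {"field_strength_tesla": float, "sequence": str})
repo.add_event("patient-001", "MRI",
               {"field_strength_tesla": 3.0, "sequence": "T1"})
# A second type is introduced later, without touching existing records.
repo.register_type("GeneticTest", {"gene": str})
repo.add_event("patient-001", "GeneticTest", {"gene": "APOE"})
```

In this sketch the schema is ordinary data held by the repository, which is what allows a new type such as `GeneticTest` to be added after records already exist. A persistent implementation would need to store the type definitions alongside the records they describe.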

BioBricks.ai: A Versioned Data Registry for Life Sciences Data Assets

BRISK--research-oriented storage kit for biology-related data

The BioMart community portal: an innovative alternative to large, centralized data repositories

BioMaster: an Integrated Database and Analytic Platform to Provide Comprehensive Information about BioBrick Parts.

brainlife.io: A decentralized and open source cloud platform to support neuroscience research

Biomart Central Portal: An Open Database Network For The Biological Community

Unlocking biomedical data sharing: A structured approach with digital twins and artificial intelligence (AI) for open health sciences

Bio-medical Big Data Operating System (Bio-OS): An Integrated Data Mining Environment for Data Intensive Scientific Research

brainlife.io: a decentralized and open-source cloud platform to support neuroscience research

Sherlock: an open-source data platform to store, analyze and integrate Big Data for computational biologists

A new AI-assisted data standard accelerates interoperability in biomedical research

Digital asset management for heterogeneous biomedical data in an era of data-intensive science

Simplifying Data Analysis in Biomedical Research: An Automated, User-Friendly Tool

Playbook Workflow Builder: Interactive Construction of Bioinformatics Workflows from a Network of Microservices

BioThings Explorer: a query engine for a federated knowledge graph of biomedical APIs

qPortal: A platform for data-driven biomedical research

Abstract 6242: A workflow execution system in a data fabric for integrative cancer analyses

A multi-omics data analysis workflow packaged as a FAIR Digital Object

Aztec: A Platform to Render Biomedical Software Findable, Accessible, Interoperable, and Reusable.

Visualizing Clinical Data Retrieval and Curation in Multimodal Healthcare AI Research: A Technical Note on RIL-workflow

A repository based on a dynamically extensible data model supporting multidisciplinary research in neuroscience