Abstract:Background: Robust, extensible and distributed databases integrating clinical, imaging and molecular data represent a substantial challenge for modern neuroscience. It is even more difficult to provide extensible software environments able to effectively target the rapidly changing data requirements and structures of research experiments. There is an increasing request from the neuroscience community for software tools addressing technical challenges about: (i) supporting researchers in the medical field to carry out data analysis using integrated bioinformatics services and tools; (ii) handling multimodal/multiscale data and metadata, enabling the injection of several different data types according to structured schemas; (iii) providing high extensibility, in order to address different requirements deriving from a large variety of applications simply through a user runtime configuration. Methods: A dynamically extensible data structure supporting collaborative multidisciplinary research projects in neuroscience has been defined and implemented. We have considered extensibility issues from two different points of view. First, the improvement of data flexibility has been taken into account. This has been done through the development of a methodology for the dynamic creation and use of data types and related metadata, based on the definition of "meta" data model. This way, users are not constrainted to a set of predefined data and the model can be easily extensible and applicable to different contexts. Second, users have been enabled to easily customize and extend the experimental procedures in order to track each step of acquisition or analysis. This has been achieved through a process-event data structure, a multipurpose taxonomic schema composed by two generic main objects: events and processes. Then, a repository has been built based on such data model and structure, and deployed on distributed resources thanks to a Grid-based approach. Finally, data integration aspects have been addressed by providing the repository application with an efficient dynamic interface designed to enable the user to both easily query the data depending on defined datatypes and view all the data of every patient in an integrated and simple way. Results: The results of our work have been twofold. First, a dynamically extensible data model has been implemented and tested based on a "meta" data-model enabling users to define their own data types independently from the application context. This data model has allowed users to dynamically include additional data types without the need of rebuilding the underlying database. Then a complex process-event data structure has been built, based on this data model, describing patient-centered diagnostic processes and merging information from data and metadata. Second, a repository implementing such a data structure has been deployed on a distributed Data Grid in order to provide scalability both in terms of data input and data storage and to exploit distributed data and computational approaches in order to share resources more efficiently. Moreover, data managing has been made possible through a friendly web interface. The driving principle of not being forced to preconfigured data types has been satisfied. It is up to users to dynamically configure the data model for the given experiment or data acquisition program, thus making it potentially suitable for customized applications. Conclusions: Based on such repository, data managing has been made possible through a friendly web interface. The driving principle of not being forced to preconfigured data types has been satisfied. It is up to users to dynamically configure the data model for the given experiment or data acquisition program, thus making it potentially suitable for customized applications.

The Experiment Data Depot: A Web-Based Software Tool for Biological Experimental Data Storage, Sharing, and Visualization

Automated Experimentation Powers Data Science in Chemistry.

echemdb Toolkit -- a Lightweight Approach to Getting Data Ready for Data Management Solutions

State-of-the-Art Data Management: Improving the Reproducibility, Consistency, and Traceability of Structural Biology and in Vitro Biochemical Experiments

A web-portal for interactive data exploration, visualization, and hypothesis testing

DatumKB: A Database of Biological Experimental Results

MDRepo - an open environment for data warehousing and knowledge discovery from molecular dynamics simulations

The Encyclopedia of Proteome Dynamics: a big data ecosystem for (prote)omics

Enabling systematic, harmonised and large-scale biofilms data computation: the Biofilms Experiment Workbench

MiSDEED: a synthetic data engine for microbiome study power analysis and study design

Big Data access and infrastructure for modern biology: case studies in data repository utility

An anchored experimental design and meta-analysis approach to address batch effects in large-scale metabolomics

Data management in the modern structural biology and biomedical research environment

Digital asset management for heterogeneous biomedical data in an era of data-intensive science

Seamless Science: Lifting Experimental Mechanical Testing Lab Data to an Interoperable Semantic Representation

Open source and reproducible and inexpensive infrastructure for data challenges and education

SynBio2Easy-a biologist-friendly tool for batch operations on SBOL designs with Excel inputs

A repository based on a dynamically extensible data model supporting multidisciplinary research in neuroscience

The BioMart community portal: an innovative alternative to large, centralized data repositories

Online Omics Platform Expedites Industrial Application of Halomonas bluephagenesis TD1.0

A Semantic Cross-Species Derived Data Management Application