A perspective on neuroscience data standardization with Neurodata Without Borders

Andrea Pierré,Tuan Pham,Jonah Pearl,Sandeep Robert Datta,Jason T. Ritt,Alexander Fleischmann
2024-01-23
Abstract:Neuroscience research has evolved to generate increasingly large and complex experimental data sets, and advanced data science tools are taking on central roles in neuroscience research. Neurodata Without Borders (NWB), a standard language for neurophysiology data, has recently emerged as a powerful solution for data management, analysis, and sharing. We here discuss our efforts to implement NWB data science pipelines. We describe general principles and specific use cases that illustrate successes, challenges, and non-trivial decisions in software engineering. We hope that our experience can provide guidance for the neuroscience community and help bridge the gap between experimental neuroscience and data science.
Neurons and Cognition
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper aims to address the issue of standardizing neuroscience data and specifically introduces Neurodata Without Borders (NWB) as a powerful solution for data management, analysis, and sharing. The authors discuss their experiences in implementing the NWB data science pipeline in their laboratory, highlighting successes, challenges, and some non-trivial decisions in software engineering. The main objectives include: 1. **Standardizing complex datasets**: - As neuroscience research generates increasingly large and complex datasets, standardization becomes necessary. NWB serves as a standard language to effectively organize and manage these data. 2. **Enhancing data sharing and reproducibility**: - A standardized data format facilitates collaboration and data sharing between different laboratories, thereby improving research transparency and reproducibility. 3. **Promoting team science**: - Big data and interdisciplinary team collaboration require new data organization strategies. NWB provides not only technical tools but also a conceptual framework to standardize data organization and description. 4. **Intra- and inter-laboratory collaboration**: - Implementing NWB standardization within a laboratory helps researchers better manage and analyze data; in collaborations with other laboratories, NWB helps reduce friction caused by inconsistent file formats. 5. **Public data sharing**: - The paper discusses how to publish NWB data to public data repositories (such as DANDI) to meet publication and funding requirements and increase opportunities for data reuse. In summary, by sharing the specific practical experiences of the authors' laboratory, this paper aims to provide guidance to the neuroscience community, helping to bridge the gap between experimental neuroscience and data science.