Abstract:Background and Objective Processing of medical imaging big data is deeply challenging due to the size of data, computational complexity, security storage and inherent privacy issues. Traditional picture archiving and communication system, which is an imaging technology used in the healthcare industry, generally uses centralized high performance disk storage arrays in the practical solutions. The existing storage solutions are not suitable for the diverse range of medical imaging big data that needs to be stored reliably and accessed in a timely manner. The economical solution is emerging as the cloud computing which provides scalability, elasticity, performance and better managing cost. Cloud based storage architecture for medical imaging big data has attracted more and more attention in industry and academia. Methods This study presents a novel, fast and scalable framework of medical image storage service based on distributed file system. Two innovations of the framework are introduced in this paper. An integrated medical imaging content indexing file model for large-scale image sequence is designed to adapt to the high performance storage efficiency on distributed file system. A virtual file pooling technology is proposed, which uses the memory-mapped file method to achieve an efficient data reading process and provides the data swapping strategy in the pool. Result The experiments show that the framework not only has comparable performance of reading and writing files which meets requirements in real-time application domain, but also bings greater convenience for clinical system developers by multiple client accessing types. The framework supports different user client types through the unified micro-service interfaces which basically meet the needs of clinical system development especially for online applications. The experimental results demonstrate the framework can meet the needs of real-time data access as well as traditional picture archiving and communication system. Conclusions This framework aims to allow rapid data accessing for massive medical images, which can be demonstrated by the online web client for MISS-D framework implemented in this paper for real-time data interaction. The framework also provides a substantial subset of features to existing open-source and commercial alternatives, which has a wide range of potential applications.

Scalable, reproducible, and cost-effective processing of large-scale medical imaging datasets

Scalable quality control on processing of large diffusion-weighted and structural magnetic resonance imaging datasets

MISS-D: A fast and scalable framework of medical image storage service based on distributed file system

Scalable analysis of Big pathology image data cohorts using efficient methods and high-performance computing strategies

High-throughput Analysis of Large Microscopy Image Datasets on CPU-GPU Cluster Platforms

Engineering AI Tools for Systematic and Scalable Quality Assessment in Magnetic Resonance Imaging

A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets

A reproducible and generalizable software workflow for analysis of large-scale neuroimaging data collections using BIDS Apps

Using MapReduce for Large-scale Medical Image Analysis

A Data Colocation Grid Framework for Big Data Medical Image Processing - Backend Design

TDat: an Efficient Platform for Processing Petabyte-Scale Whole-Brain Volumetric Images.

Implementation of a Semi-automated Post-processing System for Parametric MRI Mapping of Human Breast Cancer

The Trials and Tribulations of Assembling Large Medical Imaging Datasets for Machine Learning Applications

Multiscale Cloud-based Pipeline for Neuronal Electrophysiology Analysis and Visualization

High-performance Data Management for Whole Slide Image Analysis in Digital Pathology

An Automated Tool to Classify and Transform Unstructured MRI Data into BIDS Datasets

A highly parallelized framework for computationally intensive MR data analysis

Efficient and Reliable Data Management for Biomedical Applications

Digital asset management for heterogeneous biomedical data in an era of data-intensive science

A QR code-enabled framework for fast biomedical image processing in medical diagnosis using deep learning

A browser-based platform for storage, visualization, and analysis of large-scale 3D images in HPC environments