Towards a privacy-preserving distributed cloud service for preprocessing very large medical images

Yuandou Wang,Neel Kanwal,Kjersti Engan,Chunming Rong,Zhiming Zhao
DOI: https://doi.org/10.1109/ICDH60066.2023.00055
2023-09-13
Abstract:Digitized histopathology glass slides, known as Whole Slide Images (WSIs), are often several gigapixels large and contain sensitive metadata information, which makes distributed processing unfeasible. Moreover, artifacts in WSIs may result in unreliable predictions when directly applied by Deep Learning (DL) algorithms. Therefore, preprocessing WSIs is beneficial, e.g., eliminating privacy-sensitive information, splitting a gigapixel medical image into tiles, and removing the diagnostically irrelevant areas. This work proposes a cloud service to parallelize the preprocessing pipeline for large medical images. The data and model parallelization will not only boost the end-to-end processing efficiency for histological tasks but also secure the reconstruction of WSI by randomly distributing tiles across processing nodes. Furthermore, the initial steps of the pipeline will be integrated into the Jupyter-based Virtual Research Environment (VRE) to enable image owners to configure and automate the execution process based on resource allocation.
Computational Engineering, Finance, and Science,Distributed, Parallel, and Cluster Computing
What problem does this paper attempt to address?