Abstract:Background and Objective: Preprocessing of data is a vital step for almost all deep learning workflows. In computer vision, manipulation of data intensity and spatial properties can improve network stability and can provide an important source of generalisation for deep neural networks. Models are frequently trained with preprocessing pipelines composed of many stages, but these pipelines come with a drawback; each stage that resamples the data costs time, degrades image quality, and adds bias to the output. Long pipelines can also be complex to design, especially in medical imaging, where cropping data early can cause significant artifacts. Methods: We present Lazy Resampling, a software that rephrases spatial preprocessing operations as a graphics pipeline. Rather than each transform individually modifying the data, the transforms generate transform descriptions that are composited together into a single resample operation wherever possible. This reduces pipeline execution time and, most importantly, limits signal degradation. It enables simpler pipeline design as crops and other operations become non-destructive. Lazy Resampling is designed in such a way that it provides the maximum benefit to users without requiring them to understand the underlying concepts or change the way that they build pipelines. Results: We evaluate Lazy Resampling by comparing traditional pipelines and the corresponding lazy resampling pipeline for the following tasks on Medical Segmentation Decathlon datasets. We demonstrate lower information loss in lazy pipelines vs. traditional pipelines. We demonstrate that Lazy Resampling can avoid catastrophic loss of semantic segmention label accuracy occuring in traditional pipelines when passing labels through a pipeline and then back through the inverted pipeline. Finally, we demonstrate statistically significant improvements when training UNets for semantic segmentation. Conclusion: Lazy Resampling reduces the loss of information that occurs when running processing pipelines that traditionally have multiple resampling steps and enables researchers to build simpler pipelines by making operations such as rotation and cropping effectively non-destructive. It makes it possible to invert labels back through a pipeline without catastrophic loss of accuracy. A reference implementation for Lazy Resampling can be found at https://github.com/KCLBMEIS/LazyResampling . Lazy Resampling is being implemented as a core feature in MONAI, an open source python-based deep learning library for medical imaging, with a roadmap for a full integration.

Automated Image Data Preprocessing with Deep Reinforcement Learning

Mastering Robot Control Through Point-based Reinforcement Learning with Pre-training.

Painless and Accurate Medical Image Analysis Using Deep Reinforcement Learning with Task-Oriented Homogenized Automatic Pre-Processing.

Impact of the Preprocessing Steps in Deep Learning-Based Image Classifications

Content Classification Tasks with Data Preprocessing Manifestations

Photo Cropping Via Deep Reinforcement Learning.

AutoTrain: an Efficient Auto-training System for Small-scale Image Classification.

Pre-training with Non-expert Human Demonstration for Deep Reinforcement Learning

Where Is My Training Bottleneck? Hidden Trade-Offs in Deep Learning Preprocessing Pipelines

Automatic Data Acquisition for Deep Learning

Interpreting pretext tasks for active learning: a reinforcement learning approach

Understanding Unconventional Preprocessors in Deep Convolutional Neural Networks for Face Identification

Unprocessing Images for Learned Raw Denoising

Active Learning for Image Classification: A Deep Reinforcement Learning Approach

Lazy Resampling: Fast and information preserving preprocessing for deep learning

DiffPrep: Differentiable Data Preprocessing Pipeline Search for Learning over Tabular Data

DeepPrep: An accelerated, scalable, and robust pipeline for neuroimaging preprocessing empowered by deep learning

An Automated Portfolio Trading System with Feature Preprocessing and Recurrent Reinforcement Learning

Deep reinforcement learning enables adaptive-image augmentation for automated optical inspection of plant rust

Underwater Image Enhancement with Reinforcement Learning

Pretraining Representations for Data-Efficient Reinforcement Learning