Multi-patch Learning: Looking More Pixels in the Training Phase.

Lei Li, Jingzhu Tang, Ming Chen, Shijie Zhao, Junlin Li, Li Zhang
DOI: https://doi.org/10.1007/978-3-031-25063-7_34
2022-01-01
Abstract: Due to the limited computation capability and memory of GPUs, most image restoration models are trained on cropped patches instead of full-size images. Extensive prior experiments show that a model trained with a larger patch size can achieve better performance, since a larger patch size typically means a larger receptive field. However, this comes at the cost of extremely long training times and significant memory consumption. To alleviate this dilemma, we propose a multi-patch method that expands the receptive field with a negligible increase in memory and computation (less than 1%). In addition, we collect 100K high-quality images in 1K categories, following ImageNet, from flickr.com for low-level image tasks. Our method improves quantitative performance by 0.3412 dB on the validation set of the “Compressed Input Super-Resolution Challenge - Image Track”.