A guide to convolution arithmetic for deep learning

Vincent Dumoulin, Francesco Visin
DOI: https://doi.org/10.48550/arXiv.1603.07285
2018-01-12
Abstract: We introduce a guide to help deep learning practitioners understand and manipulate convolutional neural network architectures. The guide clarifies the relationship between various properties (input shape, kernel shape, zero padding, strides and output shape) of convolutional, pooling and transposed convolutional layers, as well as the relationship between convolutional and transposed convolutional layers. Relationships are derived for various cases, and are illustrated in order to make them intuitive.
Machine Learning, Neural and Evolutionary Computing
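The abstract names the quantities involved (input shape, kernel shape, zero padding, strides, output shape) without stating how they relate. The sketch below illustrates the kind of arithmetic meant, using the standard per-axis formulas: o = floor((i + 2p - k) / s) + 1 for a convolution, and o' = s(i' - 1) + a + k - 2p for the corresponding transposed convolution, where a = (i + 2p - k) mod s. The function names are hypothetical and chosen only for this illustration. The sketch assumes a single spatial axis, symmetric zero padding, and no dilation.

```python
def conv_output_size(i, k, s=1, p=0):
    """Output size along one axis of a convolution.

    i: input size, k: kernel size, s: stride, p: zero padding per side.
    Uses o = floor((i + 2p - k) / s) + 1.
    """
    if i + 2 * p < k:
        raise ValueError("kernel is larger than the padded input")
    return (i + 2 * p - k) // s + 1


def transposed_conv_output_size(i, k, s=1, p=0, a=0):
    """Output size along one axis of a transposed convolution.

    i: input size of the transposed convolution (the output size of the
       convolution it mirrors); k, s, p: kernel size, stride and padding
       of that mirrored convolution.
    a: extra size, (original_input + 2p - k) mod s, needed to recover the
       original input size when the stride does not divide evenly.
    Uses o' = s * (i - 1) + a + k - 2p.
    """
    return s * (i - 1) + a + k - 2 * p


# A 5-wide input, 3-wide kernel, stride 2, padding 1 gives a 3-wide output.
out = conv_output_size(5, 3, s=2, p=1)

# The transposed convolution with the same k, s, p maps 3 back to 5.
back = transposed_conv_output_size(out, 3, s=2, p=1)
```

When the stride does not divide i + 2p - k evenly, several input sizes map to the same output size, which is why the transposed case needs the extra term a to pick out the original size.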