Dynamic convolution for image matching

Igor Soloviev,Andrey Kovalchuk,Vladimir Klinshov
DOI: https://doi.org/10.1140/epjs/s11734-024-01373-2
2024-10-31
The European Physical Journal Special Topics
Abstract:Convolution neural networks (CNNs) are omnipresent in modern computer vision models and also widely used in other tasks such as voice recognition, time series analysis, machine translation, etc. In the present paper, we introduce a novel architecture of CNNs using dynamic convolutions in which the kernels are generated based on the input data. We apply this architecture to the image matching problem and develop a two-branch network in which one branch generates kernels used in convolutional layers of the other branch. We test our model on a canonical MNIST benchmark and demonstrate that it shows faster learning and better performance than the baseline model with standard convolutions. Potential applications of our architecture includes numerous problems in image analysis, time series forecasting, physical-informed machine learning, etc.
physics, multidisciplinary
What problem does this paper attempt to address?