Attention Mechanisms and Their Applications to Complex Systems

Adrián Hernández,José M. Amigó
DOI: https://doi.org/10.3390/e23030283
IF: 2.738
2021-02-26
Entropy
Abstract:Deep learning models and graphics processing units have completely transformed the field of machine learning. Recurrent neural networks and long short-term memories have been successfully used to model and predict complex systems. However, these classic models do not perform sequential reasoning, a process that guides a task based on perception and memory. In recent years, attention mechanisms have emerged as a promising solution to these problems. In this review, we describe the key aspects of attention mechanisms and some relevant attention techniques and point out why they are a remarkable advance in machine learning. Then, we illustrate some important applications of these techniques in the modeling of complex systems.
physics, multidisciplinary
What problem does this paper attempt to address?