Abstract:This paper investigates the role of communication in improving coordination within robot swarms, focusing on a paradigm where learning and execution occur simultaneously in a decentralized manner. We highlight the role communication can play in addressing the credit assignment problem (individual contribution to the overall performance), and how it can be influenced by it. We propose a taxonomy of existing and future works on communication, focusing on information selection and physical abstraction as principal axes for classification: from low-level lossless compression with raw signal extraction and processing to high-level lossy compression with structured communication models. The paper reviews current research from evolutionary robotics, multi-agent (deep) reinforcement learning, language models, and biophysics models to outline the challenges and opportunities of communication in a collective of robots that continuously learn from one another through local message exchanges, illustrating a form of social learning.

What problem does this paper attempt to address?

### What problems does this paper attempt to solve? This paper aims to explore the role of communication in improving the coordination of swarms of robots, with a particular focus on a paradigm in which learning and execution are carried out in a decentralized manner simultaneously. Specifically, the paper attempts to solve the following core problems: 1. **Credit Assignment Problem**: - In a swarm of robots, how the individual contribution of each robot affects the overall performance is a complex problem. Especially in a decentralized environment, robots need to evaluate their contribution to the completion of collective tasks, which is the so - called "Credit Assignment Problem". The paper discusses how to solve this problem through communication, so that each robot can better understand the impact of its own behavior on the entire swarm. 2. **The role of communication in social learning**: - The paper emphasizes the importance of communication in social learning, that is, the ability of robots to learn from each other through local information exchange. This social learning can help swarms of robots adapt more effectively to unknown or dynamic environments and improve the quality of overall task completion. 3. **Classification of information selection and physical abstraction**: - The paper proposes a classification framework for describing existing and future research work on communication. This classification framework is mainly based on two dimensions: Information Selection and Physical Abstraction. Information Selection refers to reducing the amount of shared information, while Physical Abstraction is to change the representation of information to reveal its internal structure. These two dimensions help us understand the characteristics of different communication methods and their applicable scenarios. 4. **Challenges of the Decentralized Learning and Execution (DLE) paradigm**: - The paper explores the unique challenges of working under the Decentralized Learning and Execution (DLE) paradigm. The DLE paradigm means that all learning and computing are carried out on - site without the intervention of humans or a central computer. This brings complex problems such as unexpected competition that may exist between individuals. 5. **Multidisciplinary research directions**: - The paper reviews research from multiple fields such as evolutionary robotics, multi - agent (deep) reinforcement learning, language models, and biophysics, showing the potential and challenges of these fields in solving the communication problems of swarms of robots. Through an interdisciplinary approach, the paper provides new ideas and directions for future research. ### Formula Explanation Although this paper mainly focuses on conceptual and technical discussions, in order to ensure the accuracy of formula expressions, if specific mathematical or physical formulas are involved, they will be presented in Markdown format. For example, assume that the gradient descent formula is involved when discussing a certain algorithm, it can be represented as follows: \[ \theta_{t + 1}=\theta_t-\alpha\nabla J(\theta_t) \] where $\theta_t$ represents the value of the parameter at time step $t$, $\alpha$ is the learning rate, and $\nabla J(\theta_t)$ is the gradient of the objective function $J$ at $\theta_t$. ### Summary In general, by exploring the role of communication in swarms of robots, especially how to improve the effect of decentralized learning and execution through communication, this paper proposes a series of theoretical and practical challenges and opportunities. It not only provides a new perspective for the research of swarms of robots, but also points out the direction for multidisciplinary research.

Signaling and Social Learning in Swarms of Robots