Abstract:We revisit the classic signal-to-symbol barrier in light of the remarkable ability of deep neural networks to generate realistic synthetic data. DeepFakes and spoofing highlight the feebleness of the link between physical reality and its abstract representation, whether learned by a digital computer or a biological agent. Starting from a widely applicable definition of abstract concept, we show that standard feed-forward architectures cannot capture but trivial concepts, regardless of the number of weights and the amount of training data, despite being extremely effective classifiers. On the other hand, architectures that incorporate recursion can represent a significantly larger class of concepts, but may still be unable to learn them from a finite dataset. We qualitatively describe the class of concepts that can be "understood" by modern architectures trained with variants of stochastic gradient descent, using a (free energy) Lagrangian to measure information complexity. Even if a concept has been understood, however, a network has no means of communicating its understanding to an external agent, except through continuous interaction and validation. We then characterize physical objects as abstract concepts and use the previous analysis to show that physical objects can be encoded by finite architectures. However, to understand physical concepts, sensors must provide persistently exciting observations, for which the ability to control the data acquisition process is essential (active perception). The importance of control depends on the modality, benefiting visual more than acoustic or chemical perception. Finally, we conclude that binding physical entities to digital identities is possible in finite time with finite resources, solving in principle the signal-to-symbol barrier problem, but we highlight the need for continuous validation.

Statistical signatures of abstraction in deep neural networks

Deep learning systems as complex networks

A simple probabilistic neural network for machine understanding

A Relational Inductive Bias for Dimensional Abstraction in Neural Networks

Latent Communication in Artificial Neural Networks

How Deep Neural Networks Learn Compositional Data: The Random Hierarchy Model

Neural Networks Learn Statistics of Increasing Complexity

How Deep Networks Learn Sparse and Hierarchical Data: the Sparse Random Hierarchy Model

Using Single-Neuron Representations for Hierarchical Concepts as Abstractions of Multi-Neuron Representations

A mathematical theory of semantic development in deep neural networks

Learning Document Semantic Representation with Hybrid Deep Belief Network.

When Representations Align: Universality in Representation Learning Dynamics

An Information Theoretic Interpretation to Deep Neural Networks.

DORA: Exploring Outlier Representations in Deep Neural Networks

Where We Have Arrived in Proving the Emergence of Sparse Symbolic Concepts in AI Models

Representations and generalization in artificial and brain neural networks

On the Learnability of Physical Concepts: Can a Neural Network Understand What's Real?

Dimensions underlying the representational alignment of deep neural networks with humans

Intrinsic Dimension, Persistent Homology and Generalization in Neural Networks

Compositional generalization through abstract representations in human and artificial neural networks