Abstract: We revisit the classic signal-to-symbol barrier in light of the remarkable ability of deep neural networks to generate realistic synthetic data. DeepFakes and spoofing highlight the feebleness of the link between physical reality and its abstract representation, whether learned by a digital computer or a biological agent. Starting from a widely applicable definition of an abstract concept, we show that standard feed-forward architectures can capture only trivial concepts, regardless of the number of weights and the amount of training data, despite being extremely effective classifiers. On the other hand, architectures that incorporate recursion can represent a significantly larger class of concepts, but may still be unable to learn them from a finite dataset. We qualitatively describe the class of concepts that can be "understood" by modern architectures trained with variants of stochastic gradient descent, using a (free energy) Lagrangian to measure information complexity. Even if a concept has been understood, however, a network has no means of communicating its understanding to an external agent, except through continuous interaction and validation. We then characterize physical objects as abstract concepts and use the previous analysis to show that physical objects can be encoded by finite architectures. However, to understand physical concepts, sensors must provide persistently exciting observations, for which the ability to control the data acquisition process is essential (active perception). The importance of control depends on the modality, benefiting visual more than acoustic or chemical perception. Finally, we conclude that binding physical entities to digital identities is possible in finite time with finite resources, solving in principle the signal-to-symbol barrier problem, but we highlight the need for continuous validation.

Does Deep Learning Learn to Abstract? A Systematic Probing Framework

Abstraction Learning

Abstract Reasoning with Distracting Features

AbsPyramid: Benchmarking the Abstraction Ability of Language Models with a Unified Entailment Graph

From Concrete to Abstract: A Multimodal Generative Approach to Abstract Concept Learning

Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?

Meaningful Learning: Advancing Abstract Reasoning in Large Language Models Via Generic Fact Guidance

How Large Language Models Encode Context Knowledge? A Layer-Wise Probing Study

Meaningful Learning: Enhancing Abstract Reasoning in Large Language Models via Generic Fact Guidance

Emergence of a High-Dimensional Abstraction Phase in Language Transformers

Learning with Language-Guided State Abstractions

A learning perspective on the emergence of abstractions: the curious case of phonemes

One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models

Semantics, Representations and Grammars for Deep Learning

Can GPT-4 learn to analyse moves in research article abstracts?

Towards Uncovering How Large Language Model Works: An Explainability Perspective

How Does Pretraining Improve Discourse-Aware Translation?

On the Learnability of Physical Concepts: Can a Neural Network Understand What's Real?

Physics of Language Models: Part 3.1, Knowledge Storage and Extraction

How to Do Things with Deep Learning Code

Preference-Conditioned Language-Guided Abstraction