On the Learnability of Physical Concepts: Can a Neural Network Understand What's Real?

Authors: Alessandro Achille, Stefano Soatto

Published: 2022-07-25 17:21:59+00:00

AI Summary

This paper analyzes the learnability of physical concepts by neural networks, focusing on the signal-to-symbol barrier. It shows that standard feed-forward architectures struggle to capture non-trivial concepts, while recurrent architectures, in principle, can but may not learn them from finite data. The authors conclude that binding physical entities to digital identities is possible but requires continuous validation.

Abstract

We revisit the classic signal-to-symbol barrier in light of the remarkable ability of deep neural networks to generate realistic synthetic data. DeepFakes and spoofing highlight the feebleness of the link between physical reality and its abstract representation, whether learned by a digital computer or a biological agent. Starting from a widely applicable definition of abstract concept, we show that standard feed-forward architectures cannot capture but trivial concepts, regardless of the number of weights and the amount of training data, despite being extremely effective classifiers. On the other hand, architectures that incorporate recursion can represent a significantly larger class of concepts, but may still be unable to learn them from a finite dataset. We qualitatively describe the class of concepts that can be understood by modern architectures trained with variants of stochastic gradient descent, using a (free energy) Lagrangian to measure information complexity. Even if a concept has been understood, however, a network has no means of communicating its understanding to an external agent, except through continuous interaction and validation. We then characterize physical objects as abstract concepts and use the previous analysis to show that physical objects can be encoded by finite architectures. However, to understand physical concepts, sensors must provide persistently exciting observations, for which the ability to control the data acquisition process is essential (active perception). The importance of control depends on the modality, benefiting visual more than acoustic or chemical perception. Finally, we conclude that binding physical entities to digital identities is possible in finite time with finite resources, solving in principle the signal-to-symbol barrier problem, but we highlight the need for continuous validation.


Key findings
Standard feed-forward networks cannot capture non-trivial concepts. Recurrent architectures can theoretically represent any computable concept but may fail to learn them from finite data. Binding physical entities to digital identities is possible, but continuous validation is necessary.
Approach
The paper uses a theoretical approach based on established results from model theory and computability theory to analyze the capabilities and limitations of different neural network architectures in learning abstract concepts, including physical objects. It defines 'understanding' a concept and examines the conditions under which it can be achieved, focusing on the role of active perception and the limitations of passive data acquisition.
Datasets
UNKNOWN
Model(s)
Feed-forward networks, recurrent networks, Transformers
Author countries
UNKNOWN