Abstract: In this paper, we study approximation properties of single hidden layer neural networks with weights varying on finitely many directions and thresholds from an open interval. We obtain a necessary and at the same time sufficient measure theoretic condition for density of such networks in the space of continuous functions. Further, we prove a density result for neural networks with a specifically constructed activation function and a fixed number of neurons.
What problem does this paper attempt to address?
This paper addresses the approximation properties of single hidden layer neural networks whose weights and thresholds are constrained. Specifically, the authors ask when such networks are dense in the space of continuous functions, that is, when they can approximate every continuous function to arbitrary accuracy, given that the weights vary along only finitely many directions and the thresholds come from an open interval. The authors also prove a density result for neural networks with a specifically constructed activation function and a fixed number of neurons.
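To make the setting concrete, the constrained network class can be written out as follows. The notation here ($\mathcal{M}$, $W$, $\mathbf{a}^{j}$, $X$) is an assumption chosen for illustration and may differ from the symbols used in the paper:

$$
\mathcal{M}\bigl(\sigma; W, (a,b)\bigr) = \operatorname{span}\bigl\{\sigma(\mathbf{w}\cdot\mathbf{x} - \theta) : \mathbf{w} \in W,\ \theta \in (a,b)\bigr\},
\qquad
W = \bigl\{ t\,\mathbf{a}^{j} : t \in \mathbb{R},\ j = 1, \dots, k \bigr\}
$$

Here $\sigma$ is the activation function, $\mathbf{a}^{1}, \dots, \mathbf{a}^{k}$ are the fixed directions, and $(a,b)$ is the open interval of admissible thresholds. Under this reading, density means that for a compact set $X \subset \mathbb{R}^{d}$, every $f \in C(X)$ and every $\varepsilon > 0$ admit some $g \in \mathcal{M}\bigl(\sigma; W, (a,b)\bigr)$ with $\max_{\mathbf{x} \in X} |f(\mathbf{x}) - g(\mathbf{x})| < \varepsilon$.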
### Background of the Paper
Over the past 30 years, artificial neural networks have been an active research area and have been widely applied in fields such as computer science, finance, medicine, engineering, and physics. Among the available models, the single hidden layer perceptron has attracted particular attention because it can approximate continuous functions to arbitrary accuracy. This paper studies the density (arbitrary-precision approximation) problem for single hidden layer neural networks whose weights and thresholds are constrained.
### Main Contributions
1. **Necessary and Sufficient Conditions**: The authors obtain a measure-theoretic condition that is both necessary and sufficient for single hidden layer neural networks, with weights varying along only finitely many directions and thresholds from an open interval, to be dense in the space of continuous functions.
2. **Smooth Activation Functions**: The authors proved the existence of smooth activation functions that allow neural networks to maintain density even with a fixed number of neurons.
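The constrained networks described in the contributions above can be sketched in code. This is a minimal illustration, not the authors' construction: the function name `ridge_network`, the tuple layout of `neurons`, and the use of the logistic sigmoid as the activation are all assumptions made for the example.

```python
import math


def sigmoid(z):
    """Logistic sigmoid, used here only as a stand-in activation (assumption)."""
    return 1.0 / (1.0 + math.exp(-z))


def ridge_network(x, directions, neurons, interval, activation=sigmoid):
    """Evaluate sum_i c_i * activation(t_i * <a_{j_i}, x> - theta_i).

    x          -- input point, a sequence of floats
    directions -- the finitely many fixed direction vectors a_1, ..., a_k
    neurons    -- list of (c, t, j, theta): output coefficient, real scale,
                  index of the direction used, and threshold (hypothetical layout)
    interval   -- (lo, hi), the open interval every threshold must lie in
    """
    lo, hi = interval
    total = 0.0
    for c, t, j, theta in neurons:
        # Thresholds are restricted to the open interval (lo, hi).
        if not (lo < theta < hi):
            raise ValueError(f"threshold {theta} is outside ({lo}, {hi})")
        a = directions[j]
        # The weight vector is t * a, so it always lies along a fixed direction.
        projection = sum(a_i * x_i for a_i, x_i in zip(a, x))
        total += c * activation(t * projection - theta)
    return total


# Two fixed directions in the plane and two neurons, one per direction.
directions = [(1.0, 0.0), (0.0, 1.0)]
neurons = [(2.0, 1.0, 0, 0.0), (-1.0, 3.0, 1, 0.5)]
value = ridge_network((0.0, 0.0), directions, neurons, interval=(-1.0, 1.0))
```

Each neuron in this sketch is a ridge function: its output depends on the input only through the projection onto one of the fixed directions, which is why the geometry of those directions matters for density.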
### Research Methods
- **Measure-Theoretic Methods**: Using tools from measure theory, in particular measures orthogonal to ridge functions, the authors establish the necessary and sufficient condition for the density of these networks.
- **Geometric Methods**: Using geometric concepts such as lightning bolts, the authors further explored the density of neural networks under specific conditions.
### Conclusion
- **Density Conditions**: Single hidden layer neural networks with weights varying along only finitely many directions and thresholds from an open interval are dense in the space of continuous functions under a condition expressed as the absence of closed paths or closed lightning bolts.
- **Smooth Activation Functions**: There exist smooth activation functions that allow neural networks to maintain density even with a fixed number of neurons.
### Application Prospects
These results characterize, at the theoretical level, what neural networks can approximate when their weights and thresholds are constrained, and they provide a theoretical basis for further work on network structure. For example, they can inform the choice of activation function and of weight directions in practice, with the aim of improving the approximation performance and computational efficiency of a network.