The Rarity of Musical Audio Signals Within the Space of Possible Audio Generation

Nick Collins
2024-05-24
Abstract: A white noise signal can access any possible configuration of values, though statistically over many samples tends to a uniform spectral distribution, and is highly unlikely to produce intelligible sound. But how unlikely? The probability that white noise generates a music-like signal over different durations is analyzed, based on some necessary features observed in real music audio signals such as mostly proximate movement and zero crossing rate. Given the mathematical results, the rarity of music as a signal is considered overall. The applicability of this study is not just to show that music has a precious rarity value, but that examination of the size of music relative to the overall size of audio signal space provides information to inform new generations of algorithmic music system (which are now often founded on audio signal generation directly, and may relate to white noise via such machine learning processes as diffusion). Estimated upper bounds on the rarity of music to the size of various physical and musical spaces are compared, to better understand the magnitude of the results (pun intended). Underlying the research are the questions 'how much music is still out there?' and 'how much music could a machine learning process actually reach?'.
Sound, Machine Learning, Audio and Speech Processing
What problem does this paper attempt to address?
The core problem this paper addresses is **the rarity of music signals within the space of all possible audio signals**. By analyzing the probability that white noise generates a music-like signal, the author explores three questions:

1. **How can the rarity of music signals relative to all possible audio signals be quantified?**
   - Using mathematical models and statistical methods, the author calculates the probability that white noise generates a music-like signal. The results show that this probability is vanishingly small.
2. **How can the probability of music signals be measured relative to a given audio model?**
   - The author uses some necessary features of music (such as zero-crossing rate and closeness of consecutive sample values) to characterize music signals, and calculates the probability that white noise satisfies these features. For example, for 1 second of audio at a 44.1 kHz sampling rate, the probability of generating consecutive sample values that are close is \(1.24355865\times 10^{-2018}\), which is far smaller than the probability of selecting one particular atom in the universe (\(10^{-80}\)).
3. **How much of the space of possible music audio signals have humans explored?**
   - By comparing different music resources and physical systems, the author shows that music signals occupy an extremely small proportion of the entire audio signal space. Even allowing for the future ability of AI to generate music, the music that humans can create is only a small part of the entire music space.

### Main conclusions of the paper

- **Music signals are a very small subset of all possible audio signals**. The features used in the white noise analysis are only necessary conditions, so real music signals are rarer still than the probabilities computed in the paper.
- **The rarity of music is extremely high**. Even considering "existing recordings" or "all possible human-lifetime recordings", the number of music signals is far smaller than the number of all possible audio signals.
- **Although future AI music-generation systems can generate a large number of musical works, they still cannot exhaust the possibilities of the music space**. Human creators therefore still have ample room for innovation.

### Summary of mathematical formulas

1. **Binomial distribution probability formula**:
   \[ P(k; p, n)=\binom{n}{k}p^{k}(1 - p)^{n - k} \]
   where \(P(k; p, n)\) is the probability of exactly \(k\) successes in \(n\) trials, and \(p\) is the probability of success in each trial.
2. **Cumulative probability formula**:
   \[ P(k\geq K; p, n)=\sum_{k = K}^{n}\binom{n}{k}p^{k}(1 - p)^{n - k} \]
   where \(K\) is the minimum number of proximate adjacent samples required.
3. **Chernoff bound formula**:
   \[ \exp\left(-n\left(a\log\left(\frac{a}{p}\right)+(1 - a)\log\left(\frac{1 - a}{1 - p}\right)\right)\right) \]
   where \(a=\frac{k}{n}\). For \(a < p\), this is an upper bound on the probability of generating no more than \(k\) zero-crossings in \(n\) trials.

Through these formulas, the author shows the extreme rarity of music signals among all possible audio signals, emphasizing the uniqueness and preciousness of music creation.
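Probabilities as small as \(10^{-2018}\) underflow ordinary floating point, so the binomial tail and the Chernoff bound in the formula summary are easier to evaluate in log space. The following is a minimal sketch of that idea, not the paper's own code. The function names are hypothetical, and the example values (a per-sample zero-crossing probability of `p = 0.5` for white noise and a target of at most 4410 zero-crossings in 44100 samples) are illustrative assumptions rather than figures taken from the paper.

```python
import math


def log_binom_pmf(k: int, n: int, p: float) -> float:
    """Natural log of P(k; p, n) = C(n, k) p^k (1 - p)^(n - k), for 0 < p < 1."""
    log_comb = math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
    return log_comb + k * math.log(p) + (n - k) * math.log1p(-p)


def log10_upper_tail(K: int, n: int, p: float) -> float:
    """log10 of P(k >= K; p, n), summed with the log-sum-exp trick."""
    logs = [log_binom_pmf(k, n, p) for k in range(K, n + 1)]
    peak = max(logs)
    # Terms far below the peak underflow to 0.0, which is harmless here.
    total = sum(math.exp(x - peak) for x in logs)
    return (peak + math.log(total)) / math.log(10)


def log10_chernoff_lower_tail(k: int, n: int, p: float) -> float:
    """log10 of exp(-n * D(a || p)) with a = k / n, bounding P(X <= k) for a < p."""
    a = k / n
    if not 0.0 < a < p < 1.0:
        raise ValueError("this sketch requires 0 < k/n < p < 1")
    kl = a * math.log(a / p) + (1 - a) * math.log((1 - a) / (1 - p))
    return -n * kl / math.log(10)


# Illustrative values only (assumptions, not the paper's parameters):
# one second at 44.1 kHz, zero-crossing probability 0.5 per sample,
# and at most 4410 zero-crossings allowed.
print(log10_chernoff_lower_tail(4410, 44100, 0.5))
```

Working with `log10` values keeps the results directly comparable with reference magnitudes such as \(10^{-80}\): a returned value below -80 means the bounded probability is smaller than that reference.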