Volume-Independent Music Matching by Frequency Spectrum Comparison

Anthony Lee,James Choi
DOI: https://doi.org/10.47611/jsrhs.v12i1.4149
2023-02-28
Journal of Student Research
Abstract:Currently, there are applications such as Shazam that provides music matching. However, a limitation is that the same piece performed by the same musician cannot be identified if it is not the same recording. This is because Shazam matches the variation in volume, not the frequencies of the sound. This research attempts to match music the way humans understand it: by the frequency spectrum of music, not the volume variation. We pre-computed the frequency spectrums of the music, then took the unknown piece and tried to match its frequency spectrum against every segment. We did so by sliding the window by 0.1 seconds and calculating the error by subtracting the normalized arrays and taking the sum of absolute differences. The segment that showed the least error was considered the candidate for the match. Matching simple pieces such as single-note pieces was successful, but complex pieces such as symphonies were not successful; that is, the algorithm couldn’t produce low error value in any of the music in the database. We suspect that it has to do with having “too many notes,” i.e., mismatches in the higher harmonics added up to significant amount of errors, which swamps the calculations.
English Else
What problem does this paper attempt to address?