Algorithms for discovering musical repetition have been developed in the audio and symbolic domains more or less independently for over a decade. In this paper we combine algorithms for multiple F0 estimation, beat tracking, quantisation, and pattern discovery, so that for the first time the note content of motifs, themes, and repeated sections can be discovered directly from polyphonic music audio. Testing on deadpan and expressive piano renditions of pieces, we compared pattern discovery performance against runs on symbolic representations of the same pieces. Relative to the deadpan-symbolic representations, establishment precision and recall fell by ~25% for deadpan audio and by ~50% for expressive audio. The music data and evaluation results establish a benchmark for future work that attempts to bridge the audio-symbolic gap.