This paper illustrates a signal adaptive analysis technique to transcribe monophonic sounds. Unlike other models, which segment audio relying on the onset time-domain analysis, this model principally exploits pitch information. Pitch is locked after detection. The structure of a musical note, i.e. harmonic frequency structure and time-envelope model, is exploited to segment and transcribe the signal. The system is inspired by the Integrated Processing and Undestanding of Signals system (IPUS) where abstract explanation and best front-end configuration are iteratively searched. Onsets and pitch are searched in two different domains and integrated with the system knowledge to give a coherent interpretation of the signal. The system transcribes with success from fast trumpet riffs to long sustained violin vibrato.
https://www.aes.org/e-lib/browse.cfm?elib=11347
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.
Learn more about the AES E-Library
Start a discussion about this paper!