While dynamics is an important characteristic in music performance, it has been rarely researched in automatic music transcription. We propose a method to estimate individual note intensities from a piano recording given pre-aligned score data of the recording. To this end, we use non-negative matrix factorization in a score-informed setting, where the basis vectors and weights are constrained to estimate harmonic note spectra and corresponding intensities, respectively. We examine various choices in the learning process including the use of synthesized note scale for initialization, type of spectrum, and temporal constraint. We evaluate our method using Saarland Music Data and estimate note intensities in MIDI velocity. The results show that the proposed method makes an improvement compare to previous work.
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.