Perceptual audio codecs use psychoacoustic models for irrelevancy reduction by exploiting masking effects in the human auditory system. In masking, the tonality of the masker plays an important role and therefore should be evaluated in the psychoacoustic model. In this study a partial Spectral Flatness Measure (SFM) is applied to a filter bank-based psychoacoustic model to estimate tonality. The Infinite Impulse Response (IIR) band-pass filters are designed to take into account the spreading in simultaneous masking. Tonality estimation is adapted to temporal and spectral resolution of the auditory system. Employing subjective audio coding preference tests, the Partial SFM is compared with prediction-based tonality estimation.
https://www.aes.org/e-lib/browse.cfm?elib=16725
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.
Learn more about the AES E-Library
Start a discussion about this paper!