The Perception of Audio Signals Reduced by Overmasking to the Most Prominent Spectral Amplitudes (Peaks)
Audio signals were reduced to the 70%, 40%, and 5% most prominent spectral amplitudes (spectral peaks) by means of an overmasking function. Applying the commonly used spreading function with slopes of -27 dB/Bark and -24 dB/Bark yields the so-called Irrelevance Threshold, which retains approximately 70% of all spectral components. According to the psychoacoustic theory of simultaneous masking, no spectral components below that threshold can be perceived in steady-state sounds. Nevertheless, the difference between the original and the processed signals can be made audible by spectral subtraction. With 60% and 95% reduction (40% and 5% of the components retained), the Irrelevance Threshold is exceeded and the peaks of the amplitude spectra are enhanced. The intelligibility of speech signals drops progressively, and musical signals retain only the leading voice. The difference between the original and the processed signal can be perceived clearly, and the difference signal itself can again be made audible. The results are discussed in terms of psychoacoustic figure/background discrimination.
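The reduction and the spectral subtraction described above can be illustrated with a minimal sketch. It is not the authors' method: the abstract does not specify the overmasking function, so a plain ranking of spectral bins by magnitude is assumed here as a stand-in, and no Bark-scale spreading function is applied. The function names (`dft`, `idft`, `keep_most_prominent`) and the parameter `keep_fraction` are hypothetical, and a naive DFT is used only to keep the example free of dependencies.

```python
import cmath
import math


def dft(x):
    """Naive discrete Fourier transform (adequate for short frames only)."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spectrum):
    """Naive inverse DFT; returns the real part of each sample."""
    n = len(spectrum)
    return [(sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n)
                 for k in range(n)) / n).real
            for t in range(n)]


def keep_most_prominent(frame, keep_fraction):
    """Split a frame into its most prominent spectral peaks and the rest.

    ASSUMPTION: bins are ranked by magnitude alone. This replaces the
    paper's overmasking function, which the abstract does not define.
    Returns (reduced, difference), where difference is obtained by
    spectral subtraction: original spectrum minus reduced spectrum.
    """
    n = len(frame)
    spectrum = dft(frame)
    half = n // 2 + 1  # one-sided spectrum of a real-valued signal
    n_keep = max(1, round(keep_fraction * half))
    ranked = sorted(range(half), key=lambda k: abs(spectrum[k]), reverse=True)
    kept = set(ranked[:n_keep])
    # Keep each selected bin together with its mirror bin so that the
    # reduced signal stays real-valued.
    reduced_spec = [spectrum[k] if min(k, n - k) in kept else 0j
                    for k in range(n)]
    difference_spec = [s - r for s, r in zip(spectrum, reduced_spec)]
    return idft(reduced_spec), idft(difference_spec)


# Example frame: a strong partial plus a weak one, ten times smaller.
n = 32
frame = [math.sin(2 * math.pi * 3 * t / n)
         + 0.1 * math.sin(2 * math.pi * 7 * t / n) for t in range(n)]
reduced, difference = keep_most_prominent(frame, keep_fraction=0.05)
```

In this toy frame, keeping 5% of the bins retains only the strong partial, while the weak partial ends up entirely in the difference signal. The two outputs sum back to the original frame, which is what makes the removed components available for separate listening.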