A Psychoacoustic-based Vocal Suppression for Enhanced Interactive Service Using Spatial Audio Object Coding
In this paper we present a new vocal suppression algorithm that can enhance the quality of music signal coded using Spatial Audio Object Coding (SAOC) in Karaoke mode. The remained vocal component in the coded music signal is estimated and suppressed by using a spectral subtraction method. Using the fact that the level of the remained vocal components is varied depending on the input object power, we propose a psychoacoustic rule where the suppression level is adapted according to the auditory masking property. Objective and subjective test were performed and the results confirm that the proposed algorithm offers an improved quality.
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is temporarily free for AES members.