A Psychoacoustic-based Vocal Suppression for Enhanced Interactive Service Using Spatial Audio Object Coding

Lee, Tung Chin; Park, Young-cheol; Youn, Dae Hee

AES E-Library

In this paper we present a new vocal suppression algorithm that enhances the quality of music signals coded with Spatial Audio Object Coding (SAOC) in Karaoke mode. The residual vocal component in the coded music signal is estimated and suppressed using spectral subtraction. Because the level of the residual vocal component varies with the input object power, we propose a psychoacoustic rule that adapts the suppression level according to auditory masking. Objective and subjective tests were performed, and the results confirm that the proposed algorithm offers improved quality.
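The abstract does not give the suppression rule itself, so the following is only a minimal sketch of the general idea: power spectral subtraction whose subtraction factor depends on how far the estimated residual vocal rises above a masking threshold. The function name `suppression_gains`, the parameters `alpha_min`, `alpha_max`, and `floor`, and the linear 0 to 20 dB mapping are all assumptions made for illustration, not the authors' method. The masking threshold is taken as a given input; how the paper computes it is not stated here.

```python
import math


def suppression_gains(mix_power, vocal_power, mask_threshold,
                      alpha_min=0.0, alpha_max=2.0, floor=0.05):
    """Per-bin amplitude gains for masking-adapted power spectral subtraction.

    mix_power      -- power spectrum of the decoded Karaoke-mode signal
    vocal_power    -- estimated power of the residual vocal in each bin
    mask_threshold -- masking threshold per bin (assumed to be supplied)
    All parameter names and default values are hypothetical.
    """
    gains = []
    for y, v, t in zip(mix_power, vocal_power, mask_threshold):
        if y <= 0.0:
            # Empty bin: nothing to subtract from, fall back to the floor.
            gains.append(floor)
            continue
        # How far the residual vocal exceeds the masking threshold, in dB.
        excess_db = 10.0 * math.log10((v + 1e-12) / (t + 1e-12))
        # Assumed mapping: 0 dB excess -> alpha_min, 20 dB or more -> alpha_max.
        w = min(max(excess_db / 20.0, 0.0), 1.0)
        alpha = alpha_min + w * (alpha_max - alpha_min)
        # Power subtraction, clamped at zero, then limited by a spectral floor.
        g2 = max(1.0 - alpha * v / y, 0.0)
        gains.append(max(math.sqrt(g2), floor))
    return gains


if __name__ == "__main__":
    # Toy three-bin frame: masked residual, audible residual, empty bin.
    print(suppression_gains([1.0, 1.0, 0.0],
                            [0.25, 0.25, 0.1],
                            [1.0, 0.0025, 1.0]))
```

With these assumed defaults, a bin whose residual vocal lies below the masking threshold is left untouched, while a clearly audible residual is over-subtracted. Each gain would be applied to the corresponding bin of the short-time spectrum before resynthesis.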

Authors: Lee, Tung Chin; Park, Young-cheol; Youn, Dae Hee
Affiliations: Yonsei University, Seoul, Korea; Yonsei University, Wonju, Kwangwon-do, Korea (see document for exact affiliation information)
AES Convention: 136 (April 2014) Paper Number: 9067
Publication Date: April 25, 2014
Subject: Audio Signal Processing/Transducers/Recording/Network Audio
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=17214

E-Library Location: (CD 136Papers) /conv/136/9067.pdf