Journal Forum

A Meta-Analysis of High Resolution Audio Perceptual Evaluation - June 2016

Synthetic Reverberator - January 1960

Sound Board: High-Resolution Audio - October 2015

Access Journal Forum

AES E-Library

Gaussian Mixture Model for Singing Voice Separation from Stereophonic Music

Document Thumbnail

This paper presents an adaptive prediction method about source-specific ranges of binaural cues, such as inter-channel level difference (ILD) and inter-channel phase difference (IPD), for centrally positioned singing voice separation. To this end, we employ Gaussian mixture model (GMM) to cluster underlying distributions in the feature domain of mixture signal. By regarding responsibilities to those distinct Gaussians as unmixing coefficients of each mixture spectrogram sample, the proposed method can reduce artificial deformations that previous center channel extraction methods usually suffer, caused by their imprecise or rough decision about ranges of central subspaces. Experiments on commercial music show superiority of the proposed method.

AES Conference:
Paper Number:
Publication Date:

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $33 for non-members and is temporarily free for AES members.

Learn more about the AES E-Library

E-Library Location:

Start a discussion about this paper!

Facebook   Twitter   LinkedIn   Google+   YouTube   RSS News Feeds  
AES - Audio Engineering Society