Gaussian Mixture Model for Singing Voice Separation from Stereophonic Music

Kim, Minje and Beack, Seungkwon and Choi, Keunwoo and Kang, Kyeongok

AES E-Library

This paper presents a method for adaptively predicting the source-specific ranges of binaural cues, such as the inter-channel level difference (ILD) and inter-channel phase difference (IPD), in order to separate a centrally positioned singing voice. To this end, we employ a Gaussian mixture model (GMM) to cluster the underlying distributions in the feature domain of the mixture signal. By treating the responsibilities of those distinct Gaussians as the unmixing coefficients of each mixture spectrogram sample, the proposed method can reduce the artificial deformations that previous center channel extraction methods usually suffer from, which are caused by their imprecise or rough decisions about the ranges of the central subspaces. Experiments on commercial music show the superiority of the proposed method.
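The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and everything specific in it is an assumption made for illustration: the function names (`binaural_cues`, `fit_gmm`), the diagonal covariances, the two-component model, the initial means, and the synthetic test data. It computes ILD and IPD for each pair of left/right spectrogram bins, fits a GMM with EM, and uses the responsibility of the component nearest (ILD, IPD) = (0, 0) as a soft mask for the centered source.

```python
import cmath
import math
import random

def binaural_cues(left, right, eps=1e-12):
    """Return (ILD in dB, IPD in radians) for one pair of complex bins."""
    ild = 20.0 * math.log10((abs(left) + eps) / (abs(right) + eps))
    ipd = cmath.phase(left * right.conjugate())
    return ild, ipd

def log_gauss_diag(x, mean, var):
    """Log density of a Gaussian with diagonal covariance (assumed form)."""
    return sum(-0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
               for xi, m, v in zip(x, mean, var))

def fit_gmm(points, init_means, n_iter=50, var_floor=1e-3):
    """Fit a diagonal-covariance GMM with EM.

    Returns weights, means, variances, and the responsibilities
    computed in the final E-step.
    """
    k, d, n = len(init_means), len(points[0]), len(points)
    means = [list(m) for m in init_means]
    variances = [[1.0] * d for _ in range(k)]
    weights = [1.0 / k] * k
    resp = []
    for _ in range(n_iter):
        # E-step: posterior probability of each component for each point.
        resp = []
        for x in points:
            logp = [math.log(weights[j]) + log_gauss_diag(x, means[j], variances[j])
                    for j in range(k)]
            top = max(logp)
            p = [math.exp(v - top) for v in logp]
            total = sum(p)
            resp.append([v / total for v in p])
        # M-step: re-estimate weights, means, and variances.
        for j in range(k):
            nj = sum(r[j] for r in resp) + 1e-12
            weights[j] = nj / n
            for a in range(d):
                means[j][a] = sum(r[j] * x[a] for r, x in zip(resp, points)) / nj
                spread = sum(r[j] * (x[a] - means[j][a]) ** 2
                             for r, x in zip(resp, points)) / nj
                variances[j][a] = max(var_floor, spread)
    return weights, means, variances, resp

# Synthetic stereo bins (hypothetical data): even indices are centered,
# odd indices are panned left with a phase offset between channels.
rng = random.Random(0)
bins = []
for i in range(200):
    s = cmath.rect(rng.uniform(0.5, 1.5), rng.uniform(-math.pi, math.pi))
    if i % 2 == 0:
        r = s * cmath.rect(1.0 + rng.gauss(0, 0.05), rng.gauss(0, 0.05))
    else:
        r = s * cmath.rect(0.5 + rng.gauss(0, 0.02), -0.8 + rng.gauss(0, 0.05))
    bins.append((s, r))

features = [binaural_cues(l, r) for l, r in bins]
# Initial means are an assumption; one starts at the center position.
weights, means, variances, resp = fit_gmm(features, [(0.0, 0.0), (3.0, 0.5)])

# The component whose mean is nearest (0, 0) is taken as the center source.
center = min(range(len(means)), key=lambda j: means[j][0] ** 2 + means[j][1] ** 2)
mask = [r[center] for r in resp]  # soft unmixing coefficient per bin
```

Each value in `mask` lies between 0 and 1 and could be multiplied with the corresponding mixture spectrogram sample to estimate the centered source. Because the mask is a posterior probability rather than a hard threshold on the ILD and IPD ranges, the boundary of the central region adapts to the data.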

Authors: Kim, Minje; Beack, Seungkwon; Choi, Keunwoo; Kang, Kyeongok
Affiliation: Electronics and Telecommunications Research Institute (ETRI), Daejeon, Korea (See document for exact affiliation information.)
Publication Date: 2011-09-06
Session subject: Interactive Audio

DOI:

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Type: Conference Paper
