H-Semantics: A Hybrid Approach to Singing Voice Separation

Sofianos, Stratis; Ariyaeeinia, Aladdin; Polfreman, Richard; Sotudeh, Reza

AES E-Library

H-Semantics: A Hybrid Approach to Singing Voice Separation

Separating the singing voice from accompanying instruments is important in music information-retrieval systems, since it allows for such applications as melody extraction, lyrics recognition, and singer identity. The authors investigate effective methods for unsupervised separation of the singing voice, called H-Semantics (Hybrid Singing Extraction through Multiband Amplitude Enhanced Thresholding and Independent Component Subtraction). The proposed method adds time-domain separation to the previous work that was based on frequency-domain cepstral methods. The results indicate separation of approximately 8.5 dB signal-to-distortion ratio over the baseline.

Authors: Sofianos, Stratis; Ariyaeeinia, Aladdin; Polfreman, Richard; Sotudeh, Reza
Affiliations: University of Hertfordshire, Hatfield, Hertfordshire, UK; University of Southampton, Southampton, UK(See document for exact affiliation information.)
JAES Volume 60 Issue 10 pp. 831-841; October 2012
Publication Date: November 26, 2012 Import into BibTeX
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=16556

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Learn more about the AES E-Library

E-Library Location: (CD JAES60) /jaes60/10/pg831.pdf

Start a discussion about this paper!

AES E-Library

H-Semantics: A Hybrid Approach to Singing Voice Separation

ABOUT AES

Contact Us