Speech Music Discrimination Using an Ensemble of Biased Classifiers

Kim, Kibeom; Baijal, Anant; Ko, Byeong-Seob; Lee, Sangmoon; Hwang, Inwoo; Kim, Youngtae

In this paper we present a novel framework for real-time speech/music discrimination (SMD). The proposed method improves the overall accuracy of automatically classifying audio signals into speech, singing, or instrumental categories. First, we design several groups of classifiers such that each group’s classification decision is biased towards a certain class of sounds; the bias is induced by training different groups of classifiers on perceptual features extracted at different temporal resolutions. Then, we build our system from an ensemble of these biased classifiers arranged in parallel. Finally, these ensembles are combined with a weighting scheme, which can be tuned in either forward-weighting or inverse-weighting mode, to provide accurate results in real time. We show, through extensive experimental evaluations, that the proposed ensemble-of-biased-classifiers framework outperforms the baseline approach.
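The combination step described in the abstract can be illustrated with a minimal sketch of weighted fusion over parallel classifier groups. This is not the authors' implementation: the abstract does not define the forward-weighting or inverse-weighting modes, so the weights below are simply supplied by the caller. The function names (`fuse`, `classify`), the per-group probabilities, and the weight values are all hypothetical and chosen only for illustration.

```python
# Illustrative sketch only: weighted fusion of class probabilities from
# several classifier groups. Names, probabilities, and weights are
# hypothetical; the paper's actual weighting scheme is not given in the
# abstract.

CLASSES = ("speech", "singing", "instrumental")


def fuse(group_probs, weights):
    """Return the weighted average of per-group class probabilities."""
    if len(group_probs) != len(weights):
        raise ValueError("need exactly one weight per classifier group")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    fused = [0.0] * len(CLASSES)
    for probs, weight in zip(group_probs, weights):
        for i, p in enumerate(probs):
            fused[i] += weight * p / total
    return fused


def classify(group_probs, weights):
    """Pick the class with the highest fused probability."""
    fused = fuse(group_probs, weights)
    best = max(range(len(CLASSES)), key=fused.__getitem__)
    return CLASSES[best]


# Hypothetical outputs of three groups, each assumed to be trained on
# features from a different temporal resolution (short, medium, long).
example = [
    [0.6, 0.3, 0.1],  # short-window group, leans towards speech
    [0.3, 0.5, 0.2],  # medium-window group, leans towards singing
    [0.2, 0.3, 0.5],  # long-window group, leans towards instrumental
]

print(classify(example, [3, 1, 1]))  # emphasizes the short-window group
print(classify(example, [1, 1, 3]))  # emphasizes the long-window group
```

With these made-up numbers, emphasizing the short-window group yields "speech" and emphasizing the long-window group yields "instrumental", which shows how tuning the weights shifts the decision between differently biased groups.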

Authors: Kim, Kibeom; Baijal, Anant; Ko, Byeong-Seob; Lee, Sangmoon; Hwang, Inwoo; Kim, Youngtae
Affiliation: Samsung Electronics Co. Ltd., Suwon, Gyeonggi-do, Korea
AES Convention: 139 (October 2015) Paper Number: 9457
Publication Date: October 23, 2015
Subject: Applications in Audio
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=18013

E-Library Location: (CD 139Papers) /conv/139/9457.pdf