A Robust and Computationally Efficient Speech/Music Discriminator

Barbedo, Jayme Garcia Arnal; Lopes, Amauri

AES E-Library

A Robust and Computationally Efficient Speech/Music Discriminator

A New method for discriminating between speech and music signals is introduced. The strategy is based on the extraction of four features, whose values are combined linearly into a unique parameter. This parameter is used to distinguish between the two kinds of signals. The method has achieved an accuracy superior to 99%, even for severely degraded and noisy signals. Moreover, the low dimensionality of the feature space, together with a very simple information-merging technique, has resulted in a remarkable robustness to new situations. The low computational complexity of the method makes it appropriate for applications that demand real-time operation. Finally excellent resolution for the segmentation of audio streams is achieved by manipulating the analyzed data properly.

Authors: Barbedo, Jayme Garcia Arnal; Lopes, Amauri
Affiliation: FEEC, UNICAMP, Campinas, SP, Brazil
JAES Volume 54 Issue 7/8 pp. 571-588; July 2006
Publication Date: July 15, 2006 Import into BibTeX
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=13897

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Learn more about the AES E-Library

E-Library Location: (CD JAES54) /jaes54/7/pg571.pdf

Start a discussion about this paper!

AES E-Library

A Robust and Computationally Efficient Speech/Music Discriminator

ABOUT AES

Contact Us