Solo Plucked String Sound Detection by the Energy-To-Spectral Flux Ratio (ESFR)

Jeong, Gyuhyeok; Kang, In Gyu; Lee, Byung Suk; Lee, Chang-Heon

AES E-Library

Solo Plucked String Sound Detection by the Energy-To-Spectral Flux Ratio (ESFR)

We address the problem of distinguishing solo plucked string sound from speech. Due to the harmonic components present in both types of signals, a low complexity music/speech classifier often misclassifies these signals. To capture the sustained harmonic structures observed in solo plucked string sound, we propose a new feature, the Energy-to-Spectral Flux Ratio (ESFR). The values and the statistics of the ESFR for solo plucked string sound were distinct from those for speech when calculated over windows of 20 to 50 ms. By building a low complexity detector with the ESFR, we demonstrate the discriminating performance of the ESFR feature for the considered problem.

Authors: Jeong, Gyuhyeok; Kang, In Gyu; Lee, Byung Suk ;Lee, Chang-Heon
Affiliations: LG Electronics, Inc., Seocho-gu, Seoul, Korea; Yonsei University, Seoul, Korea; Columbia University, New York, NY, USA(See document for exact affiliation information.)
AES Convention: 129 (November 2010) Paper Number: 8200
Publication Date: November 4, 2010 Import into BibTeX
Subject: Audio Processing
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=15622

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Learn more about the AES E-Library

E-Library Location: (CD 129Papers) /129/8200.pdf

Start a discussion about this paper!

AES E-Library

Solo Plucked String Sound Detection by the Energy-To-Spectral Flux Ratio (ESFR)

ABOUT AES

Contact Us