AES E-Library

Detection of Audio Events by Boosted Learning of Local Time-Frequency Patterns

It is often desired to detect some particular short sound events from an audio recording. For example, in music analysis and processing, one may be interested in detection of percussive events. In environmental audio analysis one may look for individual sound events related to some activity, for example, sounds of footsteps from a walking person. Generally these problems can be solved by matching some prototype time-frequency (TF) patterns to a TF representation of the input signals to obtain time-varying probability functions for the prototype events. The method introduced in this paper is based on a small number of locally collected event patterns that are used directly to dene features for weighted cascade of weak classiffiers that is trained using the AdaBoost algorithm. The results of a comparison to a traditional sound event classier based on the mel-frequency cepstrum coecients and a hidden Markov model are very encouraging.

 

Author (s):
Affiliation: (See document for exact affiliation information.)
Publication Date:
Session subject:

DOI:


Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Type:
16938
Choose your country of residence from this list: