With the proliferation of interconnected devices that have built-in microphones, acoustic event classification and monitoring become possible in a wide variety of applications, such as surveillance, healthcare, military, machine diagnostics, and wildlife tracking. The promise and success of these applications depend on robust sensing of acoustic events in the environment. Typically, sound event classes are defined by annotating training data, which is a laborious process. This work introduces an extended version of non-negative matrix deconvolution (NMD), called low-resolution multi-label non-negative matrix deconvolution (LRM-NMD), in which both the observation data and the available labeling information are used during training. Low-resolution multi-label information simply indicates which sound classes occur within a longer segment of the acoustic data, without identifying the beginnings or endings of the individual events. The proposed extension of NMD was successfully applied to the classification of acoustic events, even in noisy conditions with overlapping events.
http://www.aes.org/e-lib/browse.cfm?elib=19567