The current work deals with audio event detection, segmentation and characterization, in order to be further utilized in post-production. Browsing, selection and characterization of audio-visual content is a tiresome task, especially in audio / video editing applications, where an enormous amount of recordings with different characteristics is usually involved. Automated detection, segmentation and general audio classification are essential to deploy flexible and effective audio-visual content management. A multi-resolution scanning procedure, based mainly in wavelet-processing, is currently proposed where various energy-based comparators and signal-complexity metrics have been tested for detection purposes. A variety of audio features, including MPEG-7 audio low level descriptors, have been considered for events’ characterization and indexing purposes. Extraction of the detection / characterization results via MPEG-7 description schemes or similar indexing files are considered.
https://www.aes.org/e-lib/browse.cfm?elib=14123
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.
Learn more about the AES E-Library
Start a discussion about this paper!