Sound Identification from MPEG-Encoded Audio Files

Studniarz, Joseph G.; Maher, Robert C.

AES E-Library


Numerous methods have been proposed for searching and analyzing long-term audio recordings for specific sound sources. Audio recordings are increasingly archived using perceptual compression, such as MPEG-1 Layer 3 (MP3). Rather than performing sound identification on the reconstructed time waveform after decoding, we operate on the undecoded MP3 audio data to improve processing speed and efficiency. The compressed data are only partially processed, using the initial bitstream unpacking of a standard decoder; sound identification is then performed directly on the frequency spectrum represented by each MP3 data frame. Practical uses are demonstrated for identifying anthropogenic sounds within a natural soundscape recording.
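
The idea of matching sounds directly on per-frame spectra can be illustrated with a minimal sketch. It is not the authors' method, which the abstract does not detail. It assumes the bitstream unpacking has already produced a list of spectral coefficients for each MP3 granule (576 frequency lines per channel in MPEG-1 Layer 3), and it swaps in a simple band-energy profile compared by cosine similarity as a stand-in classifier. The names `band_energies`, `cosine_similarity`, `detect`, the band count, and the threshold are all hypothetical.

```python
import math

# Frequency lines per granule per channel in MPEG-1 Layer 3.
GRANULE_BINS = 576


def band_energies(coeffs, n_bands=8):
    """Sum squared coefficients over n_bands equal-width frequency bands."""
    width = len(coeffs) // n_bands
    return [
        sum(c * c for c in coeffs[b * width:(b + 1) * width])
        for b in range(n_bands)
    ]


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def detect(frames, template, threshold=0.9):
    """Return indices of frames whose band-energy profile matches the template.

    frames: list of per-granule coefficient lists (assumed already unpacked).
    template: band-energy profile of the target sound (hypothetical input).
    """
    return [
        i for i, coeffs in enumerate(frames)
        if cosine_similarity(band_energies(coeffs), template) >= threshold
    ]


# Synthetic stand-ins for unpacked granules: energy only in the lowest band
# versus energy only in the highest band.
low = [1.0] * 72 + [0.0] * (GRANULE_BINS - 72)
high = [0.0] * (GRANULE_BINS - 72) + [1.0] * 72

template = band_energies(low)
hits = detect([low, high, low], template)
print(hits)  # [0, 2]
```

Because the comparison works on coefficients that the decoder has already unpacked, a scheme like this avoids the synthesis filterbank and time-domain reconstruction. A practical detector would need real unpacked data, a perceptually motivated band layout, and smoothing across frames.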

Authors: Studniarz, Joseph G.; Maher, Robert C.
Affiliation: Montana State University, Bozeman, MT, USA
AES Convention: 135 (October 2013)
Paper Number: 8984
Publication Date: October 16, 2013
Subject: Applications in Audio
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=17032

E-Library Location: (CD 135Papers) /conv/135/8984.pdf
