Soundscape Audio Signal Classification and Segmentation Using Listeners Perception of Background and Foreground Sound

Thorogood, Miles; Fan, Jianyu; Pasquier, Philippe

AES E-Library

Soundscape Audio Signal Classification and Segmentation Using Listeners Perception of Background and Foreground Sound

A soundscape recording captures the sonic environment at a given location at a given time using one or more fixed or moving microphones. In most cases, the soundscape is uncontrolled and unscripted. Human listeners experience sonic components as being either background or foreground depending on their salient perceptual characteristics, such as proximity, repetition, and spectral attributes. Analyzing soundscapes in research tasks requires the classification and segmentation of the important sonic components, but that process is time consuming when done manually. This research establishes the background and foreground classification task within a musicological and soundscape context and then presents a method for the automatic segmentation of soundscape recordings. Using a soundscape corpus with ground truth data obtained from a human perception study, the analysis shows that participants have a high level of agreement on the category assigned to background samples (92.5%), foreground samples (80.8%), and background with foreground samples (75.3%). Experiments demonstrate how smaller window sizes affect the performance of the classifier.

Authors: Thorogood, Miles; Fan, Jianyu; Pasquier, Philippe
Affiliation: Simon Fraser University, SIAT, Canada
JAES Volume 64 Issue 7/8 pp. 484-492; July 2016
Publication Date: August 11, 2016 Import into BibTeX
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=18334

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Learn more about the AES E-Library

E-Library Location: (CD JAES64) /jaes64/7/pg484.pdf

DOI: https://doi.org/10.17743/jaes.2016.0021

Start a discussion about this paper!

AES E-Library

Soundscape Audio Signal Classification and Segmentation Using Listeners Perception of Background and Foreground Sound

ABOUT AES

Contact Us