Metadata for Audio
|Percussion-Related Semantic Descriptors of
Music Audio Files|
|Perfecto Herrera1, Vegard
Sandvold2, Fabien Gouyon1|
1Universitat Pompeu Fabra, Barcelona, Spain
2University of Oslo, Oslo, Sweden
|Automatic extraction of semantic music content metadata
from polyphonic audio files has traditionally focused
on melodic, rhythmic, and harmonic aspects. In
the present paper we will present several music content
descriptors that are related to percussion instrumentation.
The "percussion index" estimates the
amount of percussion that can be found in a music audio
file and yields a (numerical or categorical) value
that represents the amount of percussion detected in
the file. A further refinement is the "percussion profile,"
which roughly indicates the existing balance between
drums and cymbals. We finally present the "percusiveness"
descriptor, which represents the overall impulsiveness
or abruptness of the percussive events. Data
from initial evaluations, both objective (i.e., errors,
misses, false alarms) and subjective (usability, usefulness)
will also be presented and discussed.|