In This Section
- New "AES Portal" in Process of Being Rolled Out
- Take a moment to create your new Login today!
- AES 2016 Election Results Announced
- The AES has released the list of winning candidates from the balloting in the 2016 Audio Engineering Society international elections
- AES Opens Early Registration & Housing Options for AES Los Angeles, September 29 — October 2
- Use promo code AES141WEB at checkout for FREE Exhibit-Plus badge
- Research Finds Audible Differences with High-Resolution Audio
- Listeners can hear a difference between standard audio and better-than-CD quality, known as high-resolution audio
Music Structure Analysis
International Audio Laboratories Erlangen, Germany
One of the attributes distinguishing music from other sound sources is the hierarchical structure in which music is organized. Individual sound events corresponding to individual notes form larger structures such as motives, phrases, and chords, and these elements again form larger constructs that determine the overall layout of the composition. One important goal of music structure analysis is to divide a given audio recording into temporal segments that correspond to musical parts and to group these segments into musically meaningful categories. One challenge is that there are many different criteria for segmenting and structuring music. This results in conceptually different approaches, which may be loosely categorized in repetition-based, novelty-based, and homogeneity-based approaches. Furthermore, one has to account for different musical dimensions such as melody, harmony, rhythm, and timbre. In this talk, I will give an overview of current approaches for the computational analysis of the structure of music recordings, which has been a very active research problem within the area of music information retrieval. As one example, I present a novel audio thumbnailing procedure to determine the audio segment that best represents a given music recording. Furthermore, I show how path and block structures of self-similarity matrices, the most important tool used in automated structure analysis, can be enhanced and transformed. Finally, I report on a recent novelty-based segmentation approach that combines homogeneity and repetition principles in a single representation referred to as structure feature.
Bio of presenter
Meinard Müller studied mathematics (Diplom) and computer science (Ph.D.) at the University of Bonn, Germany. In 2002/2003, he conducted postdoctoral research in combinatorics at the Mathematical Department of Keio University, Japan. In 2007, he finished his Habilitation at Bonn University in the field of multimedia retrieval writing a book titled "Information Retrieval for Music and Motion," which appeared as Springer monograph. From 2007 to 2012, he was a member of the Saarland University and the Max-Planck Institut für Informatik leading the research group "Multimedia Information Retrieval and Music Processing" within the Cluster of Excellence on Multimodal Computing and Interaction. Since September 2012, Meinard Müller holds a professorship for "Semantic Audio Processing" at the International Audio Laboratories Erlangen, which is a joint institution of the Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU) and the Fraunhofer-Institut für Integrierte Schaltungen IIS. His recent research interests include content-based multimedia retrieval, audio signal processing, music processing, music information retrieval, and motion processing.