AES E-Library
Music Structure Segmentation Using the Azimugram in Conjunction with Principal Component Analysis
A novel method to segment stereo music recordings into formal musical structures such as verses and choruses is presented. The method performs dimensionality reduction on a time-azimuth representation of the audio, which yields a set of time activation sequences, each corresponding to a repeating structural segment. This rests on the assumption that each segment type, such as a verse or chorus, has a unique energy distribution across the stereo field. It can be shown that these energy distributions, together with their time activation sequences, are the latent principal components of the time-azimuth representation, and that each time activation sequence represents a structural segment such as a verse or chorus.
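The idea in the abstract can be illustrated with a rough sketch. This is not the paper's implementation: it assumes a time-azimuth matrix (frames by azimuth bins) is already available, and it substitutes a synthetic toy matrix for a real azimugram. The function name `pca_time_azimuth`, the toy "verse" and "chorus" profiles, and the sign-based labelling are all hypothetical choices made for illustration. PCA is computed here with a plain SVD of the mean-centred matrix.

```python
import numpy as np

def pca_time_azimuth(azimugram, n_components=2):
    """PCA of a (n_frames, n_azimuth_bins) energy matrix via SVD.

    Returns the time activation sequences (one column per component),
    the azimuth energy distributions (one row per component), and the
    mean azimuth profile that was removed before the decomposition.
    """
    X = np.asarray(azimugram, dtype=float)
    mean = X.mean(axis=0)            # average energy per azimuth bin
    Xc = X - mean                    # centre each azimuth bin over time
    U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    activations = U[:, :n_components] * s[:n_components]
    distributions = Vt[:n_components]
    return activations, distributions, mean

# Synthetic stand-in for an azimugram (an assumption, not real data):
# two segment types with energy concentrated at different pan positions.
rng = np.random.default_rng(0)
bins = np.arange(16)
verse = np.exp(-0.5 * ((bins - 4) / 1.5) ** 2)    # energy panned left
chorus = np.exp(-0.5 * ((bins - 11) / 1.5) ** 2)  # energy panned right
profiles = np.stack([verse, chorus])

# Layout: verse, chorus, verse, chorus (20 frames each).
layout = [0] * 20 + [1] * 20 + [0] * 20 + [1] * 20
toy = profiles[layout] + 0.01 * rng.random((80, 16))

acts, dists, _ = pca_time_azimuth(toy, n_components=1)

# With only two segment types, the sign of the first activation sequence
# separates them. Which type gets label 0 or 1 is arbitrary, because the
# sign of a principal component is not determined.
labels = (acts[:, 0] > 0).astype(int)
```

In this toy case the first activation sequence switches sign at each verse/chorus boundary, so the segment boundaries fall where `labels` changes. A real recording would need an actual azimugram front end and a less naive way of turning several activation sequences into segment labels.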






