AES Paris 2006: Paper Session P2

120th AES Convention - Paris, France - Dates: Saturday May 20 - Tuesday May 23, 2006 - Porte de Versailles

Registration
Exhibition
Calendar
4 Day Planner
Paper Sessions
Workshops
Tutorials
Exhibitor Seminars
Application Seminars
Student Program
Special Events
Technical Tours
Heyser Lecture
Tech Comm Mtgs
Standards Mtgs
Hotel Reservation
Presenters

Last Updated: 20060331, mei

P2 - Analysis and Synthesis of Sound; Mobile Phone Audio; Automotive Audio

Saturday, May 20, 09:00 — 11:40

Chair: Tim Brooks, Institute of Sound Recording - Surrey, UK

P2-1 Application of Segmentation and Thumbnailing to Music Browsing and Searching—Mark Levy, Mark Sandler, Queen Mary, University of London - London, UK
We present a method for segmenting musical audio into structural sections and some rules for choosing a representative “thumbnail” segment. We demonstrate how audio thumbnails are an effective and natural way of returning results in music search applications. We investigate the use of segment-based models for music similarity searching and recommendation. We report experimental results of the performance and efficiency of these approaches in the context of SoundBite, a demonstration music thumbnailing and search engine.

[Associated Poster Presentation in Session P6, Saturday, May 20, at 16:00]

Presentation is scheduled to begin at 09:00
Convention Paper 6642 (Purchase now)

P2-2 Multiple F0 Tracking in Solo Recordings of Monodic Instruments—Chunghsin Yeh, Axel Röbel, Xavier Rodet, IRCAM - Paris, France
This paper is concerned with the F0 tracking in monodic instrument solo recordings. Due to reverberation, the observed signal is rather polyphonic, and single-F0 tracking techniques often give unsatisfying results. The proposed method is based on multiple-F0 estimation and makes use of the a priori knowledge that the observed spectrum is generated by a single monodic instrument. The predominant F0 is tracked first and the secondary F0 tracks are then established. The proposed method is tested on reverberant recordings and show significant improvements compared to single-F0 estimators.

[Associated Poster Presentation in Session P6, Saturday, May 20, at 16:00]

Presentation is scheduled to begin at 09:20
Convention Paper 6643 (Purchase now)

P2-3 Harmonic Plus Noise Decomposition: Time-Frequency Reassignment Versus a Subspace-Based Method—Bertrand David, Valentin Emiya, Roland Badeau, Yves Grenier, Ecole Nationale Supérieure de Télécommunications - Paris Cedex, France
This paper deals with the Harmonic + Noise decomposition and, as a targeted application, to extract transient background noise surrounded by a signal having a strong harmonic content (speech for instance). In that perspective, a method based on the reassigned spectrum and a high-resolution subspace tracker are compared, both on simulations and in a more realistic manner. The reassignment relocalizes the time-frequency energy around a given pair (analysis time index, analysis frequency bin) while the high resolution method benefits from a characterization of the signal in terms of a space spanned by the harmonic content and a space spanned by the stochastic content. Both methods are adaptive and the estimations are updated from a sample to the next.

[Associated Poster Presentation in Session P6, Saturday, May 20, at 16:00]

Presentation is scheduled to begin at 09:40
Convention Paper 6644 (Purchase now)

P2-4 Signal Analysis Using the Complex Spectral Phase Evolution (CSPE) Method—Kevin Short, Ricardo Garcia, Chaoticom Technologies - Andover, MA, USA
The Complex Spectral Phase Evolution (CSPE) method is introduced as a tool to analyze and detect the presence of short-term stable sinusoidal components in an audio signal. The method provides for super-resolution of frequencies by examining the evolution of the phase of the complex signal spectrum over time-shifted windows. It is shown that this analysis, when applied to a sinusoidal signal component, allows for the resolution of the true signal frequency with orders of magnitude greater accuracy than the DFT. Further, this frequency estimate is independent of the frequency bin and can be estimated from “leakage” bins far from spectral peaks. The method is robust in the presence of noise or nearby signal components, and is a fundamental tool in the front-end processing for the KOZ compression technology.

[Associated Poster Presentation in Session P6, Saturday, May 20, at 16:00]

Presentation is scheduled to begin at 10:00
Convention Paper 6645 (Purchase now)

P2-5 Upwind Leapfrog Schemes in Physical Models with Mixed Modeling Strategies—José Escolano, José J. López, Technical University of Valencia - Valencia, Spain
Block-based physical modeling with mixed modeling strategies is one of the most promising methods for digital sound synthesis. This technique proposes to model and discretize each element individually, and their interaction topology is separately implemented. In this paper the use of the Upwind Leapfrog Scheme or Linear Bicharacteristic Scheme (LBS) is proposed for digital sound synthesis of membranes (2-D), into block-based physical modeling context. It provides an efficient and accurate alternative stencil to the classical leapfrog scheme of the Finite Difference Time Domain (FDTD) method. Moreover, the conversion of dependent wave equation variables into characteristic variables makes this method suitable to interact with wave digital filter models and, then, with other paradigms. This technique is extensively presented and justified with some examples.

Presentation is scheduled to begin at 10:20
Convention Paper 6646 (Purchase now)

P2-6 Simple Modeling of Piano Inharmonicity Due to Soundboard Impedance—Luis Ortiz-Berenguer, Javier Casajus-Quiros, Marisol Torres-Guijarro, Jon Beracoechea, J. Perez-Aranda, Polytechnic University of Madrid - Madrid, Spain
Partials of piano sounds are inharmonic. This inharmonicity is due either to string stiffness and to soundboard impedance. The last has not been widely documented. Two problems arise: to know the value of the impedance and to evaluate the frequency deviation the partial suffers. In this paper that deviation has been calculated either by using Morse’s theoretical equations or by using the authors’ proposed method. To validate results, deviations for some piano notes have been measured. In addition, the soundboard impedance has also been measured to verify the relationship between deviation and impedance. Moreover, a method to evaluate impedance using measured deviations is also proposed. This last method could be useful during training stages in transcription systems and for parameter extraction schemes.

Presentation is scheduled to begin at 10:40
Convention Paper 6647 (Purchase now)

P2-7 Contextual Effects on Sound Quality Judgments: Listening Room and Automotive Environments—Kathryn Beresford, University of Surrey - Guildford, Surrey, UK; Natanya Ford, Harman Becker Automotive Systems - Bridgend, UK; Francis Rumsey, Slawomir Zielinski, University of Surrey - Guildford, Surrey, UK
This study was designed to assess the effect of the listening context on basic audio quality for stimuli with varied mid-range timbral degradations. An assessment of basic audio quality was carried out in two different listening environments: an ITU-R BS.1116 conformant listening room and a stationary vehicle. A group of untrained listeners graded basic audio quality using a novel single stimulus method. The listener population was divided into two subsets—one made evaluations in a listening room and the other in a vehicle. The single stimulus method was investigated as a possible subjective evaluation method for use in automotive environments.

[Associated Poster Presentation in Session P6, Saturday, May 20, at 16:00]

Presentation is scheduled to begin at 11:00
Convention Paper 6648 (Purchase now)

P2-8 Next Generation Automotive Research and Technologies—Brett Crockett, Michael Smithers, Eric Benjamin, Dolby Laboratories - San Francisco, CA, USA
The automobile is quickly becoming a prominent environment for listening to multichannel audio content. As a listening space, the automobile is both interesting and challenging due to its interior structure and materials, its predominant off-axis listening positions, and the amount and variability of background noise. This paper discusses these challenges, describes a number of existing multichannel sound technologies and their applicability to the automotive environment, and presents several novel sound technologies that provide new solutions to some of these challenges. Ongoing challenges and associated automotive sound research being investigated are also presented.

Presentation is scheduled to begin at 11:20
Convention Paper 6649 (Purchase now)