AES Munich 2009
Poster Session P23
Psychoacoustics and Perception
Saturday, May 9, 16:30 — 18:00
P23-1 Influence of the Listening Room in the Perception of a Musical Work—Nelia Valverde, Marcos D. Fernández, José Antonio Ballesteros, Leticia Martínez, Samuel Quintana, Isabel González, Escuela Universitaria Politénica de Cuenca - Cuenca, Spain
The listening of the same musical composition generates a unique perception for every listener but, simultaneously, the specific acoustic conditions of the chosen room have a decisive influence on the perception. In order to evaluate such differences depending on the listening room, a musical work for choir has been composed and recorded with a HATS in an anechoic room, in a reverberant room, and in a normal room. With those records, surveys to professional musicians and non-expert listeners have been carried out, once they have previously heard the recording with headphones, and finally, the answers obtained have been evaluated in order to determine the influence of the listening room in the perception of the musical work.
Convention Paper 7775 (Purchase now)
P23-2 Comparison of Methods for Measuring Sound Quality through HATS and Binaural Microphones—José Antonio Ballesteros, Marcos D. Fernández, Samuel Quintana, Isabel González, Laura Rodríguez, Escuela Universitaria Politécnica de Cuenca, Universidad de Castilla-La Mancha - Cuenca, Spain
Sound quality techniques are currently becoming more important as they take into account the human perception of sound. By now, there is no well established international standards for measuring sound quality and no well recognizable reference index for its assessment. Then, a HATS or a pair of binaural microphones can be used for measuring the typical sound quality parameters. A set of measurements, under the same condition, has been carried out using both devices for assessing the differences and the possible variation in the results. As a consequence of all of this, guidance is given for choosing the device that best fits depending on each measurement context.
Convention Paper 7776 (Purchase now)
P23-3 Improving Perceived Tempo Estimation by Statistical Modeling of Higher-Level Musical Descriptors—Ching-Wei Chen, Markus Cremer, Kyogu Lee, Peter DiMaria, Ho-Hsiang Wu, Gracenote, Inc. - Emeryville, CA, USA
Conventional tempo estimation algorithms generally work by detecting significant audio events and finding periodicities of repetitive patterns in an audio signal. However, human perception of tempo is subjective and relies on a far richer set of information, causing many tempo estimation algorithms to suffer from octave errors, or “double/half-time” confusion. In this paper we propose a system that uses higher-level musical descriptors such as mood to train a statistical model of perceived tempo classes, which can then be used to correct the estimate from a conventional tempo estimation algorithm. Our experimental results show reliable classification of perceived tempo class, as well as a significant reduction of octave errors when applied to an array of available tempo estimation algorithms.
Convention Paper 7777 (Purchase now)
P23-4 Perceptually-Motivated Audio Morphing: Softness—Duncan Williams, Tim Brookes, University of Surrey - Guildford, Surrey, UK
A system for morphing the softness and brightness of two sounds independently from their other perceptual or acoustic attributes was coded. The system is an extension of a previous one that morphed brightness only, that was based on the Spectral Modeling Synthesis additive/residual model. A Multidimensional Scaling analysis, of listener responses to paired comparisons of stimuli generated by the morpher, showed movement in three perceptually-orthogonal directions. These directions were labeled in a subsequent verbal elicitation experiment that found that the effects of the brightness and softness controls were perceived as intended. A Timbre Morpher, adjusting additional timbral attributes with perceptually-meaningful controls, can now be considered for further work.
Convention Paper 7778 (Purchase now)
P23-5 Resolution of Spatial Distribution Perception with Distributed Sound Source in Anechoic Conditions—Olli Santala, Ville Pulkki, Helsinki University of Technology - Espoo, Finland
The resolution of directional perception of spatially distributed sound sources was investigated with a listening test in an anechoic chamber using various sound source distributions. Fifteen loudspeakers were used to produce test cases that included sound sources with varying widths and wide sound sources with gaps in the distribution. The subjects were asked to distinguish which loudspeakers emitted sound according to their own perception. Results show that small gaps in the sound source were not perceived accurately and wide sound sources were perceived narrower than they actually were. The results also indicate that the resolution for fine spatial details was worse than 15 degrees when the sound source was wide.
Convention Paper 7779 (Purchase now)
P23-6 Perceived Roughness—A Recent Psychoacoustic Measurement—Robert Mores, Thorsten Smit, Jana-Marie Wiese, University of Applied Science - Hamburg, Germany
This paper relates to an investigation on perceived roughness from Aures in 1984 where findings are based on psychoacoustic tests with synthetic sounds and a small group of people. The related results have repeatedly been used for modeling roughness perception since then, for instance in the context of noise perception. Roughness is again an issue when investigating the perceived quality or timbre of musical sounds. In this context roughness is one among some ten mid-level features to be extracted. Here, perceived roughness is measured again, but on a wider basis than in the earlier investigation. This paper outlines the psychoacoustic investigation, basically following the method of Aures, but modifying some of the issues under question. The results are reasonable and differ from the earlier findings in various aspects.
Convention Paper 7780 (Purchase now)
P23-7 A Physiological Auditory Model—Václav Vencovsky, Czech Technical University in Prague - Prague, Czech Republic
A physiological auditory model is described. The model simulates a processing of a sound by an outer, middle, and inner ear. A nonlinear inner ear model comprises the cochlear frequency selectivity model and the inner hair cells model proposed according to mammalian physiological data. A capability of the auditory model to simulate human psychophysical masking data is verified.
Convention Paper 7781 (Purchase now)