AES Rome 2013
Poster Session P15
P15 - Spatial Audio
Monday, May 6, 15:00 — 16:30 (Foyer)
P15-1 Intelligent Acoustic Interfaces for Immersive Audio—Danilo Comminiello, Sapienza University of Rome - Rome, Italy; Michele Scarpiniti, Sapienza University of Rome - Rome, Italy; Raffaele Parisi, Sapienza University of Rome - Rome, Italy; Aurelio Uncini, Sapienza University of Rome - Rome, Italy
Oncoming audio technologies privilege the perceptive quality of audio signals, thus offering users an immersive audio experience, which involves listening and acquisition of audio signals. In such a scenario a fundamental role is played by intelligent acoustic interfaces that aim at acquiring audio information, processing it, and returning the processed information under the fulfillment of quality requirements demanded by users. In this paper we introduce intelligent acoustic interfaces for immersive audio experience and we prove their effectiveness within the context of immersive speech communications. In particular, we introduce an intelligent acoustic interface composed of a combined adaptive beamforming scheme in conjunction with a microphone array, which is able to enhance the processed signals in immersive scenarios.
Convention Paper 8898 (Purchase now)
P15-2 The Effects of Spatial Depth in the Combinations of 3-D Imagery and 7-Channel Surround with Height Channels—Toru Kamekawa, Tokyo University of the Arts - Tokyo, Japan; Atsushi Marui, Tokyo University of the Arts - Adachi-ku, Tokyo, Japan; Toshihiko Date, AVC Networks Company, Panasonic Corporation - Osaka, Japan; Masaaki Enatsu, marimoRECORDS, Inc., - Tokyo, Japan
The effect of the speakers of the height direction in a 3-D imagery focused on the spatial depth were studied and conducted. The first experiment was carried out using a method of magnitude estimation asking how near or far the combination of perceived visual and auditory event is. In the second experiment, the subjects were asked to rate on suitability of the sound to the image using the same materials as the previous experiment. The results show that 7ch and 5ch surround were felt closely and 2ch stereo was felt far under the condition that there is no image. Regarding suitability of sound to an image, 3-D imagery with 7ch surround gives higher score in the near distance.
Convention Paper 8899 (Purchase now)
P15-3 Comparative Analysis on Compact Representation for Spatial Variation of Individual Head-Related Transfer Functions Based on Singular Value Decomposition—Shouichi Takane, Akita Prefectural University - Yurihonjo, Akita, Japan
In this paper the compact representation of the head-related transfer functions (HRTFs) or the Head-Related Impulse Responses (HRIRs) based on singular value decomposition (SVD) was investigated, focusing on the difference in the parameters to construct the average vectors and the covariance matrices in two points. One of them is on what the derived eigenvectors reflect the properties of HRTFs and/or HRIRs. As a result, high correlation between the parameters concerning SVD and the amplitude of HRTFs was found, and the high correlation was obtained in the wide region except the contralateral side. The second investigation is on the required number of the HRTFs or the HRIRs to construct the average vectors and the covariance matrices. It was found that the number of HRTFs are decreased to about 1/4 of the whole directions for the used HRTFs. Among three conditions of the HRIRs, the amplitude of HRTFs, and the log-amplitude of HRTFs, it was also shown that the amplitude of HRTFs is the most effective to construct the parameters of the SVD.
Convention Paper 8900 (Purchase now)
P15-4 Calculation of Individualized Near-Field Head-Related Transfer Function Database Using Boundary Element Method—Yuanqing Rui, South China University of Technology - Guangzhou, China; Guangzheng Yu, South China University of Technology - Guangzhou, Guangdong, China; Bosun Xie, South China University of Technology - Guangzhou, China; Yu Liu, South China University of Technology - Guangzhou, China
Measurement is a common method to obtain the far-field HRTFs. Due to the difficulties in measurement, near-field HRTF databases for an artificial head are rare and an individualized database for human subjects is now unavailable. The present work adopts a laser 3-D scanner to acquire geometrical surfaces of human subjects and then uses boundary element methods to calculate the near-field HRTFs. At last, an individualized near-field HRTF database with 56 human subjects is established. To evaluate the accuracy of the database, the HRTFs for KEMAR are also calculated and compared to the measured ones.
Convention Paper 8901 (Purchase now)
P15-5 A Standardized Repository of Head-Related and Headphone Impulse Response Data—Michele Geronazzo, University of Padova - Padova, Italy; Fabrizio Granza, University of Padova - Padova, Italy; Simone Spagnol, University of Padova - Padova, Italy; Federico Avanzini, University of Padova - Padova, Italy
This paper proposes a repository for storing full- and partial-body Head-Related Impulse Responses (HRIRs/pHRIRs) and Headphone Impulse Responses (HpIRs) from several databases in a standardized environment. The main differences among the available databases concern coordinate systems, sound source stimuli, sampling frequencies, and other important specifications. The repository is organized so as to consider all these differences. The structure of our repository is an improvement with respect to the MARL-NYU data format, born as an attempt to unify HRIR databases. The introduced information supports flexible analysis and synthesis processes and robust headphone equalization.
Convention Paper 8902 (Purchase now)
P15-6 Influence of Different Microphone Arrays on IACC as an Objective Measure Of Spaciousness—Marco Conceição, Trinity College - Dublin, Ireland; Instituto Politécnico do Porto - Porto, Portugal; Dermot Furlong, Trinity College - Dublin, Ireland
Inter-Aural Cross Correlation measurements are used as physical measures that relate to listener spaciousness experience in a comparative study of the influence on spaciousness of different microphone arrays, thus allowing an objective approach to be adopted in the exploration of how microphone arrays affect the perceived spaciousness for stereo and surround sound reconstructions. The different microphone arrays recorded simulated direct and indirect sound components. The recorded signals were played back in three different rooms and IACC measurements were made for the reconstructed sound fields using a dummy head microphone system. The results achieved show how microphone array details influence the IACC peak and lead to a better understanding of how spaciousness can be controlled for 2 channel stereo and 5.1 presentations. Parametric variation of microphone arrays can therefore be employed to facilitate spaciousness control for reconstructed sound fields.
Convention Paper 8885 (Purchase now)