Optimizations of the Spatial Decomposition Method for Binaural Reproduction
×
Cite This
Citation & Abstract
SE. V.. Amengual Garí, JO. M.. Arend, PA. T.. Calamia, and PH. W.. Robinson, "Optimizations of the Spatial Decomposition Method for Binaural Reproduction," J. Audio Eng. Soc., vol. 68, no. 12, pp. 959-976, (2020 December.). doi: https://doi.org/10.17743/jaes.2020.0063
SE. V.. Amengual Garí, JO. M.. Arend, PA. T.. Calamia, and PH. W.. Robinson, "Optimizations of the Spatial Decomposition Method for Binaural Reproduction," J. Audio Eng. Soc., vol. 68 Issue 12 pp. 959-976, (2020 December.). doi: https://doi.org/10.17743/jaes.2020.0063
Abstract: The spatial decomposition method (SDM) can be used to parameterize and reproduce a sound field based on measured multichannel room impulse responses (RIRs). In this paper we propose optimizations of SDM to address the following questions and issues that have recently emerged in the development of the method: (a) accuracy in direction-of-arrival (DOA) estimation with open microphone arrays utilizing time differences of arrival as well as with B-format arrays using pseudo-intensity vectors; (b) optimal array size and temporal processing window size for broadband DOA estimation based on open microphone arrays; (c) spatial and spectral distortion of single events caused by unstable DOA estimation; and (d) spectral whitening of late reverberation as a consequence of rapidly varying DOA estimates. Through simulations we analyze DOA estimation accuracy (a) and explore processing parameters (b) in search of optimal settings. To overcome the unnatural DOA spread (c), we introduce spatial quantization of the DOA as a post-processing step at the expense of spatial distortion for successive reflections. To address the spectral whitening (d), we propose an equalization approach specifically designed for rendering SDM data directly to binaural signals with a spatially dense HRTF dataset. Finally, through perceptual experiments, we evaluate the proposed equalization and investigate the consequences of quantizing the spatial information of SDM auralizations by directly comparing binaural renderings with real loudspeakers. The proposed improvements for binaural rendering are released in an open source repository.
@article{amengual garí2021optimizations,
author={amengual garí, sebastià v. and arend, johannes m. and calamia, paul t. and robinson, philip w.},
journal={journal of the audio engineering society},
title={optimizations of the spatial decomposition method for binaural reproduction},
year={2021},
volume={68},
number={12},
pages={959-976},
doi={https://doi.org/10.17743/jaes.2020.0063},
month={december},}
@article{amengual garí2021optimizations,
author={amengual garí, sebastià v. and arend, johannes m. and calamia, paul t. and robinson, philip w.},
journal={journal of the audio engineering society},
title={optimizations of the spatial decomposition method for binaural reproduction},
year={2021},
volume={68},
number={12},
pages={959-976},
doi={https://doi.org/10.17743/jaes.2020.0063},
month={december},
abstract={the spatial decomposition method (sdm) can be used to parameterize and reproduce a sound field based on measured multichannel room impulse responses (rirs). in this paper we propose optimizations of sdm to address the following questions and issues that have recently emerged in the development of the method: (a) accuracy in direction-of-arrival (doa) estimation with open microphone arrays utilizing time differences of arrival as well as with b-format arrays using pseudo-intensity vectors; (b) optimal array size and temporal processing window size for broadband doa estimation based on open microphone arrays; (c) spatial and spectral distortion of single events caused by unstable doa estimation; and (d) spectral whitening of late reverberation as a consequence of rapidly varying doa estimates. through simulations we analyze doa estimation accuracy (a) and explore processing parameters (b) in search of optimal settings. to overcome the unnatural doa spread (c), we introduce spatial quantization of the doa as a post-processing step at the expense of spatial distortion for successive reflections. to address the spectral whitening (d), we propose an equalization approach specifically designed for rendering sdm data directly to binaural signals with a spatially dense hrtf dataset. finally, through perceptual experiments, we evaluate the proposed equalization and investigate the consequences of quantizing the spatial information of sdm auralizations by directly comparing binaural renderings with real loudspeakers. the proposed improvements for binaural rendering are released in an open source repository.},}
TY - paper
TI - Optimizations of the Spatial Decomposition Method for Binaural Reproduction
SP - 959
EP - 976
AU - Amengual Garí, Sebastià V.
AU - Arend, Johannes M.
AU - Calamia, Paul T.
AU - Robinson, Philip W.
PY - 2021
JO - Journal of the Audio Engineering Society
IS - 12
VO - 68
VL - 68
Y1 - December 2020
TY - paper
TI - Optimizations of the Spatial Decomposition Method for Binaural Reproduction
SP - 959
EP - 976
AU - Amengual Garí, Sebastià V.
AU - Arend, Johannes M.
AU - Calamia, Paul T.
AU - Robinson, Philip W.
PY - 2021
JO - Journal of the Audio Engineering Society
IS - 12
VO - 68
VL - 68
Y1 - December 2020
AB - The spatial decomposition method (SDM) can be used to parameterize and reproduce a sound field based on measured multichannel room impulse responses (RIRs). In this paper we propose optimizations of SDM to address the following questions and issues that have recently emerged in the development of the method: (a) accuracy in direction-of-arrival (DOA) estimation with open microphone arrays utilizing time differences of arrival as well as with B-format arrays using pseudo-intensity vectors; (b) optimal array size and temporal processing window size for broadband DOA estimation based on open microphone arrays; (c) spatial and spectral distortion of single events caused by unstable DOA estimation; and (d) spectral whitening of late reverberation as a consequence of rapidly varying DOA estimates. Through simulations we analyze DOA estimation accuracy (a) and explore processing parameters (b) in search of optimal settings. To overcome the unnatural DOA spread (c), we introduce spatial quantization of the DOA as a post-processing step at the expense of spatial distortion for successive reflections. To address the spectral whitening (d), we propose an equalization approach specifically designed for rendering SDM data directly to binaural signals with a spatially dense HRTF dataset. Finally, through perceptual experiments, we evaluate the proposed equalization and investigate the consequences of quantizing the spatial information of SDM auralizations by directly comparing binaural renderings with real loudspeakers. The proposed improvements for binaural rendering are released in an open source repository.
The spatial decomposition method (SDM) can be used to parameterize and reproduce a sound field based on measured multichannel room impulse responses (RIRs). In this paper we propose optimizations of SDM to address the following questions and issues that have recently emerged in the development of the method: (a) accuracy in direction-of-arrival (DOA) estimation with open microphone arrays utilizing time differences of arrival as well as with B-format arrays using pseudo-intensity vectors; (b) optimal array size and temporal processing window size for broadband DOA estimation based on open microphone arrays; (c) spatial and spectral distortion of single events caused by unstable DOA estimation; and (d) spectral whitening of late reverberation as a consequence of rapidly varying DOA estimates. Through simulations we analyze DOA estimation accuracy (a) and explore processing parameters (b) in search of optimal settings. To overcome the unnatural DOA spread (c), we introduce spatial quantization of the DOA as a post-processing step at the expense of spatial distortion for successive reflections. To address the spectral whitening (d), we propose an equalization approach specifically designed for rendering SDM data directly to binaural signals with a spatially dense HRTF dataset. Finally, through perceptual experiments, we evaluate the proposed equalization and investigate the consequences of quantizing the spatial information of SDM auralizations by directly comparing binaural renderings with real loudspeakers. The proposed improvements for binaural rendering are released in an open source repository.
Open Access
Authors:
Amengual Garí, Sebastià V.; Arend, Johannes M.; Calamia, Paul T.; Robinson, Philip W.
Affiliations:
Facebook Reality Labs Research, Redmond, WA; Facebook Reality Labs Research, Redmond, WA; TH Köln - University of Applied Sciences, Cologne, Germany; Facebook Reality Labs Research, Redmond, WA; Facebook Reality Labs Research, Redmond, WA(See document for exact affiliation information.) JAES Volume 68 Issue 12 pp. 959-976; December 2020
Publication Date:
January 11, 2021Import into BibTeX
Permalink:
http://www.aes.org/e-lib/browse.cfm?elib=21010