Assessing Spherical Harmonics Interpolation of Time-Aligned Head-Related Transfer Functions
×
Cite This
Citation & Abstract
JO. M.. Arend, F. Brinkmann, and C. Pörschmann, "Assessing Spherical Harmonics Interpolation of Time-Aligned Head-Related Transfer Functions," J. Audio Eng. Soc., vol. 69, no. 1/2, pp. 104-117, (2021 January.). doi: https://doi.org/10.17743/jaes.2020.0070
JO. M.. Arend, F. Brinkmann, and C. Pörschmann, "Assessing Spherical Harmonics Interpolation of Time-Aligned Head-Related Transfer Functions," J. Audio Eng. Soc., vol. 69 Issue 1/2 pp. 104-117, (2021 January.). doi: https://doi.org/10.17743/jaes.2020.0070
Abstract: High-quality spatial audio reproduction over headphones requires head-related transfer functions (HRTFs) with high spatial resolution. However, acquiring datasets with a large number of (individual) HRTFs is not always possible, and using large datasets can be problematic for real-time applications with limited resources. Consequently, interpolation methods for sparsely sampled HRTFs are of great interest, with spherical harmonics (SH) interpolation becoming increasingly popular. However, the SH representation of sparse HRTFs suffers from spatial aliasing and order truncation errors. To mitigate this, preprocessing methods have been introduced that time-align the sparse HRTFs before SH interpolation. This reduces the effective SH order and thus the number of HRTFs required for SH interpolation. In this paper, we present a physical evaluation of four state-of-the-art preprocessing methods, which showed very similar performance of the methods with notable differences only at low SH orders and contralateral HRTFs. We also performed a listening experiment with one selected method to determine the minimum required SH order required for perceptually transparent interpolation. For the selected method, a sparse HRTF set of order N ˜ 7 is sufficient for interpolating a frontal source presenting speech or percussion. Higher orders are, however, required for a lateral source and noise.
@article{arend2021assessing,
author={arend, johannes m. and brinkmann, fabian and pörschmann, christoph},
journal={journal of the audio engineering society},
title={assessing spherical harmonics interpolation of time-aligned head-related transfer functions},
year={2021},
volume={69},
number={1/2},
pages={104-117},
doi={https://doi.org/10.17743/jaes.2020.0070},
month={january},}
@article{arend2021assessing,
author={arend, johannes m. and brinkmann, fabian and pörschmann, christoph},
journal={journal of the audio engineering society},
title={assessing spherical harmonics interpolation of time-aligned head-related transfer functions},
year={2021},
volume={69},
number={1/2},
pages={104-117},
doi={https://doi.org/10.17743/jaes.2020.0070},
month={january},
abstract={high-quality spatial audio reproduction over headphones requires head-related transfer functions (hrtfs) with high spatial resolution. however, acquiring datasets with a large number of (individual) hrtfs is not always possible, and using large datasets can be problematic for real-time applications with limited resources. consequently, interpolation methods for sparsely sampled hrtfs are of great interest, with spherical harmonics (sh) interpolation becoming increasingly popular. however, the sh representation of sparse hrtfs suffers from spatial aliasing and order truncation errors. to mitigate this, preprocessing methods have been introduced that time-align the sparse hrtfs before sh interpolation. this reduces the effective sh order and thus the number of hrtfs required for sh interpolation. in this paper, we present a physical evaluation of four state-of-the-art preprocessing methods, which showed very similar performance of the methods with notable differences only at low sh orders and contralateral hrtfs. we also performed a listening experiment with one selected method to determine the minimum required sh order required for perceptually transparent interpolation. for the selected method, a sparse hrtf set of order n ˜ 7 is sufficient for interpolating a frontal source presenting speech or percussion. higher orders are, however, required for a lateral source and noise.},}
TY - paper
TI - Assessing Spherical Harmonics Interpolation of Time-Aligned Head-Related Transfer Functions
SP - 104
EP - 117
AU - Arend, Johannes M.
AU - Brinkmann, Fabian
AU - Pörschmann, Christoph
PY - 2021
JO - Journal of the Audio Engineering Society
IS - 1/2
VO - 69
VL - 69
Y1 - January 2021
TY - paper
TI - Assessing Spherical Harmonics Interpolation of Time-Aligned Head-Related Transfer Functions
SP - 104
EP - 117
AU - Arend, Johannes M.
AU - Brinkmann, Fabian
AU - Pörschmann, Christoph
PY - 2021
JO - Journal of the Audio Engineering Society
IS - 1/2
VO - 69
VL - 69
Y1 - January 2021
AB - High-quality spatial audio reproduction over headphones requires head-related transfer functions (HRTFs) with high spatial resolution. However, acquiring datasets with a large number of (individual) HRTFs is not always possible, and using large datasets can be problematic for real-time applications with limited resources. Consequently, interpolation methods for sparsely sampled HRTFs are of great interest, with spherical harmonics (SH) interpolation becoming increasingly popular. However, the SH representation of sparse HRTFs suffers from spatial aliasing and order truncation errors. To mitigate this, preprocessing methods have been introduced that time-align the sparse HRTFs before SH interpolation. This reduces the effective SH order and thus the number of HRTFs required for SH interpolation. In this paper, we present a physical evaluation of four state-of-the-art preprocessing methods, which showed very similar performance of the methods with notable differences only at low SH orders and contralateral HRTFs. We also performed a listening experiment with one selected method to determine the minimum required SH order required for perceptually transparent interpolation. For the selected method, a sparse HRTF set of order N ˜ 7 is sufficient for interpolating a frontal source presenting speech or percussion. Higher orders are, however, required for a lateral source and noise.
High-quality spatial audio reproduction over headphones requires head-related transfer functions (HRTFs) with high spatial resolution. However, acquiring datasets with a large number of (individual) HRTFs is not always possible, and using large datasets can be problematic for real-time applications with limited resources. Consequently, interpolation methods for sparsely sampled HRTFs are of great interest, with spherical harmonics (SH) interpolation becoming increasingly popular. However, the SH representation of sparse HRTFs suffers from spatial aliasing and order truncation errors. To mitigate this, preprocessing methods have been introduced that time-align the sparse HRTFs before SH interpolation. This reduces the effective SH order and thus the number of HRTFs required for SH interpolation. In this paper, we present a physical evaluation of four state-of-the-art preprocessing methods, which showed very similar performance of the methods with notable differences only at low SH orders and contralateral HRTFs. We also performed a listening experiment with one selected method to determine the minimum required SH order required for perceptually transparent interpolation. For the selected method, a sparse HRTF set of order N ˜ 7 is sufficient for interpolating a frontal source presenting speech or percussion. Higher orders are, however, required for a lateral source and noise.
Open Access
Authors:
Arend, Johannes M.; Brinkmann, Fabian; Pörschmann, Christoph
Affiliations:
TH Köln – University of Applied Sciences, Cologne, Germany; Technical University of Berlin, Berlin, Germany; Technical University of Berlin, Berlin, Germany; TH Köln – University of Applied Sciences, Cologne, Germany(See document for exact affiliation information.) JAES Volume 69 Issue 1/2 pp. 104-117; January 2021
Publication Date:
February 19, 2021Import into BibTeX
Permalink:
http://www.aes.org/e-lib/browse.cfm?elib=21019