A Method for Spatial Upsampling of Voice Directivity by Directional Equalization
×
Cite This
Citation & Abstract
C. Pörschmann, and JO. M.. Arend, "A Method for Spatial Upsampling of Voice Directivity by Directional Equalization," J. Audio Eng. Soc., vol. 68, no. 9, pp. 649-663, (2020 September.). doi: https://doi.org/10.17743/jaes.2020.0033
C. Pörschmann, and JO. M.. Arend, "A Method for Spatial Upsampling of Voice Directivity by Directional Equalization," J. Audio Eng. Soc., vol. 68 Issue 9 pp. 649-663, (2020 September.). doi: https://doi.org/10.17743/jaes.2020.0033
Abstract: To describe the sound radiation of the human voice into all directions, measurements need to be performed on a spherical grid. However, the resolution of such captured directivity patterns is limited and methods for spatial upsampling are required, for example by interpolation in the spherical harmonics (SH) domain. As the number of measurement directions limits the resolvable SH order, the directivity pattern suffers from spatial aliasing and order-truncation errors. We present an approach for spatial upsampling of voice directivity by spatial equalization. It is based on preprocessing, which equalizes the sparse directivity pattern by spectral division with corresponding directional rigid sphere transfer functions, resulting in a time-aligned and spectrally matched directivity pattern that has a significantly reduced spatial complexity. The directivity pattern is then transformed into the SH domain, interpolated to a dense grid by an inverse spherical Fourier transform and subsequently de-equalized by spectral multiplication with corresponding rigid sphere transfer functions. Based on measurements of a dummy head with an integrated mouth simulator, we compare this approach to reference measurements on a dense grid. The results show that the method significantly decreases errors of spatial undersampling and this allows a meaningful high-resolution voice directivity to be determined from sparse measurements.
@article{pörschmann2020a,
author={pörschmann, christoph and arend, johannes m.},
journal={journal of the audio engineering society},
title={a method for spatial upsampling of voice directivity by directional equalization},
year={2020},
volume={68},
number={9},
pages={649-663},
doi={https://doi.org/10.17743/jaes.2020.0033},
month={september},}
@article{pörschmann2020a,
author={pörschmann, christoph and arend, johannes m.},
journal={journal of the audio engineering society},
title={a method for spatial upsampling of voice directivity by directional equalization},
year={2020},
volume={68},
number={9},
pages={649-663},
doi={https://doi.org/10.17743/jaes.2020.0033},
month={september},
abstract={to describe the sound radiation of the human voice into all directions, measurements need to be performed on a spherical grid. however, the resolution of such captured directivity patterns is limited and methods for spatial upsampling are required, for example by interpolation in the spherical harmonics (sh) domain. as the number of measurement directions limits the resolvable sh order, the directivity pattern suffers from spatial aliasing and order-truncation errors. we present an approach for spatial upsampling of voice directivity by spatial equalization. it is based on preprocessing, which equalizes the sparse directivity pattern by spectral division with corresponding directional rigid sphere transfer functions, resulting in a time-aligned and spectrally matched directivity pattern that has a significantly reduced spatial complexity. the directivity pattern is then transformed into the sh domain, interpolated to a dense grid by an inverse spherical fourier transform and subsequently de-equalized by spectral multiplication with corresponding rigid sphere transfer functions. based on measurements of a dummy head with an integrated mouth simulator, we compare this approach to reference measurements on a dense grid. the results show that the method significantly decreases errors of spatial undersampling and this allows a meaningful high-resolution voice directivity to be determined from sparse measurements.},}
TY - paper
TI - A Method for Spatial Upsampling of Voice Directivity by Directional Equalization
SP - 649
EP - 663
AU - Pörschmann, Christoph
AU - Arend, Johannes M.
PY - 2020
JO - Journal of the Audio Engineering Society
IS - 9
VO - 68
VL - 68
Y1 - September 2020
TY - paper
TI - A Method for Spatial Upsampling of Voice Directivity by Directional Equalization
SP - 649
EP - 663
AU - Pörschmann, Christoph
AU - Arend, Johannes M.
PY - 2020
JO - Journal of the Audio Engineering Society
IS - 9
VO - 68
VL - 68
Y1 - September 2020
AB - To describe the sound radiation of the human voice into all directions, measurements need to be performed on a spherical grid. However, the resolution of such captured directivity patterns is limited and methods for spatial upsampling are required, for example by interpolation in the spherical harmonics (SH) domain. As the number of measurement directions limits the resolvable SH order, the directivity pattern suffers from spatial aliasing and order-truncation errors. We present an approach for spatial upsampling of voice directivity by spatial equalization. It is based on preprocessing, which equalizes the sparse directivity pattern by spectral division with corresponding directional rigid sphere transfer functions, resulting in a time-aligned and spectrally matched directivity pattern that has a significantly reduced spatial complexity. The directivity pattern is then transformed into the SH domain, interpolated to a dense grid by an inverse spherical Fourier transform and subsequently de-equalized by spectral multiplication with corresponding rigid sphere transfer functions. Based on measurements of a dummy head with an integrated mouth simulator, we compare this approach to reference measurements on a dense grid. The results show that the method significantly decreases errors of spatial undersampling and this allows a meaningful high-resolution voice directivity to be determined from sparse measurements.
To describe the sound radiation of the human voice into all directions, measurements need to be performed on a spherical grid. However, the resolution of such captured directivity patterns is limited and methods for spatial upsampling are required, for example by interpolation in the spherical harmonics (SH) domain. As the number of measurement directions limits the resolvable SH order, the directivity pattern suffers from spatial aliasing and order-truncation errors. We present an approach for spatial upsampling of voice directivity by spatial equalization. It is based on preprocessing, which equalizes the sparse directivity pattern by spectral division with corresponding directional rigid sphere transfer functions, resulting in a time-aligned and spectrally matched directivity pattern that has a significantly reduced spatial complexity. The directivity pattern is then transformed into the SH domain, interpolated to a dense grid by an inverse spherical Fourier transform and subsequently de-equalized by spectral multiplication with corresponding rigid sphere transfer functions. Based on measurements of a dummy head with an integrated mouth simulator, we compare this approach to reference measurements on a dense grid. The results show that the method significantly decreases errors of spatial undersampling and this allows a meaningful high-resolution voice directivity to be determined from sparse measurements.
Authors:
Pörschmann, Christoph; Arend, Johannes M.
Affiliations:
Institute of Communications Engineering, TH Köln - University of Applied Sciences, 50679 Cologne, Germany; Audio Communication Group, Technical University of Berlin, 10587 Berlin, Germany(See document for exact affiliation information.) JAES Volume 68 Issue 9 pp. 649-663; September 2020
Publication Date:
September 30, 2020Import into BibTeX
Permalink:
http://www.aes.org/e-lib/browse.cfm?elib=20896