Towards encoding perceptually salient early reflections for parametric spatial audio rendering
×
Cite This
Citation & Abstract
F. Brinkmann, H. Gamper, N. Raghuvanshi, and I. Tashev, "Towards encoding perceptually salient early reflections for parametric spatial audio rendering," Paper 10380, (2020 May.). doi:
F. Brinkmann, H. Gamper, N. Raghuvanshi, and I. Tashev, "Towards encoding perceptually salient early reflections for parametric spatial audio rendering," Paper 10380, (2020 May.). doi:
Abstract: Parametric spatial audio rendering promises fast and perceptually convincing audio cues that remain playback-system agnostic and enable aesthetic modifications of the acoustic experience within games and virtual reality. We propose a parametric encoder for spatial room impulse responses that is tested with nine simulated rooms spanning a large range of sizes and reverberation times. A key component of the pipeline is a perceptually inspired model for determining a minimal set of salient early reflections to reduce computational complexity. The results of a listening study with 27 subjects suggest that rendering six early reflections is indiscernible from a fully-rendered reference for the tested speech content and frequency-independent room simulations based on the image source method. However, the proposed model requires further improvements with respect to detecting and selecting the most-salient early reflections.
@article{brinkmann2020towards,
author={brinkmann, fabian and gamper, hannes and raghuvanshi, nikunj and tashev, ivan},
journal={journal of the audio engineering society},
title={towards encoding perceptually salient early reflections for parametric spatial audio rendering},
year={2020},
volume={},
number={},
pages={},
doi={},
month={may},}
@article{brinkmann2020towards,
author={brinkmann, fabian and gamper, hannes and raghuvanshi, nikunj and tashev, ivan},
journal={journal of the audio engineering society},
title={towards encoding perceptually salient early reflections for parametric spatial audio rendering},
year={2020},
volume={},
number={},
pages={},
doi={},
month={may},
abstract={parametric spatial audio rendering promises fast and perceptually convincing audio cues that remain playback-system agnostic and enable aesthetic modifications of the acoustic experience within games and virtual reality. we propose a parametric encoder for spatial room impulse responses that is tested with nine simulated rooms spanning a large range of sizes and reverberation times. a key component of the pipeline is a perceptually inspired model for determining a minimal set of salient early reflections to reduce computational complexity. the results of a listening study with 27 subjects suggest that rendering six early reflections is indiscernible from a fully-rendered reference for the tested speech content and frequency-independent room simulations based on the image source method. however, the proposed model requires further improvements with respect to detecting and selecting the most-salient early reflections.},}
TY - paper
TI - Towards encoding perceptually salient early reflections for parametric spatial audio rendering
SP -
EP -
AU - Brinkmann, Fabian
AU - Gamper, Hannes
AU - Raghuvanshi, Nikunj
AU - Tashev, Ivan
PY - 2020
JO - Journal of the Audio Engineering Society
IS -
VO -
VL -
Y1 - May 2020
TY - paper
TI - Towards encoding perceptually salient early reflections for parametric spatial audio rendering
SP -
EP -
AU - Brinkmann, Fabian
AU - Gamper, Hannes
AU - Raghuvanshi, Nikunj
AU - Tashev, Ivan
PY - 2020
JO - Journal of the Audio Engineering Society
IS -
VO -
VL -
Y1 - May 2020
AB - Parametric spatial audio rendering promises fast and perceptually convincing audio cues that remain playback-system agnostic and enable aesthetic modifications of the acoustic experience within games and virtual reality. We propose a parametric encoder for spatial room impulse responses that is tested with nine simulated rooms spanning a large range of sizes and reverberation times. A key component of the pipeline is a perceptually inspired model for determining a minimal set of salient early reflections to reduce computational complexity. The results of a listening study with 27 subjects suggest that rendering six early reflections is indiscernible from a fully-rendered reference for the tested speech content and frequency-independent room simulations based on the image source method. However, the proposed model requires further improvements with respect to detecting and selecting the most-salient early reflections.
Parametric spatial audio rendering promises fast and perceptually convincing audio cues that remain playback-system agnostic and enable aesthetic modifications of the acoustic experience within games and virtual reality. We propose a parametric encoder for spatial room impulse responses that is tested with nine simulated rooms spanning a large range of sizes and reverberation times. A key component of the pipeline is a perceptually inspired model for determining a minimal set of salient early reflections to reduce computational complexity. The results of a listening study with 27 subjects suggest that rendering six early reflections is indiscernible from a fully-rendered reference for the tested speech content and frequency-independent room simulations based on the image source method. However, the proposed model requires further improvements with respect to detecting and selecting the most-salient early reflections.
Authors:
Brinkmann, Fabian; Gamper, Hannes; Raghuvanshi, Nikunj; Tashev, Ivan
Affiliations:
Audio Communication Group, TU Berlin; Microsoft Research Redmond, WA, USA; Microsoft Research Redmond, WA, USA; Microsoft Research Redmond, WA, USA(See document for exact affiliation information.)
AES Convention:
148 (May 2020)
Paper Number:
10380
Publication Date:
May 28, 2020Import into BibTeX
Subject:
Room Acoustics
Permalink:
http://www.aes.org/e-lib/browse.cfm?elib=20797