Knowledge of room geometry is a fundamental component for modeling acoustic environments. Since most common methods for room geometry estimation are based on prior knowledge, the generalization to unknown environments is somewhat limited. Deep learning based approaches have delivered promising results for the blind estimation of acoustic parameters considering mainly monaural signals. The purpose of this contribution is to investigate the effect of multichannel higher-order Ambisonics (HOA) signals on the performance of a convolutional recurrent neural network for blind room geometry estimation. Therefore a HOA-dataset of noisy speech signals in simulated rooms with realistic frequency-dependent reflection coefficients is introduced. Results show that for each additional Ambisonics order the estimation performance increases with the fourth-order model achieving a mean absolute error of 1.24 m averaged over all three room dimensions.
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.