AES E-Library

AES E-Library

Loudspeaker position identification using human speech directivity index

Document Thumbnail

A regular user of a multichannel loudspeaker system in typical living rooms sets the loudspeakers in a non-uniform manner, with angles and distances that don’t necessarily follow the recommended ITU-R BS.2159-4 standard. Assuming a multichannel audio system equipped with N number of loudspeakers and M very near-field (NF) microphones attached to each loudspeaker, the user location with respect to the loudspeakers can be estimated by utilizing a supervised machine learning (ML) model. Two neural networks (NN) were trained with the human speech directivity index (DI) computed by room simulations, where the sound source was the typical directivity radiation pattern of human speech, and the receivers were the NF microphones attached to the loudspeakers. The distances between loudspeakers and the DI data was combined as input for the two NN models. One network was dedicated to estimate distances from loudspeaker to user, and the other network was dedicated to the angle estimation. The results shown a 95% confidence interval (CI) of ±1.7 cm and a CI of ±7 degrees for the incidence angle.

Open Access

Open
Access

Authors:
Affiliations:
Express Paper 138; AES Convention 155; October 2023
Publication Date:
Subject:
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=22292


Download Now (1.2 MB)

This paper is Open Access which means you can download it for free.

Learn more about the AES E-Library

E-Library Location:

Start a discussion about this Immersive & Spatial Audio!


AES - Audio Engineering Society