On the Use of Bottleneck Features of CNN Auto-Encoder for Personalized HRTFs

Lee, Geon Woo; Moon, Jung Min; Chun, Chan Jun; Kim, Hong Kook

AES E-Library

On the Use of Bottleneck Features of CNN Auto-Encoder for Personalized HRTFs

The most effective way of providing immersive sound effects is to use head-related transfer functions (HRTFs). HRTFs are defined by the path from a given sound source to the listener's ears. However, sound propagation by HRTFs differs slightly between people because the head, body, and ears differ for each person. Recently, a method for estimating HRTFs using a neural network has been developed, where anthropometric pinna measurements and head-related impulse responses (HRIRs) are used as the input and output layer of the neural network. However, it is inefficient to accurately measure such anthropometric data. This paper proposes a feature extraction method for the ear image instead of measuring anthropometric pinna measurements directly. The proposed method utilizes the bottleneck features of a convolutional neural network (CNN) auto-encoder from the edge detected ear image. The proposed feature extraction method using the CNN-based auto-encoder will be incorporated into the HRTF estimation approach.

Authors: Lee, Geon Woo; Moon, Jung Min; Chun, Chan Jun; Kim, Hong Kook
Affiliations: Korea Institute of Civil Engineering and Building Technology (KICT), Goyang, Korea; Gwangju Institute of Science and Tech (GIST), Gwangju, Korea(See document for exact affiliation information.)
AES Convention: 144 (May 2018) Paper Number: 10023
Publication Date: May 14, 2018 Import into BibTeX
Subject: Posters: Spatial Audio
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=19419

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Learn more about the AES E-Library

E-Library Location: /conv/144/10023.pdf

Start a discussion about this paper!

AES E-Library

On the Use of Bottleneck Features of CNN Auto-Encoder for Personalized HRTFs

ABOUT AES

Contact Us