Exploiting Deep Neural Networks for Two-to-Five Channel Surround Decoder

Choi, Jeonghwan; Chang, Joon-Hyuk

AES E-Library

Exploiting Deep Neural Networks for Two-to-Five Channel Surround Decoder

We exploited deep neural networks (DNN) for two-to-five channel surround decoding. Specifically DNNs are used to replace the primary-ambient separation and ambient-signal-rendering modules. For the training, the mean-squared error of the magnitude spectra between the decoded and five-channel target signals and the interchannel level differences between the target signals were used as the loss functions. Through this procedure the DNNs can derive the spectral weights that can be used to produce the decoded signals, similar to that for the target signals. The log spectral distance, signal-to-distortion ratio, and multiple stimuli with hidden reference and anchor tests were used for objective and subjective evaluations. The experimental results show that exploiting the DNNs can generate decoded signals that are more similar to the target signals than those obtained via previous methods.

Authors: Choi, Jeonghwan; Chang, Joon-Hyuk
Affiliation: Hanyang University, Seoul, Republic of Korea
JAES Volume 68 Issue 12 pp. 938-949; December 2020
Publication Date: January 11, 2021 Import into BibTeX
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=21008

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Learn more about the AES E-Library

E-Library Location: (CD JAES68) /jaes68/12/pg938.pdf

DOI: https://doi.org/10.17743/jaes.2020.0020

Start a discussion about this paper!

AES E-Library

Exploiting Deep Neural Networks for Two-to-Five Channel Surround Decoder

ABOUT AES

Contact Us