Multichannel Audio Upmixing by Time-Frequency Filtering Using Non-Negative Tensor Factorization

Nikunen, Joonas; Virtanen, Tuomas; Vilermo, Miikka

AES E-Library

The expanding use of portable multimedia devices has intensified the need for scalable spatial audio coding (SAC) that matches the connectivity rate and multichannel playback capabilities of the receiving device. The proposed SAC method parameterizes multichannel audio as a linear combination of objects composed of fixed spectral bases with time-varying gains and channel-dependent spatial gains. The spatial parameters can be estimated from the original multichannel signal using psychoacoustic properties of sound source localization. The base audio can be monophonic or a stereophonic downmix. Listening tests showed that the proposed SAC algorithm matched the performance of conventional spatial audio coding methods at similar bit rates. The sound separation performance was also evaluated, and the method was found applicable to separating sound sources directly in the coding domain.
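The parameterization described in the abstract can be illustrated with a small sketch. This is not the authors' implementation. It assumes that each object has a single non-negative spectral basis, and it only shows how a magnitude spectrogram per channel would be rebuilt as a sum over objects of (spectral basis) x (time-varying gain) x (channel-dependent spatial gain). The function name `ntf_model` and the arrays `bases`, `gains`, and `spatial` are hypothetical names chosen for illustration, and the toy values are made up.

```python
def ntf_model(bases, gains, spatial):
    """Rebuild a model spectrogram from three non-negative factors.

    bases[f][k]   : fixed spectral basis of object k at frequency bin f
    gains[k][t]   : time-varying gain of object k at frame t
    spatial[c][k] : channel-dependent spatial gain of object k in channel c

    Returns x_hat[f][t][c] = sum_k bases[f][k] * gains[k][t] * spatial[c][k].
    (Hypothetical helper; the indexing convention is an assumption.)
    """
    n_freq = len(bases)
    n_obj = len(gains)
    n_frames = len(gains[0])
    n_ch = len(spatial)
    return [
        [
            [
                sum(bases[f][k] * gains[k][t] * spatial[c][k]
                    for k in range(n_obj))
                for c in range(n_ch)
            ]
            for t in range(n_frames)
        ]
        for f in range(n_freq)
    ]


# Toy data (made up): 2 frequency bins, 2 objects, 3 frames, 2 channels.
bases = [[1.0, 0.0],
         [0.0, 2.0]]
gains = [[1.0, 0.5, 0.0],
         [0.0, 1.0, 1.0]]
spatial = [[1.0, 0.25],
           [0.0, 0.75]]

x_hat = ntf_model(bases, gains, spatial)
```

In this toy setup, object 0 appears only in channel 0 and object 1 is panned mostly toward channel 1, so the spatial gains alone decide how each object's energy is distributed across channels.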

Authors: Nikunen, Joonas; Virtanen, Tuomas; Vilermo, Miikka
Affiliations: Tampere University of Technology, Tampere, Finland; Nokia Research Center, Tampere, Finland (see document for exact affiliation information)
JAES Volume 60 Issue 10 pp. 794-806; October 2012
Publication Date: November 26, 2012
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=16553

E-Library Location: (CD JAES60) /jaes60/10/pg794.pdf