Multichannel Audio Upmixing by Time-Frequency Filtering Using Non-Negative Tensor Factorization
×
Cite This
Citation & Abstract
J. Nikunen, T. Virtanen, and M. Vilermo, "Multichannel Audio Upmixing by Time-Frequency Filtering Using Non-Negative Tensor Factorization," J. Audio Eng. Soc., vol. 60, no. 10, pp. 794-806, (2012 October.). doi:
J. Nikunen, T. Virtanen, and M. Vilermo, "Multichannel Audio Upmixing by Time-Frequency Filtering Using Non-Negative Tensor Factorization," J. Audio Eng. Soc., vol. 60 Issue 10 pp. 794-806, (2012 October.). doi:
Abstract: The expanding use of portable multimedia devices has intensified the need for better forms of scalable spatial audio coding (SAC) that match the connectivity rate and multichannel playback capabilities of the receiving device. A new SAC method is based on the parameterization of multichannel audio by representing it as a linear combination of objects composed of fixed spectral bases with time-varying gain and channel-dependent spatial gain. Spatial parameters can be estimated from the original multichannel signal using psychoacoustic properties of sound source localization. The base audio can be monophonic or downmixed stereophonic. Listening tests showed that the proposed SAC algorithm achieved the performance of conventional spatial audio coding methods with similar bit rates. The sound separation performance was evaluated and found applicable for separating sound sources in the coding domain directly.
@article{nikunen2012multichannel,
author={nikunen, joonas and virtanen, tuomas and vilermo, miikka},
journal={journal of the audio engineering society},
title={multichannel audio upmixing by time-frequency filtering using non-negative tensor factorization},
year={2012},
volume={60},
number={10},
pages={794-806},
doi={},
month={october},}
@article{nikunen2012multichannel,
author={nikunen, joonas and virtanen, tuomas and vilermo, miikka},
journal={journal of the audio engineering society},
title={multichannel audio upmixing by time-frequency filtering using non-negative tensor factorization},
year={2012},
volume={60},
number={10},
pages={794-806},
doi={},
month={october},
abstract={the expanding use of portable multimedia devices has intensified the need for better forms of scalable spatial audio coding (sac) that match the connectivity rate and multichannel playback capabilities of the receiving device. a new sac method is based on the parameterization of multichannel audio by representing it as a linear combination of objects composed of fixed spectral bases with time-varying gain and channel-dependent spatial gain. spatial parameters can be estimated from the original multichannel signal using psychoacoustic properties of sound source localization. the base audio can be monophonic or downmixed stereophonic. listening tests showed that the proposed sac algorithm achieved the performance of conventional spatial audio coding methods with similar bit rates. the sound separation performance was evaluated and found applicable for separating sound sources in the coding domain directly.},}
TY - paper
TI - Multichannel Audio Upmixing by Time-Frequency Filtering Using Non-Negative Tensor Factorization
SP - 794
EP - 806
AU - Nikunen, Joonas
AU - Virtanen, Tuomas
AU - Vilermo, Miikka
PY - 2012
JO - Journal of the Audio Engineering Society
IS - 10
VO - 60
VL - 60
Y1 - October 2012
TY - paper
TI - Multichannel Audio Upmixing by Time-Frequency Filtering Using Non-Negative Tensor Factorization
SP - 794
EP - 806
AU - Nikunen, Joonas
AU - Virtanen, Tuomas
AU - Vilermo, Miikka
PY - 2012
JO - Journal of the Audio Engineering Society
IS - 10
VO - 60
VL - 60
Y1 - October 2012
AB - The expanding use of portable multimedia devices has intensified the need for better forms of scalable spatial audio coding (SAC) that match the connectivity rate and multichannel playback capabilities of the receiving device. A new SAC method is based on the parameterization of multichannel audio by representing it as a linear combination of objects composed of fixed spectral bases with time-varying gain and channel-dependent spatial gain. Spatial parameters can be estimated from the original multichannel signal using psychoacoustic properties of sound source localization. The base audio can be monophonic or downmixed stereophonic. Listening tests showed that the proposed SAC algorithm achieved the performance of conventional spatial audio coding methods with similar bit rates. The sound separation performance was evaluated and found applicable for separating sound sources in the coding domain directly.
The expanding use of portable multimedia devices has intensified the need for better forms of scalable spatial audio coding (SAC) that match the connectivity rate and multichannel playback capabilities of the receiving device. A new SAC method is based on the parameterization of multichannel audio by representing it as a linear combination of objects composed of fixed spectral bases with time-varying gain and channel-dependent spatial gain. Spatial parameters can be estimated from the original multichannel signal using psychoacoustic properties of sound source localization. The base audio can be monophonic or downmixed stereophonic. Listening tests showed that the proposed SAC algorithm achieved the performance of conventional spatial audio coding methods with similar bit rates. The sound separation performance was evaluated and found applicable for separating sound sources in the coding domain directly.
Authors:
Nikunen, Joonas; Virtanen, Tuomas; Vilermo, Miikka
Affiliations:
Tampere University of Technology, Tampere, Finland; Nokia Research Center, Tampere, Finland(See document for exact affiliation information.) JAES Volume 60 Issue 10 pp. 794-806; October 2012
Publication Date:
November 26, 2012Import into BibTeX
Permalink:
http://www.aes.org/e-lib/browse.cfm?elib=16553