Spatial Decomposition of Time-frequency Regions: Subbands or Sinusoids

Harma, Aki; Faller, Christof

AES E-Library

Spatial Decomposition of Time-frequency Regions: Subbands or Sinusoids

Techniques where a stereo or a multichannel signal is decomposed into spatial source-labeled time-frequency slots by level, time-difference, and coherence metrics have become popular in recent years. Good examples are binaural cue coding and up/downmixing techniques. In the article, we will provide an overview and discuss parallel approaches in the field of array processing and blind source separation. Typically, time-frequency slots are formed from subband representations of signals. However, it is also possible to produce a similar spatial decomposition for a parametric representation (sinusoids, transients, and noise) of a stereo or multichannel audio signal. Advantages and disadvantages of the two approaches for audio coding applications are discussed in this article.

Authors: Harma, Aki; Faller, Christof
Affiliations: Laboratory of Acoustics and Audio Signal Processing, Helsinki University of Technology, HUT, Finland ; Mobile Terminals Division, Agere Systems, Allentown, PA,(See document for exact affiliation information.)
AES Convention: 116 (May 2004) Paper Number: 6061
Publication Date: May 1, 2004 Import into BibTeX
Subject: Low bit-Rate Audio Coding
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=12740

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Learn more about the AES E-Library

E-Library Location: (CD 116Papers) /116/6061.pdf

Start a discussion about this paper!

AES E-Library

Spatial Decomposition of Time-frequency Regions: Subbands or Sinusoids

ABOUT AES

Contact Us