Perceptual Audio Modeling Based on Total Least Squares Algorithms

Hermus, Kris; Verhelst, Werner; Wambacq, Patrick

AES E-Library

Perceptual Audio Modeling Based on Total Least Squares Algorithms

Total Least Squares (TLS) algorithms automatically decompose (audio) frames into a number of exponentially damped sinusoids. This can provide for more efficient modeling than plain sinusoidal modeling, especially in the case of transitional frames. Straightforward implementations of TLS optimize a SNR criterion. In our implementation we apply TLS in a subband scheme in which the number of damped sinusoids is both frame and subband dependent. This is made possible through the use of perceptual information provided by the MPEG-I psycho-acoustic model I. Experiments on different audio tracks provide proof of concept for our perceptual ESM, and illustrate the significant reduction in modeling components compared to a non-perceptual ESM.

Authors: Hermus, Kris; Verhelst, Werner; Wambacq, Patrick
Affiliations: Katholieke Universiteit Leuven, dept. ESAT - div. PSI, Leuven, BELGIUM ; Vrije Universiteit Brussel, dept. ETRO - div. DSSP, Brussels, BELGIUM(See document for exact affiliation information.)
AES Convention: 112 (April 2002) Paper Number: 5571
Publication Date: April 1, 2002 Import into BibTeX
Subject: Low Bit-Rate Audio Coding
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=11322

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Learn more about the AES E-Library

E-Library Location: (CD aes19) /pp2002/pp0205/000233.pdf

Start a discussion about this paper!

AES E-Library

Perceptual Audio Modeling Based on Total Least Squares Algorithms

ABOUT AES

Contact Us