Perceptual Audio Modeling Based on Total Least Squares Algorithms
Total Least Squares (TLS) algorithms automatically decompose audio frames into a sum of exponentially damped sinusoids, giving an exponential sinusoidal model (ESM). This can be more efficient than plain sinusoidal modeling, especially for transitional frames. Straightforward implementations of TLS optimize an SNR criterion. In our implementation we apply TLS in a subband scheme in which the number of damped sinusoids depends on both the frame and the subband. This is made possible by perceptual information from psychoacoustic model 1 of MPEG-1. Experiments on different audio tracks provide a proof of concept for our perceptual ESM and illustrate the significant reduction in the number of modeling components compared to a non-perceptual ESM.
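To make the decomposition concrete, the following is a minimal sketch of a generic Hankel-based TLS fit of damped sinusoids to a single frame. It is not the authors' implementation: it works on the full band, takes the model order as a fixed input, and includes no subband split or psychoacoustic model. The function name `fit_damped_sinusoids`, the parameter `num_poles`, and the test frame are assumptions made for illustration only.

```python
import numpy as np


def fit_damped_sinusoids(x, num_poles):
    """Fit num_poles complex damped exponentials to a real frame x.

    Hypothetical helper: a generic Hankel TLS sketch, not the paper's
    perceptual subband scheme. A real damped sinusoid needs two poles.
    """
    x = np.asarray(x, dtype=float)
    n_samples = x.size
    n_rows = n_samples // 2
    n_cols = n_samples - n_rows + 1

    # Hankel data matrix: row i holds x[i], x[i+1], ..., x[i+n_cols-1].
    hankel = np.array([x[i:i + n_cols] for i in range(n_rows)])

    # Signal subspace: left singular vectors of the largest singular values.
    u, _, _ = np.linalg.svd(hankel, full_matrices=False)
    u_sig = u[:, :num_poles]
    u_up, u_down = u_sig[:-1], u_sig[1:]

    # TLS solution of u_up @ Z ~= u_down from the SVD of [u_up, u_down].
    _, _, vh = np.linalg.svd(np.hstack([u_up, u_down]))
    v = vh.conj().T
    v12 = v[:num_poles, num_poles:]
    v22 = v[num_poles:, num_poles:]
    z_mat = -v12 @ np.linalg.inv(v22)

    # Eigenvalues of Z are the poles z_k = exp(-damping_k + 1j * omega_k).
    poles = np.linalg.eigvals(z_mat)

    # Complex amplitudes by ordinary least squares on the exponential basis.
    basis = np.vander(poles, n_samples, increasing=True).T
    amps, *_ = np.linalg.lstsq(basis, x.astype(complex), rcond=None)
    return poles, amps, (basis @ amps).real


# Illustrative frame (assumed values): one real damped sinusoid.
n = np.arange(100)
frame = np.exp(-0.01 * n) * np.cos(0.3 * n + 0.5)
poles, amps, frame_hat = fit_damped_sinusoids(frame, num_poles=2)
damping = -np.log(np.abs(poles))   # per-sample damping factors
omega = np.angle(poles)            # frequencies in radians per sample
```

In a perceptual scheme of the kind the abstract describes, the fixed `num_poles` would be replaced by a model order chosen per frame and per subband; how that choice is made is not specified here and is left out of the sketch.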