Perceptual Distortion-Rate Optimization of Long Term Prediction in MPEG AAC
Long Term Prediction (LTP) in MPEG Advanced Audio Coding (AAC) exploits inter-frame redundancies via predictive coding of the current frame, given previously reconstructed data. In particular, AAC Low Delay mandates LTP to exploit correlations that would otherwise be ignored due to its shorter frame size. The LTP parameters are typically selected by time-domain techniques that minimize the mean squared prediction error, a criterion mismatched with the ultimate perceptual criteria of audio coding. We thus propose a novel trellis-based approach that optimizes the LTP parameters, in conjunction with the quantization and coding parameters of the frame, explicitly in terms of the tradeoff between perceptual distortion and rate. A low-complexity "two-loop" search alternative to the trellis is also proposed. Objective and subjective results provide evidence for substantial gains.
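For orientation, the conventional time-domain selection that the abstract contrasts against can be sketched as follows. This is a minimal illustration of a mean-squared-error lag and gain search, not the trellis-based or "two-loop" method proposed in the paper, and not the procedure of any particular AAC encoder. The function name `ltp_search_mse`, its arguments, and the restriction of lags to at least one frame length are assumptions made for this sketch; gain quantization and the perceptual distortion measure are left out entirely.

```python
def ltp_search_mse(target, history, min_lag, max_lag):
    """Return (lag, gain, error) minimizing the squared prediction error.

    target  : samples of the current frame (list of floats)
    history : previously reconstructed samples, most recent last
    Assumption of this sketch: lags are at least len(target), so the
    predictor only reads samples that already exist in `history`.
    """
    n = len(target)
    energy = sum(x * x for x in target)
    # Start from "no prediction": the error equals the frame energy.
    best = (None, 0.0, energy)
    first = max(min_lag, n)
    last = min(max_lag, len(history))
    for lag in range(first, last + 1):
        start = len(history) - lag
        segment = history[start:start + n]
        cross = sum(x * p for x, p in zip(target, segment))
        seg_energy = sum(p * p for p in segment)
        if seg_energy == 0.0:
            continue  # a silent segment cannot predict anything
        # For a fixed lag, the least-squares gain is cross / seg_energy,
        # and the residual energy is energy - cross^2 / seg_energy.
        gain = cross / seg_energy
        error = energy - cross * cross / seg_energy
        if error < best[2]:
            best = (lag, gain, error)
    return best


# Usage: a period-50 signal whose current frame is a scaled continuation.
pattern = [float((k * k) % 11) for k in range(50)]
history = pattern * 8
target = [0.5 * pattern[j % 50] for j in range(100)]
lag, gain, error = ltp_search_mse(target, history, 100, 300)
```

A search of this kind looks only at waveform error, which is the mismatch the abstract points to: the parameters with the smallest squared error are not necessarily the ones giving the best perceptual distortion at a given rate.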