We propose a new approach to achieve efficient scalability in audio coders, and demonstrate its performance using the MPEG-4 Advanced Audio Coder (AAC). In conventional scalable coding, the enhancement-layer performs straightforward re-quantization of the base-layer reconstruction error. This coding scheme implicitly discards useful information from the base-layer, and does not truly minimize a perceptually meaningful distortion criterion such as the noise-mask ratio. We reformulate the problem of scalable coding within a companding framework, and show that re-quantization in the compander's compressed domain achieves, in the asymptotic sense, optimal scalability. Based on this observation, we develop a scalable AAC coder which performs enhancement-layer quantization while exploiting all the information available at that layer. Simulation results of a two-layer scalable coder on the standard test database of 44.1kHz sampled audio show that the proposed approach yields substantial savings in bit rate for a given reproduction quality.
https://www.aes.org/e-lib/browse.cfm?elib=10011
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.
Learn more about the AES E-Library
Start a discussion about this paper!