The performance of perceptual audio coders depends on the efficiency of the quantization operation in masking the quantization noise under the audio signal. This objective is better addressed by coding separately different signal components such as sinusoids, transients and stationary noise. In this paper we use an audio coder that normalizes the MDCT spectrum by a smooth spectral envelope and by periodicities due to sinusoids. The resulting flattened MDCT coefficients are shown to exhibit a probability density function with small uncertainty allowing the design of an optimum non-uniform scalar quantizer. Its distortion--rate function is derived, is compared to that of of known quantizers, and compared to that obtained under real audio coding conditions.
https://www.aes.org/e-lib/browse.cfm?elib=12378
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.
Learn more about the AES E-Library
Start a discussion about this paper!