Bit-Rate Scalable Intraframe Sinusoidal Audio Coding Based on Rate-Distortion Optimization

Heusdens, Richard; Jensen, Jesper; Kleijn, W. Bastiaan; Kot, Valery; Niamut, Omar A.; Van De Par, Steven; Van Schijndel, Micholle H.

AES E-Library

Bit-Rate Scalable Intraframe Sinusoidal Audio Coding Based on Rate-Distortion Optimization

A coding methodology that aims at rate-distortion optimal sinusoid + noise coding of audio and speech signals is presented. The coder divides the input signal into variable-length time segments and distributes sinusoidal components over the segments such that the resulting distortion (as measured by a perceptual distortion measure) is minimized subject to a prespecified rate constraint. The coder is bit-rate scalable. For a given target bit budget it automatically adapts the segmentation and distribution of sinusoids in a rate-distortion optimal manner. The coder uses frequency-differential coding techniques in order to exploit intrasegment correlations for efficient quantization and encoding of the sinusoidal model parameters. This technique makes the coder more robust toward packet losses when used in a lossy-packet channel environment as compared to time-differential coding techniques, which are commonly used in audio or speech coders. In a subjective listening experiment the present coder showed similar or better performance than a set of four MPEG-4 coders operating at bit rates of 16, 24, 32, and 48 kbit/s, each of which was state of the art for the given target bit rate.

Authors: Heusdens, Richard; Jensen, Jesper; Kleijn, W. Bastiaan; Kot, Valery; Niamut, Omar A.; Van De Par, Steven; Van Schijndel, Micholle H.
Affiliations: Delft University of Technology, Delft, The Netherlands; Royal Institute of Technology, Stockholm, Sweden; Philips Research Laboratories, Eindhoven, The Netherlands(See document for exact affiliation information.)
JAES Volume 54 Issue 3 pp. 167-188; March 2006
Publication Date: March 15, 2006 Import into BibTeX
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=13673

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Learn more about the AES E-Library

E-Library Location: (CD JAES54) /jaes54/3/pg167.pdf

Start a discussion about this paper!

AES E-Library

Bit-Rate Scalable Intraframe Sinusoidal Audio Coding Based on Rate-Distortion Optimization

ABOUT AES

Contact Us