Jointly Optimal Time Segmentation, Distribution and Quantisation for Sinusoidal Audio Coding
In this paper we propose a rate-distortion optimal algorithm for sinusoidal coding of audio and speech. The algorithm determines for a pre-specified target bit-rate the optimal (variable-length) time segmentation, the optimal distribution of sinusoidal components over the segments and the optimal (scalar) quantisers for quantising the sinusoid parameters amplitude, phase and frequency. The optimisation is done by jointly optimising the segment lengths, number of sinusoids and quantisers using high-resolution quantisation theory and dynamic programming techniques, which makes it possible to execute the algorithm in polynomial time. A particular advantage of the proposed method is that, given a target bit-rate, it solves the problem of finding the optimal balance between total number of sinusoids and number of bits per sinusoid.
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is temporarily free for AES members.