The application of the matching pursuit algorithm for extracting sinusoidal components and transients from audio signals is proposed. The resulting residue is perceptually modelled as a noise like signal. This multi-part model (Sines + Transients + Noise) is used for audio coding purposes. First of all, an accurate detection of transients in audio signals is required. When a transient is detected, energy-adapted matching pursuits are accomplished using a wavelet-packet based dictionary and a dictionary of sinusoidal functions. Otherwise, the matching pursuit algorithm is only applied with the harmonic dictionary. In both cases, the resulting residue is then modelled as a noise-like signal using the Equivalent Rectangular Bandwidth (ERB) model. The parameters of this multi-part model are efficiently quantized, taking into account psycho-acoustical information, so as to assure high perceptual quality at low bit rates. The combination of these all ideas results in nearly transparent audio coding at binary rates lower than 32 kbps for most of the CD-quality one channel audio signals considered for testing.
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.