AES E-Library

Energy-adapted Matching Pursuits in Multi-parts Models for Audio Coding Purposes

The application of the matching pursuit algorithm for extracting sinusoidal components and transients from audio signals is proposed. The resulting residue is perceptually modelled as a noise like signal. This multi-part model (Sines + Transients + Noise) is used for audio coding purposes. First of all, an accurate detection of transients in audio signals is required. When a transient is detected, energy-adapted matching pursuits are accomplished using a wavelet-packet based dictionary and a dictionary of sinusoidal functions. Otherwise, the matching pursuit algorithm is only applied with the harmonic dictionary. In both cases, the resulting residue is then modelled as a noise-like signal using the Equivalent Rectangular Bandwidth (ERB) model. The parameters of this multi-part model are efficiently quantized, taking into account psycho-acoustical information, so as to assure high perceptual quality at low bit rates. The combination of these all ideas results in nearly transparent audio coding at binary rates lower than 32 kbps for most of the CD-quality one channel audio signals considered for testing.

 

Author (s):
Affiliation: (See document for exact affiliation information.)
AES Convention: Paper Number:
Publication Date:
Session subject:

DOI:


Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Type:
16938
Choose your country of residence from this list: