Harmonic Representation and Auditory Model-Based Parametric Matching and Its Application in Speech/Audio Analysis

Petrovsky, Alexey; Azarov, Elias; Petrovsky, Alexander

AES E-Library

Harmonic Representation and Auditory Model-Based Parametric Matching and Its Application in Speech/Audio Analysis

The paper presents new methods for the selection of sinusoids and transients components in hybrid sinusoidal modeling of speech/audio. The instantaneous harmonic parameters (magnitude, frequency and phase) are calculated as the result of the narrow band filtering of speech/audio. The frequency-modulated filters synthesis with the closed form impulse response has been proposed. The filter frequency bounds can be determined during the components frequency tracking and can be adjusted according to the fundamental frequency modulations. It can be implemented speech/audio harmonic/noise decomposition. The transient components modeling are presented by matching pursuit with frame-based psychoacoustic optimized wavelet packet dictionary. The choice of most relevant coefficients is based on maximizing the matching between the auditory excitation scalograms of original and modeled signals.

Authors: Petrovsky, Alexey; Azarov, Elias; Petrovsky, Alexander
Affiliations: Bialystok Technical University, Bialystok, Poland; Belarusian State University of Informatics and Radioelectronics, Minsk, Belarus(See document for exact affiliation information.)
AES Convention: 126 (May 2009) Paper Number: 7705
Publication Date: May 1, 2009 Import into BibTeX
Subject: Audio for Telecommunications
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=14901

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Learn more about the AES E-Library

E-Library Location: (CD 126Papers) /126/7705.pdf

Start a discussion about this paper!

AES E-Library

Harmonic Representation and Auditory Model-Based Parametric Matching and Its Application in Speech/Audio Analysis

ABOUT AES

Contact Us