Speech Analysis Based on Sinusoidal Model with Time-Varying Parameters

Azarov, Elias; Vashkevich, Maxim; Petrovsky, Alexander

AES E-Library

Extracting speech-specific characteristics such as the spectral envelope and pitch from a signal is essential for parametric speech processing. These characteristics are used in many speech applications, including coding, parametric text-to-speech synthesis, and voice morphing. This paper presents original estimation techniques that extract these characteristics using a sinusoidal model of speech with instantaneous parameters. The analysis scheme consists of two steps: first, the parameters of the sinusoidal model are extracted from the signal; then these parameters are transformed into the required characteristics. The presented techniques are evaluated on synthetic and natural speech signals to show the potential of the approach.

Authors: Azarov, Elias; Vashkevich, Maxim; Petrovsky, Alexander
Affiliation: Belarusian State University of Informatics and Radioelectronics, Minsk, Belarus
AES Convention: 138 (May 2015)
Paper Number: 9267
Publication Date: May 6, 2015
Subject: Audio Signal Processing
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=17691

E-Library Location: (CD 138Papers) /conv/138/9267.pdf
