Extracting speech-specific characteristics from a signal such as spectral envelope and pitch is essential for parametrical speech processing. These characteristics are used in many speech applications including coding, parametrical text-to-speech synthesis, voice morphing, and others. This paper presents some original estimation techniques that extract these characteristics using a sinusoidal model of speech with instantaneous parameters. The analysis scheme consists of two steps: first the parameters of sinusoidal model are extracted from the signal, and then these parameters are transformed to required characteristics. Some evaluations of the presented techniques are carried out on synthetic and natural speech signals to show potential of the presented approach.
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.