Creation of New Virtual Patterns for Emotion Recognition through PSOLA
×
Cite This
Citation & Abstract
I. Mohino-Herranz, HÉ. A.. Sánchez-Hevia, R. Gil-Pita, and M. Rosa-Zurera, "Creation of New Virtual Patterns for Emotion Recognition through PSOLA," Paper 9037, (2014 April.). doi:
I. Mohino-Herranz, HÉ. A.. Sánchez-Hevia, R. Gil-Pita, and M. Rosa-Zurera, "Creation of New Virtual Patterns for Emotion Recognition through PSOLA," Paper 9037, (2014 April.). doi:
Abstract: Human emotions can be recognized through speech analysis. One main problem of this discipline is the lack of databases with a sufficient number of patterns for a correct learning. This fact makes generalization in the learning process be more difficult. One possible solution is the creation of new virtual patterns, enlarging the training set. In order to carry out this enlargement, we modify the average pitch by using the technique known as Pitch Synchronous Overlap and Add combined with resampling, that allows to change the average pitch without altering neither the pitch variations nor the speech rate. Therefore, the emotion in the utterance is unaltered. Results over the original test set show that it is possible to achieve a significant reduction in the generalization effects with the proposed creation of new virtual training patterns.
@article{mohino-herranz2014creation,
author={mohino-herranz, inma and sánchez-hevia, héctor a. and gil-pita, roberto and rosa-zurera, manuel},
journal={journal of the audio engineering society},
title={creation of new virtual patterns for emotion recognition through psola},
year={2014},
volume={},
number={},
pages={},
doi={},
month={april},}
@article{mohino-herranz2014creation,
author={mohino-herranz, inma and sánchez-hevia, héctor a. and gil-pita, roberto and rosa-zurera, manuel},
journal={journal of the audio engineering society},
title={creation of new virtual patterns for emotion recognition through psola},
year={2014},
volume={},
number={},
pages={},
doi={},
month={april},
abstract={human emotions can be recognized through speech analysis. one main problem of this discipline is the lack of databases with a sufficient number of patterns for a correct learning. this fact makes generalization in the learning process be more difficult. one possible solution is the creation of new virtual patterns, enlarging the training set. in order to carry out this enlargement, we modify the average pitch by using the technique known as pitch synchronous overlap and add combined with resampling, that allows to change the average pitch without altering neither the pitch variations nor the speech rate. therefore, the emotion in the utterance is unaltered. results over the original test set show that it is possible to achieve a significant reduction in the generalization effects with the proposed creation of new virtual training patterns.},}
TY - paper
TI - Creation of New Virtual Patterns for Emotion Recognition through PSOLA
SP -
EP -
AU - Mohino-Herranz, Inma
AU - Sánchez-Hevia, Héctor A.
AU - Gil-Pita, Roberto
AU - Rosa-Zurera, Manuel
PY - 2014
JO - Journal of the Audio Engineering Society
IS -
VO -
VL -
Y1 - April 2014
TY - paper
TI - Creation of New Virtual Patterns for Emotion Recognition through PSOLA
SP -
EP -
AU - Mohino-Herranz, Inma
AU - Sánchez-Hevia, Héctor A.
AU - Gil-Pita, Roberto
AU - Rosa-Zurera, Manuel
PY - 2014
JO - Journal of the Audio Engineering Society
IS -
VO -
VL -
Y1 - April 2014
AB - Human emotions can be recognized through speech analysis. One main problem of this discipline is the lack of databases with a sufficient number of patterns for a correct learning. This fact makes generalization in the learning process be more difficult. One possible solution is the creation of new virtual patterns, enlarging the training set. In order to carry out this enlargement, we modify the average pitch by using the technique known as Pitch Synchronous Overlap and Add combined with resampling, that allows to change the average pitch without altering neither the pitch variations nor the speech rate. Therefore, the emotion in the utterance is unaltered. Results over the original test set show that it is possible to achieve a significant reduction in the generalization effects with the proposed creation of new virtual training patterns.
Human emotions can be recognized through speech analysis. One main problem of this discipline is the lack of databases with a sufficient number of patterns for a correct learning. This fact makes generalization in the learning process be more difficult. One possible solution is the creation of new virtual patterns, enlarging the training set. In order to carry out this enlargement, we modify the average pitch by using the technique known as Pitch Synchronous Overlap and Add combined with resampling, that allows to change the average pitch without altering neither the pitch variations nor the speech rate. Therefore, the emotion in the utterance is unaltered. Results over the original test set show that it is possible to achieve a significant reduction in the generalization effects with the proposed creation of new virtual training patterns.
Authors:
Mohino-Herranz, Inma; Sánchez-Hevia, Héctor A.; Gil-Pita, Roberto; Rosa-Zurera, Manuel
Affiliation:
Universidad de Alcalá, Alcalá de Henares, Madrid, Spain
AES Convention:
136 (April 2014)
Paper Number:
9037
Publication Date:
April 25, 2014Import into BibTeX
Subject:
Signal Processing
Permalink:
http://www.aes.org/e-lib/browse.cfm?elib=17184