Monaural Speech Source Separation by Estimating the Power Spectrum Using Multi-Frequency Harmonic Product Spectrum

×

Cite This

Citation & Abstract

D. Ayllon, R. Gil-Pita, and M. Rosa-Zurera, "Monaural Speech Source Separation by Estimating the Power Spectrum Using Multi-Frequency Harmonic Product Spectrum," Paper 8832, (2013 May.). doi:
D. Ayllon, R. Gil-Pita, and M. Rosa-Zurera, "Monaural Speech Source Separation by Estimating the Power Spectrum Using Multi-Frequency Harmonic Product Spectrum," Paper 8832, (2013 May.). doi:
Abstract: This paper proposes an algorithm to perform monaural speech source separation by means of time-frequency masking. The algorithm is based on the estimation of the power spectrum of the original speech signals as a combination of a carrier signal multiplied by an envelope. A Multi-Frequency Harmonic Product Spectrum (MF-HPS) algorithm is used to estimate the fundamental frequency of the signals in the mixture. These frequencies are used to estimate both the carrier and the envelope from the mixture. Binary masks are generated comparing the estimated spectra of the signals. Results show an important improvement in the separation in comparison to the original algorithm that only uses the information from the HPS.

@article{ayllon2013monaural,
author={ayllon, david and gil-pita, roberto and rosa-zurera, manuel},
journal={journal of the audio engineering society},
title={monaural speech source separation by estimating the power spectrum using multi-frequency harmonic product spectrum},
year={2013},
volume={},
number={},
pages={},
doi={},
month={may},}
@article{ayllon2013monaural,
author={ayllon, david and gil-pita, roberto and rosa-zurera, manuel},
journal={journal of the audio engineering society},
title={monaural speech source separation by estimating the power spectrum using multi-frequency harmonic product spectrum},
year={2013},
volume={},
number={},
pages={},
doi={},
month={may},
abstract={this paper proposes an algorithm to perform monaural speech source separation by means of time-frequency masking. the algorithm is based on the estimation of the power spectrum of the original speech signals as a combination of a carrier signal multiplied by an envelope. a multi-frequency harmonic product spectrum (mf-hps) algorithm is used to estimate the fundamental frequency of the signals in the mixture. these frequencies are used to estimate both the carrier and the envelope from the mixture. binary masks are generated comparing the estimated spectra of the signals. results show an important improvement in the separation in comparison to the original algorithm that only uses the information from the hps.},}

TY - paper
TI - Monaural Speech Source Separation by Estimating the Power Spectrum Using Multi-Frequency Harmonic Product Spectrum
SP -
EP -
AU - Ayllon, David
AU - Gil-Pita, Roberto
AU - Rosa-Zurera, Manuel
PY - 2013
JO - Journal of the Audio Engineering Society
IS -
VO -
VL -
Y1 - May 2013
TY - paper
TI - Monaural Speech Source Separation by Estimating the Power Spectrum Using Multi-Frequency Harmonic Product Spectrum
SP -
EP -
AU - Ayllon, David
AU - Gil-Pita, Roberto
AU - Rosa-Zurera, Manuel
PY - 2013
JO - Journal of the Audio Engineering Society
IS -
VO -
VL -
Y1 - May 2013
AB - This paper proposes an algorithm to perform monaural speech source separation by means of time-frequency masking. The algorithm is based on the estimation of the power spectrum of the original speech signals as a combination of a carrier signal multiplied by an envelope. A Multi-Frequency Harmonic Product Spectrum (MF-HPS) algorithm is used to estimate the fundamental frequency of the signals in the mixture. These frequencies are used to estimate both the carrier and the envelope from the mixture. Binary masks are generated comparing the estimated spectra of the signals. Results show an important improvement in the separation in comparison to the original algorithm that only uses the information from the HPS.

This paper proposes an algorithm to perform monaural speech source separation by means of time-frequency masking. The algorithm is based on the estimation of the power spectrum of the original speech signals as a combination of a carrier signal multiplied by an envelope. A Multi-Frequency Harmonic Product Spectrum (MF-HPS) algorithm is used to estimate the fundamental frequency of the signals in the mixture. These frequencies are used to estimate both the carrier and the envelope from the mixture. Binary masks are generated comparing the estimated spectra of the signals. Results show an important improvement in the separation in comparison to the original algorithm that only uses the information from the HPS.

Authors:
Ayllon, David; Gil-Pita, Roberto; Rosa-Zurera, Manuel
Affiliation:
University of Alcala, AlcalĂˇ de Henares, Spain
AES Convention:
134 (May 2013)
Paper Number:
8832
Publication Date:
May 4, 2013Import into BibTeX
Subject:
Speech Processing
Permalink:
http://www.aes.org/e-lib/browse.cfm?elib=16733